>CYP34A8
B0213.14 499 aa
Length = 499
Score = 2442 (859.6 bits), Expect =
4.5e-257, P = 4.5e-257
Identities = 484/499 (96%), Positives =
484/499 (96%)
Query: 1
MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60
MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK
Sbjct: 1
MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60
Query: 61
QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120
QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF
Sbjct: 61
QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120
Query: 121
WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180
WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG
Sbjct: 121
WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180
Query: 181
SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240
SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP
Sbjct: 181
SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240
Query: 241
---------------VAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFTYVLSII 285
VAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFT
Sbjct: 241 FDFIFELGNRGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFT------
294
Query: 286
RKFSIACFSLETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT 345
LETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT
Sbjct: 295
---------LETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT 345
Query: 346
RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML 405
RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML
Sbjct: 346
RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML 405
Query: 406
HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD 465
HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD
Sbjct: 406
HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD 465
Query: 466 LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ
499
LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ
Sbjct: 466
LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ 499
Compare to seq below
>CYP34A7
B0213.12 499 aa
Length = 499
Score = 2193 (772.0 bits), Expect =
1.1e-230, P = 1.1e-230
Identities = 405/498 (81%), Positives =
456/498 (91%)
Query: 1
MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60
ML+ILIL+A+AAVLT NLWRARQKLP GPTPLP+IGNFHQLFY WK G LVA F++ +K
Sbjct: 1
MLIILILVAIAAVLTVNLWRARQKLPNGPTPLPIIGNFHQLFYNGWKYGGLVAGFDQFRK 60
Query: 61
QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120
QYGKVFTVWMGP P+V ICD+D+AHETHVK+A+ FG RY+ G MEYIREG+GIIGSNGDF
Sbjct: 61
QYGKVFTVWMGPIPAVQICDFDVAHETHVKKAHTFGHRYTFGAMEYIREGKGIIGSNGDF 120
Query: 121
WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180
WLEHRRFALMTLRNFG+GR I+EDKIM+EYRYRF+DFK+T+FK+GAIQVNASS+FDLLVG
Sbjct: 121
WLEHRRFALMTLRNFGLGRNIIEDKIMEEYRYRFEDFKKTNFKDGAIQVNASSLFDLLVG 180
Query: 181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP
240
SIINQLLVSERFEQDD+EFE+LKT+LA ALEN SIIEG LPLW+LKS MKWRTK TFAP
Sbjct: 181
SIINQLLVSERFEQDDEEFEELKTNLAMALENGSIIEGVLPLWMLKSRFMKWRTKTTFAP 240
Query: 241 FDFIFELGNRGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFTLETLAI
300
FDF+FE+G +GIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKK+GIDSSFTLETLA+
Sbjct: 241 FDFVFEVGKKGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKDGIDSSFTLETLAV
300
Query: 301
DLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGTRGVSLTDRTKTPYLN 360
DLFDLWQAGQETTSTTLTWAC CLLNHPEVVEKLRKELTEVTGG RGVSLTDRTKTPYLN
Sbjct: 301
DLFDLWQAGQETTSTTLTWACACLLNHPEVVEKLRKELTEVTGGARGVSLTDRTKTPYLN 360
Query: 361
ANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSMLHTDEEIFKNPQEFRP 420
A INE QRI+SILNVNL R+LEED
ID PVPAG TT L++LHTDEE FKN +EF P
Sbjct: 361
ATINEVQRISSILNVNLLRILEEDAVIDGHPVPAGTAFTTQLALLHTDEETFKNHKEFIP 420
Query: 421
ERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYDLEPVGTLPKIETTTP 480
ERF+ENNNLEKRLIPFGIGKR+CPGESLA+AEL+LI GN+++D+DL+PVG +PKIE+ TP
Sbjct: 421
ERFLENNNLEKRLIPFGIGKRSCPGESLAKAELYLIIGNLVIDFDLKPVGAIPKIESPTP 480
Query: 481 FAPMKRPPVYDIRFVPRS 498
F+P+KRPPVYDIRF+ R+
Sbjct: 481 FSPVKRPPVYDIRFISRA 498
>CYP23A1
B0304.3 U39472 529 aa
(revised 12/8/2003)
Length = 529
Score = 2756 (970.2 bits), Expect =
2.4e-290, P = 2.4e-290
Identities = 529/534 (99%), Positives =
529/534 (99%)
Query: 1
MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60
MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP
Sbjct: 1
MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60
Query: 61
VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120
VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD
Sbjct: 61
VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120
Query: 121
SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180
SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS
Sbjct: 121
SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180
Query: 181
LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240
LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV
Sbjct: 181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV
240
Query: 241
MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300
MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED
Sbjct: 241
MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300
Query: 301
LTYAYMIEVEKRKRNGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 360
LTYAYMIEVEKRKRNG
FDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ
Sbjct: 301
LTYAYMIEVEKRKRNG-----FDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 355
Query: 361
KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 420
KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK
Sbjct: 356
KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 415
Query: 421 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR
480
KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR
Sbjct: 416
KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 475
Query: 481 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK
534
AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK
Sbjct: 476
AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 529
Compare to the sequence below:
>cb25.fpc3052
ACC=CAAC01000064 86% to CYP23A1
Length = 534
Score = 2430 (855.4 bits), Expect =
8.4e-256, P = 8.4e-256
Identities = 454/534 (85%), Positives =
493/534 (92%)
Query: 1
MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60
MPS++ AILL++ T+FLYFRIM++
D+Y TYIY C Y YE
NYKRRRLPNGP
Sbjct: 1
MPSIKYAILLSVVTSFLYFRIMKSFEMDSYTTTYIYLFFCTISYIFYESNYKRRRLPNGP 60
Query: 61
VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120
+PWLVAGNMPSFINV NVD LF
WKQ+YGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD
Sbjct: 61
MPWLVAGNMPSFINVKNVDDLFLYWKQRYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120
Query: 121
SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180
+FSNRWRNFVTDSIMEGSNGIVQIDG+KWREQRRFALHTLRDFGVG+PLMEQMITLEVT+
Sbjct: 121 AFSNRWRNFVTDSIMEGSNGIVQIDGDKWREQRRFALHTLRDFGVGRPLMEQMITLEVTT
180
Query: 181
LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240
LMNHM KSCGL
KE++LCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLH LLDDQSHTV
Sbjct: 181
LMNHMAKSCGLSTKEVNLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHSLLDDQSHTV 240
Query: 241
MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300
MQPIMGAYIAFPVT+K+P INGEWNRLMGIK ELLEFLE QI+ HR NWK+EM+EQEPED
Sbjct: 241
MQPIMGAYIAFPVTTKVPFINGEWNRLMGIKKELLEFLEGQIQKHRENWKEEMMEQEPED 300
Query: 301 LTYAYMIEVEKRKRNGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ
360
LTYAYMIEVEKRKR GEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLM+KN
+VQ
Sbjct: 301 LTYAYMIEVEKRKRAGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMSKNPRVQ
360
Query: 361 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK
420
+ VQ ELDSI QPM+EIQHRTRLPY+QATINEIQRIANILPINLLRTVAEDI+IDGY FK
Sbjct: 361
RKVQEELDSIAQPMVEIQHRTRLPYIQATINEIQRIANILPINLLRTVAEDIDIDGYQFK 420
Query: 421 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR
480
KGDL IPQISILMNDPEIF+NP++F P RFLDE+ NVKKIDEFLPFSIGRRQCLGESLAR
Sbjct: 421
KGDLTIPQISILMNDPEIFKNPKDFCPERFLDENLNVKKIDEFLPFSIGRRQCLGESLAR 480
Query: 481
AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 534
AELYL+FANL+QNF FEV++DVTTERVLGLTVSPV+Y+CKI+RRGLDHN+N VK
Sbjct: 481
AELYLIFANLMQNFKFEVSEDVTTERVLGLTVSPVQYTCKISRRGLDHNENCVK 534
>CYP29A4
B0331 500 aa 4 small changes,
probably errors in my old sequence
Length = 500
Score = 2532 (891.3 bits), Expect =
1.3e-266, P = 1.3e-266
Identities = 481/486 (98%), Positives =
481/486 (98%)
Query: 1
MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKTTKDFV 60
MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKT DFV
Sbjct: 1
MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKTXSDFV 60
Query: 61
ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG 120
ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG
Sbjct: 61 ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG
120
Query: 121
GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSESKILIDCLEKIAETQETVDLFPF 180
GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSES ILIDCLEKIAETQETVDLFPF
Sbjct: 121
GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSES-ILIDCLEKIAETQETVDLFPF 179
Query: 181
FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTVEYSLNPFLWNRFVYWALGYQ 240
FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTV YSLNPFLWNRFVYWALGYQ
Sbjct: 180
FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTVSYSLNPFLWNRFVYWALGYQ 239
Query: 241
KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR 300
KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR
Sbjct: 240
KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR 299
Query: 301 KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE
360
KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE
Sbjct: 300
KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE 359
Query: 361 YTERVLKESKRMFPPVPGFQRKLTKDIVIDGITIPSEGNITISPTVLHCNPFVYQNPEKF
420
YTERVLKESKRMFPPVPGFQRKLTKDIVI GITIPSEGNITISPTVLHCNPFVYQNPEKF
Sbjct: 360 YTERVLKESKRMFPPVPGFQRKLTKDIVIGGITIPSEGNITISPTVLHCNPFVYQNPEKF
419
Query: 421
DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET 480
DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET
Sbjct: 420
DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET 479
Query: 481 KPLFEV 486
KPLFEV
Sbjct: 480 KPLFEV 485
C03G6.14;
CE07888. This gene is badly
misassembled
MFLVLIFLALSCWLIIRQYQKVSRLPPGP
skipped sequence here
KYGNIFTLWVGPVPHVSICDY 50
ETSHEVFVKGANKYADIAHAPLFRELRQEMGVLVTNGSHWSTMKRFALHT
100
FRDMGVGKDLMETRIMEELDARCADTDKSATDGVTVAQAGDFFDLTVGSI
150
INSILVGKRFEEHNKDDFLKIKEAMGAAFEVFSPFDMAVPVWFLRTFFRS
200
RYDMMMTTQNTAKRFAAAEAVK
skipped seq here
SIETLKTMIIDLWMTGQETTTTTLISGF
250
TQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLNAMIGEIQRHA
300
SILNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDP
350
ERFIRDEELLQKVIPFGVGKRSCIGESLARAELYL
C-term is wrong seq, but my C-term may also be wrong
ISLLCICGTSNKSFLHKYLPLWNSVALLLPVKLSRPAFAALISSLPLDKLLPHI
>C03G6.14 C13B3.B CYP35A1 502 aa
yellow is
at an intron boundary
MFLVLIFLALSCWLIIRQYQKVSRLPPGPVSFPIIGNLPHIIYYLWATGGIVS
TLDLFRKKYGNIFTLWVGPVPHVSICDYETSHEVFVKGANKYADIAHAPLFR
ELRRW*MGVLVTNGSHWSTMKRFALHTFRDMGVGKDLMETRIMEELDARC
ADTDKSATDGVTVAQAGDFFDLTVGSIINSILVGKRFEEHNKDDFLKIKEAMG
AAFEVFSPFDMAVPVWFLRTFFRSRYDMMMTTQNTAKRFAAAEAVKRIED
IKSGAYEIDESNIEDYTDAFLLKIQKDGEDLDFNIETLKTMIIDLWMTGQETTTT
TLISGFTQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLNAMIGEIQRHASI
LNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDPERFIRDEE
LLQKVIPFGVGKRSCIGESLARAELYL
VRHLQYRISNFLFRLLGTFCSAINSNRMEH
CRQQSCCRIAPEKDHSSWK*
18318
IIGNLLLRYKFEPHGTLSTTELLPYSAGKRPFKLEMKFVKI 18196 correct C-term
Compare my
35A1 assembly to 35A2 below
>CYP35A2
C03G6.15 C13B3.A 495
aa
Length = 496
Score = 2005 (705.8 bits), Expect =
9.2e-211, P = 9.2e-211
Identities = 386/505 (76%), Positives =
431/505 (85%)
Query: 1
MFLVLIFLALSCWLIIRQYQKVSRLPPGPVSFPIIGNLPHIIYYLWATGGIVSTLDLFRK
60
MF VL F L +LI+RQYQKVSRLPPGP+S P+IGNLP IIYYLW+TGGIVSTLDLFRK
Sbjct: 1
MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK
60
Query: 61
KYGNIFTLWVGPVPHVSICDYETSHEVFVKGANKYADIAHAPLFRELRRW*MGVLVTNGS 120
+YGNIFTLWVGP+PHVSI
DYETSHEVFVK A KYAD HAP+ R++R +GVL+TNG
Sbjct: 61
RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSK-IGVLITNGD 119
Query: 121
HWSTMKRFALHTFRDMGVGKDLMETRIMEELDARCADTDKSATDGVTVAQAGDFFDLTVG 180
HW M+RF+L FR+MGVGKD+METRIMEELDARC+D DK
AT+GVT+ A +FFDLTVG
Sbjct: 120
HWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVG 179
Query: 181
SIINSILVGKRFEEHNKDDFLKIKEAMGAAFEVFSPFDMAVPVWFLRTFFRSRYDMMMTT 240
SIINSILVGKRFEE K +FLKIKE M
A+FE FSPFDM PVWFL+TFF+ RYD + +
Sbjct: 180
SIINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSA 239
Query: 241 QNTAKRFAAAEAVKRIEDIKSGAYEIDESNIEDYTDAFLLKIQKDGEDLDFNIETLKTM
300
Q TAK FAAAEA+KR+E IKSG
Y IDE+N++DYTDAFLLKIQK+GE DFNIETLKTM
Sbjct: 240 QETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTM
298
Query: 301
IIDLWMTGQETTTTTLISGFTQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLN 360
IIDLWMTGQETTTTTLISGF QLLLHPEVM+KAREEILKITENGSRHLSLTDRTSTPY+N
Sbjct: 299 IIDLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVN
358
Query: 361
AMIGEIQRHASILNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDP 420
A+IGEIQRHASILNVSFWKINKE TYMGGHPVDAGALVT+QLSALHVN+T+FKNPQEF+P
Sbjct: 359
AVIGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNP 418
Query: 421
ERFIRDEELLQKVIPFGVGKRSCIGESLARAELYLVRHLQYRISNFLFRLLGTFCSAINS 480
ERFIRD +LLQKVIPFGVGKR+C+GESLA+AELYLVR + YR S TF A NS
Sbjct: 419
ERFIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLVRFVFYRSSV-------TFSFATNS 471
Query: 481 NRMEHCRQQSCCRIAPEKDHSSWK* 505
N M + S C A EKDHSSWK*
Sbjct: 472 NNMANYPLPSLCHTALEKDHSSWK* 496
>CYP35A2
C03G6.15 C13B3.A 495
aa
Length = 496
Score = 2342 (824.4 bits), Expect =
1.8e-246, P = 1.8e-246
Identities = 451/454 (99%), Positives =
453/454 (99%)
My C-term
may be wrong see below. By
comparison to the briggsae seqs
It looks
like all my C-term exons for elegans 35As are wrong. They may be in an alternative reading frame from the correct
sequences.
Query: 1
MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60
MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK
Sbjct: 1
MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60
Query: 61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVLITNGDH 120
RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVR+ IGVLITNGDH
Sbjct: 61
RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSKIGVLITNGDH 120
Query: 121
WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180
WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS
Sbjct: 121
WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180
Query: 181
IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240
IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ
Sbjct: 181
IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240
Query: 241
ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300
ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII
Sbjct: 241
ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300
Query: 301
DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360
DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV
Sbjct: 301
DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360
Query: 361
IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420
IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER
Sbjct: 361
IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420
Query: 421
FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLI 454
FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYL+
Sbjct: 421
FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLV 454
>C03G6.15 C13B3.A CYP35A2 495 aa
yellow is
at intron boundary
MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVST
LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMR
DVRSKIGVLITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD
IDKLATNGVTITHASEFFDLTVGSIINSILVGKRFEEDTKHEFLKIKETMDASF
ETFSPFDMTAPVWFLKTFFKHRYDKIWSAQETAKNFAAAEAIKRVESIKSGKYVID
ENNLQDYTDAFLLKIQKEGESKDFNIETLKTMIIDLWMTGQETTTTTLISGFNQL
LLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAVIGEIQRHASILNVSFWKIN
KEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPERFIRDGKLLQKVIPF
GVGKRNCLGESLAKAELYL
VRFVFYRSSVTFSFATNSNNMANYPLPSLCHTALEKDHSSWK*
Probable
correct C-term =
IFGNLLLRYKFEQHGKLSTTELMPYSAGKRPFKLEMKFVKI*
cDNA
evidence
>gi|30742705|gb|CB400978.1|CB400978 UniGene info OSTF185G5_1 AD-wrmcDNA
Caenorhabditis elegans cDNA.
Length = 544
Score = 210 bits (534), Expect = 7e-56
Identities = 100/102 (98%), Positives =
101/102 (99%)
Frame = +3
Query:
1
LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSKIGVL 60
LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVR+ IGVL
Sbjct: 135
LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVL 314
Query:
61
ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD 102
ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD
Sbjct: 315
ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD 440
>CYP35A.e
cb25.fpc0023 76% to CYP35A.b briggsae seq on bottom
Length = 495
Score = 2054 (723.0 bits), Expect =
5.9e-216, P = 5.9e-216
Identities = 379/494 (76%), Positives =
445/494 (90%)
Query: 1
MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60
MFFVL L
+L+VRQYQKVSRLPPGP+SLPLIGNLPQI+YYL++TGG+VSTLD FRK
Sbjct: 1
MFFVLIVFTFLTWLVVRQYQKVSRLPPGPVSLPLIGNLPQIVYYLYTTGGVVSTLDFFRK 60
Query: 61
RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVLITNGDH 120
RYGNIFTLWVGPIPHVSIADYETSHEVFVKNA KYADKFH+P+ R++R+ G+L NGDH
Sbjct: 61
RYGNIFTLWVGPIPHVSIADYETSHEVFVKNANKYADKFHSPIFREMRSKRGILTANGDH 120
Query: 121
WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180
WQEMRRF+L
FRNMGVGKD+METRIMEEL+ARC+DIDK A NGVT T A+EFFDLTVGS
Sbjct: 121
WQEMRRFALFTFRNMGVGKDLMETRIMEELNARCADIDKAAVNGVTTTQAAEFFDLTVGS 180
Query: 181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ
240
IINSILVGKRFEE K++FLKIKE
MDASFETFSPFDMT PVW LK FF R++K+ + Q
Sbjct: 181
IINSILVGKRFEEHNKNDFLKIKEVMDASFETFSPFDMTMPVWILKNFFPRRFEKMRNGQ 240
Query: 241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII
300
E++K FAA EA+KR++ IK+G+YVIDENN QDYTDAFL K+QK+GE++D+ +E+LKTMI+
Sbjct: 241
ESSKQFAAKEALKRIDEIKAGRYVIDENNFQDYTDAFLWKMQKDGENEDYKVESLKTMIL 300
Query: 301
DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360
DLW+TGQETTTTTLISGFNQLLLH + M KAR E+ +ITENGSR++SL+DR TPY+NAV
Sbjct: 301
DLWITGQETTTTTLISGFNQLLLHSKFMEKARAELFEITENGSRNVSLSDRPKTPYLNAV 360
Query: 361
IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420
IGEIQRHASILNV+FW+ N E TYMGGH
VD+GALVT+QLSALHVNETVF++P++F+PER
Sbjct: 361
IGEIQRHASILNVNFWRWNNEPTYMGGHMVDSGALVTAQLSALHVNETVFEHPEKFDPER 420
Query: 421
FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLIFGNLLLRYKFEQHGKLSTTELMPYSA 480
FIRD KLLQKVIPFG+GKR+CLGESLA++ELYLIFGNLLLRYKF+ HG+LST E+MPYSA
Sbjct: 421
FIRDEKLLQKVIPFGLGKRSCLGESLARSELYLIFGNLLLRYKFQPHGELSTREIMPYSA 480
Query: 481 GKRPFKLEMKFVKI 494
GKRPFKLEM+F+K+
Sbjct: 481 GKRPFKLEMQFIKV 494
C03G6.15;
CE35217. = correct seq
MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGG 50
IVSTLDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFH 100
APVMRDVRNDIGVLITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEEL 150
DARCSDIDKLATNGVTITHASEFFDLTVGSIINSILVGKRFEEDTKHEFL 200
KIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQETAKNFAAAE 250
AIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300
DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTD 350
RTSTPYVNAVIGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQL 400
SALHVNETVFKNPQEFNPERFIRDGKLLQKVIPFGVGKRNCLGESLAKAE 450
LYL IFGNLLLRYKFEQHGKLSTTELMPYSAGKRPFKLEMKFVKI
>CYP33B1
C25E10.2 U50311 495 aa
Length = 496
Score = 2536 (892.7 bits), Expect =
4.9e-267, P = 4.9e-267
Identities = 485/496 (97%), Positives =
489/496 (98%)
My seq
wrong at green region yellow is intron boundary
Query: 1
MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG 60
MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG
Sbjct: 1
MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG 60
Query: 61 PIFTFYMGPVPFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFRGGEYGIMDISGDRW 120
PIFTFYMG PFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFR ++ + +SGDRW
Sbjct: 61 PIFTFYMGIAPFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFREMQFTKL-LSGDRW 119
Query: 121
KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI 180
KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI
Sbjct: 120
KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI 179
Query: 181
NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ 240
NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ
Sbjct: 180
NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ 239
Query: 241
KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD 300
KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD
Sbjct: 240
KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD 299
Query: 301
IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA 360
IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA
Sbjct: 300
IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA 359
Query: 361
NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF 420
NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF
Sbjct: 360
NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF 419
Query: 421
IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF 480
IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF
Sbjct: 420 IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF
479
Query: 481 SLVSHVIPYKCRQYIP 496
SLVSHVIPYKCRQYIP
Sbjct: 480 SLVSHVIPYKCRQYIP 495
>CYP32A1
C26F1.2 U53148 508
aa also Y97E10 contig266
Length = 522
Score = 1508 (530.8 bits), Expect =
4.3e-158, P = 4.3e-158
Identities = 283/321 (88%), Positives =
297/321 (92%)
Query: 1
MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPY 60
MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPY
Sbjct: 1
MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPY 60
Query: 61
AFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNT 120
AFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNT
Sbjct: 61 AFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNT
120
Query: 121
NISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADA 180
NISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADA
Sbjct: 121
NISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADA 180
Query: 181
VELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKF 240
VELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKF
Sbjct: 181
VELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKF 240
Query: 241
PWLWLKPIWYLTGLGFEFDRNVRMTNNFVRKVDAADFKIENQKNYYYLGIVLP---EAFK 297
PWLWLKPIWYLTGLGFEFDRNVRMTNNFVRKVDAAD + +K
+L ++L E
Sbjct: 241
PWLWLKPIWYLTGLGFEFDRNVRMTNNFVRKVDAADNEASEKKRKAFLDLLLTIQKEEGT 300
Query: 298 INFQVIQERKELLNEDGNEAS 318
++ + I+E + +G++ +
Sbjct: 301 LSDEDIREEVDTFMFEGHDTT 321
Score = 1184 (416.8 bits), Expect =
9.2e-124, P = 9.2e-124
Identities = 241/307 (78%), Positives =
254/307 (82%)
Query: 252
TGLGFEFDRNVRMTNNFVRKVDAADFKIENQKNYYYLGIV-------LPEAFKINFQVIQ 304
T +G + + + N +V V + N + +L + L F N ++
Sbjct: 207
TAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWYLTGLGFEFDRNVRMTN 266
Query: 305
ERKELLNEDGNEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDTTSSGIG 364
++
NEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDTTSSGIG
Sbjct: 267
NFVRKVDAADNEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDTTSSGIG 326
Query: 365 FTILWLGFYPECQKKLQKELDEVFGFETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLI
424
FTILWLGFYPEC FETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLI
Sbjct: 327 FTILWLGFYPEC-------------FETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLI
373
Query: 425
ARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRD 484
ARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRD
Sbjct: 374
ARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRD 433
Query: 485
PYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGM 544
PYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGM
Sbjct: 434
PYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGM 493
Query: 545 KIKIKRREAADYVV 558
KIKIKRREAADYVV
Sbjct: 494 KIKIKRREAADYVV 507
Cyan not
in my seq (intron?) C-terms do not agree
Is my seq
frameshifted at the end?
C26F1.2;
CE32809.
MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVG 50
SAHLFKWNPYAFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGP
100
VPIVFLGTSECIRPVLESNTNISKPSQYDKMSEWIGTGLLTSTHEKWFHR
150
RKMLTPTFHFTIIQDYFPVFVRNAEVLADAVELHVDGDYFDAFPYFKRCT
200
LDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWY
250
LTGLGFEFDRNVRMTNNFVRKVDAAD
FKIENQKNYYYLGIVLPEAFKINFQVIQERKELLNEDG
NEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDT
350
FMFEGHDTTSSGIGFTILWLGFYPECQKKLQKELDEVFGFETNQPPSMDD
400
IKKCSYLEKCIKESLRMFPSVPLIARRLSEDVTINHPSGQKIVLPAGLAA
450
CVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRDPYAYIPFSAGPRNCIG
500
QKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGMKIKIKR
550
REAADYVVL
>C26F1.2 U53148 CYP32A1 508 aa also Y97E10 contig266
MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKW
NPYAFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECI
RPVLESNTNISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQD
YFPVFVRNAEVLADAVELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQL
GHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWYLTGLGFEFDRNVRMTNN
FVRKVDAAD
NEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDT
TSSGIGFTILWLGFYPECFETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLIAR
RLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDID
AIAGRDPYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRP
VPELILRPYNGMKIKIKRREAADYVV
FTTFFRMKLDECEK*
>CYP25A1
C36A4.1 Z66495 496 aa
corrected 9/14/98
Length = 497
Score =
1060 (373.1 bits), Expect = 4.0e-244, Sum P(2) = 4.0e-244
Identities = 212/229 (92%), Positives =
213/229 (93%)
Query: 1
MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERKEKLGY 60
MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERK
Sbjct: 1
MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERK----- 55
Query: 61 DDANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSIYEANQ
120
DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSIYEANQ
Sbjct: 56 -DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSIYEANQ
114
Query: 121
LTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDILREKASSGQKWDI 180
LTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDILREKASSGQKWDI
Sbjct: 115
LTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDILREKASSGQKWDI 174
Query: 181
YDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYHPVTVKITINNFTYFHS 229
YDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY
V K I+N HS
Sbjct: 175
YDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY--VNAKKYISNIDIRHS 221
Score = 1283 (451.6 bits), Expect =
4.0e-244, Sum P(2) = 4.0e-244
Identities = 236/236 (100%), Positives =
236/236 (100%)
Query: 275
YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 334
YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL
Sbjct: 261
YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 320
Query: 335
LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCRED 394
LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCRED
Sbjct: 321
LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCRED 380
Query: 395
ITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 454
ITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY
Sbjct: 381
ITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 440
Query: 455
CVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPNDPVRLHLKPRN 510
CVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPNDPVRLHLKPRN
Sbjct: 441
CVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPNDPVRLHLKPRN 496
C36A4.1;
CE03070. cyan not in my seq, green instead
MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQ 50
TAERKEKLGYDDANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVF
100
IKNFSNFSDRSVPSIYEANQLTASLLMNSYSSGWKHTRSAIAPIFSTGKM
150
KAMQETINSKVDLFLDILREKASSGQKWDIYDDFQGLTLDVIGKCAFAID
200
SNCQRDRNDVFY
HPVTVKITINNFTYFHSSSPGTFHFLESTLQIHTTGRCRNSTCRRTVKCVGFRQDKAKFCSD
YERRRGGEGSDSVDLLKLLLNREDDK
300
SKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLSKYPNVQQKLYEEIM
350
EAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCREDITIRGQ
400
FYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGV
450
GPRYCVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPND
500
PVRLHLKPRN
510
>C36A4.1 Z66495 CYP25A1 496 aa corrected 9/14/98
MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERK
DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSI
YEANQLTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDI
LREKASSGQKWDIYDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY
VNAKKYISNIDIRHSKIIAASVLLPELSTFWKALYKYTPLADAEIPLVEGLSNV
YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTA
MTYCSYLLSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFY
PPHFSFIRRLCREDITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFE
NWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIVKLLDTFELKQFEGEAD
LIPDCNGVIMRPNDPVRLHLKPRN*
My green
seq matches other CYP25As see below
>CYP25A2
C36A4.2 Z66495 496 aa
Length = 497
Score = 234 (82.4 bits), Expect =
2.3e-22, P = 2.3e-22
Identities = 44/54 (81%), Positives =
51/54 (94%)
Query: 1
VNAKKYISNIDIRHSKIIAASVLLPELSTFWKALYKYTPLADAEIPLVEGLSNV 54
VNA+K+I+NIDIRHSK+IAAS
+LPEL+ W+ALYKYTPLADAEIPLVEGLSNV
Sbjct: 207
VNARKFIANIDIRHSKVIAASFILPELAPLWRALYKYTPLADAEIPLVEGLSNV 260
>CYP25A2
C36A4.2 Z66495 496 aa
Length = 497
Score = 1327 (467.1 bits), Expect =
9.8e-272, Sum P(2) = 9.8e-272
Identities = 260/266 (97%), Positives =
260/266 (97%)
Query: 1
MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDRKEKLGY 60
MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDRK
Sbjct: 1
MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDRK----- 55
Query: 61 DDSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPPIFDSNQ
120
DSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPPIFDSNQ
Sbjct: 56 -DSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPPIFDSNQ
114
Query: 121
LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLDILREKASSGQKWDI 180
LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLDILREKASSGQKWDI
Sbjct: 115
LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLDILREKASSGQKWDI 174
Query: 181
YEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELA 240
YEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELA
Sbjct: 175
YEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELA 234
Query: 241 PLWRALYKYTPLADAEIPLVEGLSNV
266
PLWRALYKYTPLADAEIPLVEGLSNV
Sbjct: 235 PLWRALYKYTPLADAEIPLVEGLSNV
260
Score = 1277 (449.5 bits), Expect =
9.8e-272, Sum P(2) = 9.8e-272
Identities = 236/236 (100%), Positives =
236/236 (100%)
Query: 287 YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL
346
YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL
Sbjct: 261
YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 320
Query: 347
LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFINRRCLAD 406
LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFINRRCLAD
Sbjct: 321
LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFINRRCLAD 380
Query: 407
ITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 466
ITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY
Sbjct: 381
ITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 440
Query: 467
CVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDCNGVIMRPKDPVRLLLKPRN 522
CVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDCNGVIMRPKDPVRLLLKPRN
Sbjct: 441
CVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDCNGVIMRPKDPVRLLLKPRN 496
C36A4.2;
CE03071. cyan not in my seq extra intron seq?
MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQ 50
IVDRKEKLGYDDSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVF
100
IKNFSNFSDRIVPPIFDSNQLNQSLLQNTYATGWKHTRSAIAPIFSTGKM
150
KAMQETIHSKVDLFLDILREKASSGQKWDIYEDFQGLTLDVIGKCAFAID
200
SNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELAPLWRALYKYT
250
PLADAEIPLVEGLSNV
TPFLLENFCSFSLILTKIFS
YERRRGGEGSDSVD
300
LLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLSKY
350
PNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFIN
400
RRCLADITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEE
450
KSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLI
500
PDCNGVIMRPKDPVRLLLKPRN
C06B3.3;
CE27662. CYP35C1
MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGG 50
VTNMLNHFRKEYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHV
100
SPMIDYVRKGNGVFFSNGDKWQELRRFSMLTMRNMGMGRDLMEEKILSEL
150
DARCAEINEKSIDGTTVLQVNEFFDLTVGSIINNMLLGFRFDERTKSRFL
200
TMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQFEIIDYVSVD
250
AIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVYDENNLKMLIT
300
DLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRD
350
KTETPYLNATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQL
400
SALHTNEKIFENPEKFNPERFIRNENLMQQTIPFGIGKRSCLGESLARAE
450
LYLIIGNLLLRYNFESSGKMPSTRETVPFGFAKRCEAFDMKVTKI
>CYP35C1
C06B3.3 Z77652 508 aa
Length = 509
Score = 2351 (827.6 bits), Expect =
3.3e-249, Sum P(2) = 3.3e-249
Identities = 452/457 (98%), Positives =
454/457 (99%)
M C-term
is wrong
Query: 1 MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTNMLNHFRK
60
MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTNMLNHFRK
Sbjct: 1
MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTNMLNHFRK 60
Query: 61
EYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMIDYVRKGNGVFFSNGDK 120
EYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMIDYVRKGNGVFFSNGDK
Sbjct: 61
EYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMIDYVRKGNGVFFSNGDK 120
Query: 121
WQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAEINEKSIDGTTVLQVNEFFDLTVGS 180
WQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAEINEKSIDGTTVLQVNEFFDLTVGS
Sbjct: 121
WQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAEINEKSIDGTTVLQVNEFFDLTVGS 180
Query: 181
IINNMLLGFRFDERTKSRFLTMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQ 240
IINNMLLGFRFDERTKSRFLTMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQ
Sbjct: 181
IINNMLLGFRFDERTKSRFLTMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQ 240
Query: 241
FEIIDYVSVDAIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVYD---ENNLKM 297
FEIIDYVSVDAIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVY+ ENNLKM
Sbjct: 241
FEIIDYVSVDAIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVYESFSENNLKM 300
Query: 298
LITDLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYL 357
LITDLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYL
Sbjct: 301
LITDLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYL 360
Query: 358
NATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFN 417
NATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFN
Sbjct: 361 NATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFN
420
Query: 418
PERFIRNENLMQQTIPFGIGKRSCLGESLARAELYLI 454
PERFIRNENLMQQTIPFGIGKRSCLGESLARAELYL+
Sbjct: 421
PERFIRNENLMQQTIPFGIGKRSCLGESLARAELYLV 457
>gi|17538369|ref|NM_069185.1|
Caenorhabditis elegans cytochrome P450 (ccp-31A1), mRNA
Length =
1269
Score = 780 bits (2014), Expect = 0.0
Identities = 407/487 (83%), Positives =
412/487 (84%), Gaps = 1/487 (0%)
Frame = +1
By attempting to make this into a functional gene
without gaps or stop codons the actual sequence has been asssembled incorrectly
as an mRNA
Query:
1
MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI 60
MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI
Sbjct: 1 MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI
180
Query:
61
GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNRGFAYVLLEPWLGISILTS 120
GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNR
Sbjct: 181
GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNR------------------ 306
Query: 121
QKEQWRLXXXXXXXXXXXXXXXXXXXXXXXXXKILVQKLCCLGADERVDVLSVIALCTLD 180
ILVQKLCCLGADERVDVLSVIALCTLD
Sbjct: 307
---------------------------------ILVQKLCCLGADERVDVLSVIALCTLD 387
Query: 181
IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMTEDGRTHEKCLHIFHDFTK 240
IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLM
I++ +
Sbjct: 388
IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNS--------FIYNLYGS 543
Query: 241
KLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG-MDETDVQAEGNTFMLEGHDTTSTGL 299
+I +
ENDYKMEGRLAFLDLLLEMVNSG MDETDVQAEGNTFMLEGHDTTSTGL
Sbjct: 544
FIINK------ENDYKMEGRLAFLDLLLEMVNSGQMDETDVQAEGNTFMLEGHDTTSTGL 705
Query: 300
MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT 359
MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT
Sbjct: 706
MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT 885
Query: 360
RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP 419
RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP
Sbjct: 886
RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP 1065
Query: 420
FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 479
FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR
Sbjct:
1066FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 1245
Query: 480
RRPIASP 486
RRPIASP
Sbjct:
1246RRPIASP 1266
Compare to
31A2
>CYP31A2
F22B3 Z68336 487 aa
Length = 488
Score = 2236 (787.1 bits), Expect =
3.0e-235, P = 3.0e-235
Identities = 435/488 (89%), Positives =
442/488 (90%)
Query: 1
MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI 60
MGVII AVLLA ATVIAWLLYKHLRMRQ LKHLNQPRSYPI+GHGLITKPDPEGFMNQVI
Sbjct: 1
MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60
Query: 61
GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNRGFAYVLLEPWLGISILTS 120
GMGYLYPDPRMCLLWIGPFPCLMLYS DLVE IFSSTKHLN+GFAYVLLEPWLGISILTS
Sbjct: 61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS
120
Query: 121
QKEQWRLXXXXXXXXXXXXXXXXXXXXXXXXXKILVQKLCCLGADERVDVLSVIALCTLD 180
QKEQWR
KILVQKLCCLGADE VDVLSVI LCTLD
Sbjct: 121
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEVDVLSVITLCTLD 180
Query: 181
IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMTEDGRTHEKCLHIFHDFTK 240
IICETSMG AIGAQLAENNEYVWAVHTINKLISKRTNNPL+TEDGRTHEKCL I HDFTK
Sbjct: 181
IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240
Query: 241
KLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG-MDETDVQAEGNTFMLEGHDTTSTGL 299
K+I ERK ALQENDYKMEGRLAFLDLLLEMV SG MDETDVQAE +TFM EGHDTTSTGL
Sbjct: 241
KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL 300
Query: 300 MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT
359
MWA+HLLGNHP+VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSV IIT
Sbjct: 301
MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT 360
Query: 360
RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP 419
RELSDDQVIGG NIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRF PENSIGRKSFAFIP
Sbjct: 361
RELSDDQVIGGVNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFLPENSIGRKSFAFIP 420
Query: 420
FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 479
FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR
Sbjct: 421
FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 480
Query: 480 RRPIASP* 487
RRPI SP*
Sbjct: 481 RRPIVSP* 488
compare to
cDNA
>gi|18246951|dbj|BJ104281.1|BJ104281 UniGene info BJ104281 unpublished
oligo-capped cDNA library, C. elegans L1 stage
Caenorhabditis elegans cDNA clone yk1059h10 5'.
Length = 601
Score = 164 bits (415), Expect = 4e-42
Identities = 84/100 (84%), Positives =
86/100 (86%), Gaps = 8/100 (8%)
Frame = +1
Query:
14
DVLSVITLCTLDIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL-------- 65
DVLSVI
LCTLDIICETSMG AIGAQLAENNEYVWAVHTINKLISKRTNNPL
Sbjct:
1
DVLSVIALCTLDIICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYN 180
Query:
66
ITEDGRTHEKCLRILHDFTKKVIVERKEALQENDYKMEGR 105
+TEDGRTHEKCL I HDFTKK+I ERK ALQENDYKMEGR
Sbjct: 181
LTEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGR
300
>gi|1122764|emb|Z68213.1|CEC01F6
Caenorhabditis elegans cosmid C01F6, complete sequence
Length =
31281
Score =
61.6 bits (148), Expect(5) = 0.0
Identities = 31/31 (100%), Positives =
31/31 (100%)
Frame = -2
Query:
1
MGVIILAVLLASATVIAWLLYKHLRMRQALK 31
MGVIILAVLLASATVIAWLLYKHLRMRQALK
Sbjct:
8876 MGVIILAVLLASATVIAWLLYKHLRMRQALK 8784
Score = 162 bits (410), Expect(5) = 0.0
Identities = 85/135 (62%), Positives =
89/135 (65%)
Frame = -3
Query:
31
KHLNQPRSYPIIGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLV 90
+HLNQPRSYPIIGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLV
Sbjct:
8740 QHLNQPRSYPIIGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLV 8561
Query:
91
EAIFSSTKHLNRGFAYVLLEPWLGISILTSQKEQWRLXXXXXXXXXXXXXXXXXXXXXXX 150
EAIFSSTKHLNR
+ I + W
Sbjct:
8560 EAIFSSTKHLNR------------VRIRFA*TMAWN-------QYFDEPERTMASFFDFL 8438
Query:
151 XXKILVQKLCCLGAD 165
KILVQKLCCLGA+
Sbjct:
8437 LLKILVQKLCCLGAE 8393
Score =
55.8 bits (133), Expect = 6e-09
Identities = 25/25 (100%), Positives =
25/25 (100%)
Frame = -1
Query:
103 GFAYVLLEPWLGISILTSQKEQWRL 127
GFAYVLLEPWLGISILTSQKEQWRL
Sbjct:
8526 GFAYVLLEPWLGISILTSQKEQWRL 8452
Score = 63.9 bits (154), Expect(5) = 0.0
Identities = 32/32 (100%), Positives = 32/32
(100%)
Frame = -2
Query:
166
ERVDVLSVIALCTLDIICETSMGIAIGAQLAE 197
ERVDVLSVIALCTLDIICETSMGIAIGAQLAE
Sbjct:
8345 ERVDVLSVIALCTLDIICETSMGIAIGAQLAE 8250
Score =
56.2 bits (134), Expect(5) = 0.0
Identities = 25/27 (92%), Positives =
26/27 (96%)
Frame = -1
Query:
195 LAENNEYVWAVHTINKLISKRTNNPLM
221
L +NNEYVWAVHTINKLISKRTNNPLM
Sbjct:
8211 LFQNNEYVWAVHTINKLISKRTNNPLM 8131
Score
= 480 bits (1236), Expect(5) = 0.0
Identities = 238/240 (99%), Positives =
239/240 (99%), Gaps = 1/240 (0%)
Frame = -3
Query:
222
TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG-MDETDV 280
TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG MDETDV
Sbjct:
8059 TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSGQMDETDV 7880
Query:
281
QAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKY 340
QAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKY
Sbjct:
7879 QAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKY 7700
Query:
341 LECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFD
400
LECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFD
Sbjct:
7699 LECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFD 7520
Query:
401
PDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVR 460
PDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEV+
Sbjct:
7519 PDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVK 7340
Score = 60.5 bits (145), Expect = 4e-09
Identities = 28/29 (96%), Positives =
29/29 (100%)
Frame = -2
Query:
458 EVRPKMEIIVRPVTPIHMKLTRRRPIASP
486
+VRPKMEIIVRPVTPIHMKLTRRRPIASP
Sbjct:
6887 QVRPKMEIIVRPVTPIHMKLTRRRPIASP 6801
Sequence
F23A7.3 matches this sequence NM_077558.
This seq is a fusion protein with P450 CYP29A1 on the bottom and some
unknown sequence on the top. F23A7.3 matches the top only (CYAN). I think this may be an artifactual
fusion. This sequence should be
unfused and the annotation of F23A7.3 as a P450 should be stopped. Note that the first exon of 29A1 has
been skipped in this fusion.
These
genes are not linked in C. briggsae
NM_077558
2199 bp
mRNA
linear INV
21-NOV-2003
DEFINITION Caenorhabditis elegans cytochrome 450
family member (XM328), mRNA.
MVRKSFKRNHSSTAEDDNKDFDIKKVSDGLSSSISEPFVVANFP
KNKQWDIDEMTTNEKYIAYLPTVVKVTPAHKKRLLMNAELNNLFENLRTHKTGPLDTE
IDGANFVAIEKDVEPHLIRKIQLLPNGMLKLIWPVLPRNEYESNSNNQEEIDATKKEL
SETKKQLETERQKSIESQRFLEKSCNEKVSKLAVENQKLKNKMAQKSDETDSAYRMIL
ELKKKHEQELQMKNSTIMKLEDNV KFNNSSNEQFSKMTTENQQLKDWMATKTKEVEHA
AKMMLELKQKHIEELQMKTTQITKLENDVKLNGNFMDHISELTEENHNLKEQLKQKTD
AETKMIAEMKKKHRGERHYMNSRITKLEDALEYKDNQLSKREIAHNKLLDQLS
EITEIFRQTADETRSQGKSVMKYHILGKLYVWPLDGKTIAKLVESTTELNKGDDYNFFLPWLG
GGVLVEGFGERWRTHRKLLTPTFHFAKLEGYLEVFYSETKIMIEHLEKFADNEETVDM
FPYIKRCALDIICGAAIGTKINAQMHHNHPYVKAVEGFNSMAISHAINPSYQIPAIYW
ALGLKKQKDAHLNTMKTFTVNVIADRKAAIASGEVEKETSKRKMNFLDILLN N
Missing
sequence of CYP29A1 here.
GMTIP
SGANVSIAPLALHSNAQVFPNPNKFDPDRFLPDEIAKRNAYDFMPFSAGLRNCIGEKF
ALLNEKVMMIHILKNFRLEPMGGFYSTKPMFEAVARPSNGISVKLIRRQF"
>CYP29A1 C44C10.2
CE05409 Z69787 502 aa
also Y102F5 AL022276 61000-63000 region
MALILIIILICLLSFKPWSWKTIQLIXKYRQYDDKIPGPPTHPIFGNTETFKNKSQT
EITEIFRQTADETRSQGKSVMKYHILGKLYVWPLDGKTIAKLVESTTELNKGDDYNFF
LPWLGGGVLVEGFGERWRTHRKLLTPTFHFAKLEGYLEVFYSETKIMIEHLEKFA
DNEETVDMFPYIKRCALDIICGAAIGTKINAQMHHNHPYVKAVEGFNSMAISHAIN
PSYQIPAIYWALGLKKQKDAHLNTMKTFTVNVIADRKAAIASGEVEKETSKRKMN
FLDILLN
SEESNSLTSEDIRQEVDTFMFAGHDTTTTSVSWACWNLAHNPDIQEKVY
EEIVHIFGEEPNDEVTSEGISXLEYTERVLNESKRIIAPVPSLQRKLINDMEFG
GMTIP
SGANVSIAPLALHSNAQVFPNPNKFDPDRFLPDEIAKRNAYDFMPFSAGLRNCIGE
KFALLNEKVMMIHILKNFRLEPMGGFYSTKPMFEAVARPSNGISVKLIRRQF*
F23A7.3;
CE09577.
MVRIICKRKTAEDKPNAFEFKKVNKGLTSPSDGPFVIAHFPKNMIWEIDQ 50
ETSNEKYIAYFPTIVSSPDTKRLMENDQLKMLYEKLRDHKSGPLKTQIDG 100
TNFVAVEQDAEPHQIESIQLLPNGNLKLYWPVICRDELEEDTNKEFNEMK 150
KDLYETKRELTEVRISLMNLEAEHRTLQTNCNRKDTQLFERHEEIQQLKD 200
HIQWKSKDTEVAAKMVFQMQNKHKEEVEKYTSTISKLKEEV VAKDKEIVE 250
NKTANDNMVSALLGLINKQRRVGT
>gi|17550997|ref|NM_077558.1|
Caenorhabditis elegans cytochrome 450 family member (XM328), mRNA
Length =
2199
Score = 208 bits (530), Expect = 2e-52
Identities = 117/248 (47%), Positives =
167/248 (67%), Gaps = 7/248 (2%)
Frame = +1
Query:
1 MVRIICKRK---TAEDKPNAFEFKKVNKGLTSPSDGPFVIAHFPKNMIWEIDQETSNEKY
57
MVR KR TAED F+ KKV+ GL+S PFV+A+FPKN
W+ID+ T+NEKY
Sbjct:
1 MVRKSFKRNHSSTAEDDNKDFDIKKVSDGLSSSISEPFVVANFPKNKQWDIDEMTTNEKY
180
Query:
58 IAYFPTIV--SSPDTKRLMENDQLKMLYEKLRDHKSGPLKTQIDGTNFVAVEQDAEPHQI
115
IAY
PT+V + KRL+ N +L
L+E LR HK+GPL T+IDG NFVA+E+D EPH I
Sbjct: 181
IAYLPTVVKVTPAHKKRLLMNAELNNLFENLRTHKTGPLDTEIDGANFVAIEKDVEPHLI
360
Query: 116
ESIQLLPNGNLKLYWPVICRDELEEDTN--KEFNEMKKDLYETKRELTEVRISLMNLEAE
173
IQLLPNG LKL WPV+ R+E E ++N
+E + KK+L ETK++L R ++E++
Sbjct: 361
RKIQLLPNGMLKLIWPVLPRNEYESNSNNQEEIDATKKELSETKKQLETER--QKSIESQ
534
Query: 174
HRTLQTNCNRKDTQLFERHEEIQQLKDHIQWKSKDTEVAAKMVFQMQNKHKEEVEKYTST
233
R L+ +CN K ++L
E Q+LK+ + KS +T+ A +M+ +++
KH++E++ ST
Sbjct: 535
-RFLEKSCNEKVSKL---AVENQKLKNKMAQKSDETDSAYRMILELKKKHEQELQMKNST
702
Query: 234
ISKLKEEV 241
I KL++
V
Sbjct: 703
IMKLEDNV 726
CYP31A2
>gi|4263218|gb|AC006720.1| Download subject sequence spanning the
HSP Caenorhabditis elegans cosmid Y17G9B, complete sequence
Length = 19620
Score = 223 bits (567), Expect = 3e-59
Identities = 119/152 (78%), Positives =
120/152 (78%), Gaps = 31/152 (20%)
Frame = +3
Query:
1
DVLSVITLCTLDIICETSMGKAIGAQLAE----------------NNEYVWAVHTINKLI 44
DVLSVITLCTLDIICETSMGKAIGAQLAE
NNEYVWAVHTINKLI
Sbjct:
12522 DVLSVITLCTLDIICETSMGKAIGAQLAEVREILQSQHF*TFFFQNNEYVWAVHTINKLI 12701
Query:
45
SKRTNNPLMWNSFIYNII---------------TEDGRTHEKCLRILHDFTKKVIVERKE 89
SKRTNNPLMWNSFIYN+
TEDGRTHEKCLRILHDFTKKVIVERKE
Sbjct:
12702 SKRTNNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGRTHEKCLRILHDFTKKVIVERKE 12881
This seq
is almost identical to CYP31A2 (only 4 amino acid differences, plus some minor errors at intron boundaries. There is a large fragment missing in the middle
My
conclusion is that this is a very recent pseudogene of CYP31A2 = CYP31A5P
>CYP31A2
F22B3 Z68336 487 aa
Length = 488
Score = 1239 (436.1 bits), Expect =
4.8e-143, Sum P(2) = 4.8e-143
Identities = 237/251 (94%), Positives =
240/251 (95%)
Query: 1 MGVIIPAVLLASATVIAWLIYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI
60
MGVIIPAVLLA ATVIAWL+YKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI
Sbjct: 1 MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI
60
Query: 61
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS
Sbjct: 61
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120
Query: 121
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILIQKLCCLGVADEEVDVLSVITLCTL
180
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKIL+QKLCCLG ADEEVDVLSVITLCTL
Sbjct: 121
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLG-ADEEVDVLSVITLCTL
179
Query: 181 DIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYNLTEDGRTHEKC 240
DIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL +TEDGRTHEKC
Sbjct: 180
DIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL--------ITEDGRTHEKC 231
Query: 241 LHILHDFTKKV 251
L ILHDFTKKV
Sbjct: 232 LRILHDFTKKV 242
Score = 148 (52.1 bits), Expect =
4.8e-143, Sum P(2) = 4.8e-143
Identities = 30/40 (75%), Positives =
34/40 (85%)
Query: 239
KCLHILHDFTKKVRPKMEIIVRPVTPIHMKLTRRRPIVSP 278
K + ++H+
VRPKMEIIVRPVTPIHMKLTRRRPIVSP
Sbjct: 452
KAVELMHE----VRPKMEIIVRPVTPIHMKLTRRRPIVSP 487
>CYP31A2 F22B3 Z68336
487 aa
MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFM
NQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEP
WLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEV
DVLSVITLCTLDIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDG
RTHEKCLRILHDFTKK VIVERKEALQENDYKMEGRLAFL
DLLLEMVKSGQMDETD
VQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEH
LSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPA
QWKDPDVFDPDRFLPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRN
FNIKAVELMHE
VRPKMEIIVRPVTPIHMKLTRRRPIVSP*
>gi|6425490|emb|AL132865.1|CEY62E10A
Caenorhabditis elegans YAC Y62E10A, complete sequence
Length =
57237
Score = 60.1 bits (144),
Expect(3) = e-112
Identities = 29/31 (93%), Positives = 30/31 (96%)
Frame = +3
Query: 1
MGVIIPAVLLAMATVIAWLLYKHLRMRQVLK 31
MGVIIPAVLLA ATVIAWL+YKHLRMRQVLK
Sbjct: 42477
MGVIIPAVLLASATVIAWLIYKHLRMRQVLK 42569
Score = 285 bits
(730), Expect(3) = e-112
Identities = 134/145 (92%), Positives = 139/145 (95%), Gaps =
1/145 (0%)
Frame = +1
Query: 23
HLRMRQV-LKHLNQPRSYPIVGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPC 81
HL++ +
+HLNQPRSYPIVGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPC
Sbjct: 42586
HLKIN*INFQHLNQPRSYPIVGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPC 42765
Query: 82
LMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDIL 141
LMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDIL
Sbjct: 42766
LMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDIL 42945
Query: 142 KDFLPIFNEQSKILVQKLCCLGADE 166
KDFLPIFNEQSKIL+QKLCCLG E
Sbjct: 42946
KDFLPIFNEQSKILIQKLCCLGVAE 43020
Score = 102 bits
(254), Expect(3) = e-112
Identities = 55/73 (75%), Positives = 56/73 (76%), Gaps = 17/73
(23%)
Frame = +3
Query: 166
EEVDVLSVITLCTLDIICETSMGKAIGAQLAE-----------------NNEYVWAVHTI 208
EEVDVLSVITLCTLDIICETSMGKAIGAQLAE
NNEYVWAVHTI
Sbjct: 43068
EEVDVLSVITLCTLDIICETSMGKAIGAQLAEVREILPSI*ISKCFFFQNNEYVWAVHTI 43247
Query: 209 NKLISKRTNNPLI 221
NKLISKRTNNPL+
Sbjct: 43248 NKLISKRTNNPLM
43286
Score = 84.3 bits (207), Expect = 2e-16
Identities = 39/42 (92%), Positives = 41/42 (97%)
Frame = +1
Query: 222
TEDGRTHEKCLRILHDFTKKVIVERKEALQENDYKMEGRLAF 263
TEDGRTHEKCL ILHDFTKKVIVERKEALQ++DYKMEGRLAF
Sbjct: 43885
TEDGRTHEKCLHILHDFTKKVIVERKEALQDSDYKMEGRLAF 44010
This gene missing 195 amino acids here
Score = 60.5 bits (145), Expect = 4e-09
Identities = 28/29 (96%), Positives = 29/29 (100%)
Frame = +1
Query: 459 EVRPKMEIIVRPVTPIHMKLTRRRPIVSP 487
+VRPKMEIIVRPVTPIHMKLTRRRPIVSP
Sbjct: 44824
QVRPKMEIIVRPVTPIHMKLTRRRPIVSP 44910
Y62E10A.15; CE28717.
MGVIIPAVLLASATVIAWLIYKHLRMRQVLK
(0)
HLNQPRSYPIVGHGLITKP 50
DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL
100
NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE
150
QSKILIQKLCCLGVAD (2)
EEVDVLSVITLCTLDIICETSMGKAIGAQLAE
(0)
NNEYVWAVHTINKLISKRTNNPI (2)
(LMWNSFIYNL) delete this
seq
TEDGRTHEKCLHILHDFTKK 250
VIVERKEALQDSDYKMEGRLAF
(add this seq)
(sequence gap)
VRPKMEIIVRPVTPIHMKLTRRRPIVSP*
New pseudogene seq =
CYP31A5P
MGVIIPAVLLASATVIAWLIYKHLRMRQVLK
(0)
HLNQPRSYPIVGHGLITKP
DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL
NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE
QSKILIQKLCCLGVAD (2)
EEVDVLSVITLCTLDIICETSMGKAIGAQLAE
(0)
NNEYVWAVHTINKLISKRTNNPI (2)
TEDGRTHEKCLHILHDFTKK
VIVERKEALQDSDYKMEGRLAF
(sequence gap)
VRPKMEIIVRPVTPIHMKLTRRRPIVSP*
Y5H2B.5 = CYP32B1
Y5H2B.5; CE26222.
MLAAALVLLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGT
LWYFKKDPVEMVYQAQAWFSEYTLAPDNCGVLKVAVNVEVS (0)
LGFQLWLGPIPAVNIARGEIAK ()
IVLDSSVNISKSSQYNKLKEWIGDGLLI
()
STGDKWRSRRKMLTQTFHF ()
AVLKEYQKIFGAQGKILVEVLQLRANNKFSFDIMPYIKRCALDIICETAM
GCSISSQRGANDEYVNSVRRLSEIVWNYEKAPQFWLKPIWYLFGDGFEFN
RHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDYLLKSQEEHPDI
LTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYQQRVHDELDEIF
GEDFERIPNSEDIQKMVYLEQCIKETLRMTPPVPFVSRKLTEDVKI
()
PHATKPDLLLPAGINCMINIITIMKDARYFERPYEFFPEHFSPERVAAREPFAFVPFSAGPRNCI
()
GQKFALLEEKVLLSWIFRNFTVTSMTKFPEEMPIPELILKPQFGTQVLLRNRRKL
>EMBOSS_001_4
EHFGISKRILLVGTNFYILNTFKYFRNGIPSPGMVQRVHTGSG*LWCVEGSGKC*SFSVK
LGFQLWLGPIPAVNIARGEI
>EMBOSS_001_5
TLWYFKKDPV
GRN*FLYT*HF*IF*KWYTKPRHGSASTHWLRITVVC*R*R*MLKFQCEI
GFSVMVGTNTSCQHCERGDX
>EMBOSS_001_6
NTLVFQKGSCW*ELIFIYLTLLNILEMVYQAQAWFSEYTLAPDNCGVLKVAVNVEVSV*N
WVFSYGWDQYQLSTLREGRX
46% to 32A1 (below) 42% to
37A1 39% to 37B1
>CYP32A1 C26F1.2 U53148 508 aa also Y97E10 contig266 revised 3/19/04
Length = 509
Score = 1169 (411.5 bits), Expect = 3.6e-122, P = 3.6e-122
Identities = 228/495 (46%), Positives = 329/495 (66%)
Query: 8
LLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGTLWYFKKDPVEMVYQA 67
+++ Y YL+ IL+ + + C + + GPPA+ + G+ FK +P +Q
Sbjct: 7
IVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPYAFTFQM 66
Query: 68
QAWFSEYTL---------APDN--CGVLKLWLGPIPAVNIARGEIAKIVLDSSVNISKSS 116
+ W +Y
AP+N G++ LW+GP+P V
+ E + VL+S+ NISK S
Sbjct: 67
EGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNTNISKPS 126
Query: 117 QYNKLKEWIGDGLLISTGDKWRSRRKMLTQTFHFAVLKEYQKIFGAQGKILVEVLQLRAN
176
QY+K+ EWIG GLL ST +KW
RRKMLT TFHF ++++Y +F ++L + ++L +
Sbjct: 127
QYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADAVELHVD 186
Query: 177 NKFSFDIMPYIKRCALDIICETAMGCSISSQRGANDEYVNSVRRLSEIVWNYEKAPQFWL
236
+ FD PY KRC LDIICETAMG +++Q G N+EYV++V+R+SEIVWN+ K P WL
Sbjct: 187
GDY-FDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKFPWLWL 245
Query: 237
KPIWYLFGDGFEFNRHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDYLLKSQEE 296
KPIWYL G GFEF+R+V++T +F R V ++E +E
K+ AFLD LL Q+E
Sbjct: 246
KPIWYLTGLGFEFDRNVRMTNNFVRKV--------DAADNEASEKKRKAFLDLLLTIQKE 297
Query: 297
HPDILTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYFGEDFERIPNSEDIQKMV 356
L+DE
IREEVDTFMFEGHDTTSSGI F + +LG +PE F
+ + P+ +DI+K
Sbjct: 298
E-GTLSDEDIREEVDTFMFEGHDTTSSGIGFTILWLGFYPECF--ETNQPPSMDDIKKCS 354
Query: 357
YLEQCIKETLRMTPPVPFVSRKLTEDVKIPHATKPDLLLPAGINCMINIITIMKDARYFE 416
YLE+CIKE+LRM P VP ++R+L+EDV I H + ++LPAG+
++ I +D R +
Sbjct: 355
YLEKCIKESLRMFPSVPLIARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWP 414
Query: 417
RPYEFFPEHFSPERVAAREPFAFVPFSAGPRNCIGQKFALLEEKVLLSWIFRNFTVTSMT 476
P + P++F + +A R+P+A++PFSAGPRNCIGQKFALLE+K +LS FR + V S+
Sbjct: 415
DPDTYNPDNFDIDAIAGRDPYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQ 474
Query: 477 KFPEEMPIPELILKPQFGTQVLLRNR
502
P+PELIL+P G ++ ++ R
Sbjct: 475 TEENLRPVPELILRPYNGMKIKIKRR
500
>gi|22417476|emb|CAAC01000007.1| Download subject sequence spanning the
HSP Caenorhabditis
briggsae contig cb25.fpc0023 from assembly cb25.agp8,
whole genome shotgun sequence
Length =
1407032
726375
MIAAILVALLTYFLFSIFQKRHEIQKFLYARRICIQEFEKVKGPPAVPIFGTTWYFKSDP 726554
726555
IEMIKQAQKWFVEYTLAPDSNGLLK (0) 726629
726676 FWMGPVPVVSICRGEVAK
()
726977
IFDSSTNIPKSSQYRKLKEWIGDGLLI () 727057
727102
STGPKWRSRRKMLTQTFHFAVLKEYHKVFASQGKILVDVLRLRANNTYPFDIMPYIKR 727275
727276 CTLDIIC () 727296
727682
ETAMGCSISSQMGSNDKYVESVKRLSELVWNYEK () 727783
727827
APLYWLKPIWYLFGNGFEFDRLVKLTTDFTRDVIDKRKEELKLHESEPSDKKLAFLDY 728000
728001
LLKSQTDHPEILTDEGIREEVDTFMFEGHDTTSSGIKFAIWFLGQYPEYQQQVQDEMDEI 728180
728181
FGDDYERYPNSEDIQRMIYLEQCIKETLRLTPPVPFISRQLEEDVLI () 728321
728362
AHATKPPVLLPAGMNIMINIITIMKDARYFEKPYEFFPEHFEPERVNSREAFAYVPFSA 728538
728539 GPRNCI ()
728820
GQKFALLEEKVVLSWIFRNFTVTSMSKYPEEHPIPELILKPQFGTQVLLKNRRK 728981
Score = 119 bits (299), Expect(2) = 8e-30
Identities = 53/86 (61%), Positives = 71/86 (82%)
Frame = +3
Query: 1 MLAAALVLLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGTLWYFKKDP
60
M+AA LV LLTYF + IF+++ +I +FL+ R+IC +EF+K++GPPAV IFGT WYFK DP
Sbjct: 726375
MIAAILVALLTYFLFSIFQKRHEIQKFLYARRICIQEFEKVKGPPAVPIFGTTWYFKSDP 726554
Query: 61
VEMVYQAQAWFSEYTLAPDNCGVLKL 86
+EM+ QAQ WF EYTLAPD+ G+LK+
Sbjct: 726555
IEMIKQAQKWFVEYTLAPDSNGLLKV 726632
Score = 37.7 bits (86),
Expect(2) = 8e-30
Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps =
2/50 (4%)
Frame = +1
Query: 85
KLWLGPIPAVNIARGEIAKI--VLDSSVNISKSSQYNKLKEWIGDGLLIS 132
+ W+GP+P V+I RGE+AK+
DS ++ +Y K+ + D +I+
Sbjct: 726673
QFWMGPVPVVSICRGEVAKVGQKFDSIEHLGSFDRYQKVIN*LNDHHIIN 726822
Score = 48.9 bits (115), Expect(2) = 2e-32
Identities = 22/27 (81%), Positives = 23/27 (85%)
Frame = +2
Query: 105 VLDSSVNISKSSQYNKLKEWIGDGLLI
131
+ DSS NI KSSQY KLKEWIGDGLLI
Sbjct: 726977
IFDSSTNIPKSSQYRKLKEWIGDGLLI 727057
Score = 117 bits
(293), Expect(2) = 2e-32
Identities = 54/67 (80%), Positives = 60/67 (89%)
Frame = +1
Query: 130
LISTGDKWRSRRKMLTQTFHFAVLKEYQKIFGAQGKILVEVLQLRANNKFSFDIMPYIKR 189
+ STG KWRSRRKMLTQTFHFAVLKEY K+F +QGKILV+VL+LRANN + FDIMPYIKR
Sbjct: 727096
IYSTGPKWRSRRKMLTQTFHFAVLKEYHKVFASQGKILVDVLRLRANNTYPFDIMPYIKR 727275
Query: 190 CALDIIC 196
C LDIIC
Sbjct: 727276 CTLDIIC
727296
Score = 63.5 bits (153),
Expect(3) = e-117
Identities = 28/34 (82%), Positives = 32/34 (94%)
Frame = +2
Query: 197
ETAMGCSISSQRGANDEYVNSVRRLSEIVWNYEK 230
ETAMGCSISSQ G+ND+YV SV+RLSE+VWNYEK
Sbjct: 727682
ETAMGCSISSQMGSNDKYVESVKRLSELVWNYEK 727783
Score = 285 bits
(730), Expect(3) = e-117
Identities = 133/167 (79%), Positives = 154/167 (92%)
Frame = +3
Query: 230
KAPQFWLKPIWYLFGDGFEFNRHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDY 289
+AP +WLKPIWYLFG+GFEF+R VKLTTDFTRDVI+ RK+ELK H SE ++ KKLAFLDY
Sbjct: 727824
RAPLYWLKPIWYLFGNGFEFDRLVKLTTDFTRDVIDKRKEELKLHESEPSD-KKLAFLDY 728000
Query: 290
LLKSQEEHPDILTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYQQRVHDELDEI 349
LLKSQ
+HP+ILTDEGIREEVDTFMFEGHDTTSSGI FA+WFLGQ+PEYQQ+V DE+DEI
Sbjct: 728001
LLKSQTDHPEILTDEGIREEVDTFMFEGHDTTSSGIKFAIWFLGQYPEYQQQVQDEMDEI 728180
Query: 350
FGEDFERIPNSEDIQKMVYLEQCIKETLRMTPPVPFVSRKLTEDVKI 396
FG+D+ER PNSEDIQ+M+YLEQCIKETLR+TPPVPF+SR+L EDV I
Sbjct: 728181
FGDDYERYPNSEDIQRMIYLEQCIKETLRLTPPVPFISRQLEEDVLI 728321
Score = 122 bits
(307), Expect(3) = e-117
Identities = 58/89 (65%), Positives = 70/89 (78%)
Frame = +1
Query: 396
IPHATKPDLLLPAGINCMINIITIMKDARYFERPYEFFPEHFSPERVAAREPFAFVPFSA 455
+ HATKP +LLPAG+N MINIITIMKDARYFE+PYEFFPEHF PERV +RE FA+VPFSA
Sbjct: 728359
LAHATKPPVLLPAGMNIMINIITIMKDARYFEKPYEFFPEHFEPERVNSREAFAYVPFSA 728538
Query: 456
GPRNCIGQKFALLEEKVLLSWIFRNFTVT 484
GPRNCIG+ + E + + + F V+
Sbjct: 728539
GPRNCIGK*YDKCEPILYVHFKFNYLDVS 728625
Score = 104 bits
(259), Expect = 4e-20
Identities = 49/55 (89%), Positives = 54/55 (98%)
Frame = +3
Query: 461
IGQKFALLEEKVLLSWIFRNFTVTSMTKFPEEMPIPELILKPQFGTQVLLRNRRK 515
+GQKFALLEEKV+LSWIFRNFTVTSM+K+PEE
PIPELILKPQFGTQVLL+NRRK
Sbjct: 728817
VGQKFALLEEKVVLSWIFRNFTVTSMSKYPEEHPIPELILKPQFGTQVLLKNRRK 728981
Y5H2B.6 = CYP33C12P
Y5H2B.6; CE21315. AC006810
one in frame stop codon = pseudogene
MLLLLFLSVLFLALFYEFHWKRRNYPAGPLPLPVIGNMWSMMRNNSGVEC
FRQWTKDFGDVYTFWFGTKPYIVVSSYKRLKEAFILDGDTFADKIRQPFQ
DQFRGGNYGVVDTNGHVWSTHRRFALSSFRDFGLGKNLLQEKMLIEVQDM
FAKFDANLGKEQNLPVVLYNAAANVINQLIFGYRFDKEREGELKKLKALM
EFQETAFTTFKVYVQFFAPAIGKHLPGKSVEDLLAEFTVDFYKFFNHQIE
EHRSKIDFDSEESLDYAEAYLKEQRKQEAQGEFELFSTKQLSNTCFDLWF
AGLSTTHITLTWIVGHVLNYPDVQRKLHKELDEVIGSDRLITNDDKNNLP
YLNAVINESQRCANIVPINQIHSTSRDTVINGITVKKGTGVIPQISAIML
DDKVFPDPYAFNPERFLDANGKLRKI*EFVPFSVGKRQCLGEGLARMEMF
IFMSNFFNRYQ (0) 17014
VSPASSGPPSLVKESLLNVAPRKFDAILKKRHV*
16469
This gene is a CYP33C but
it is missing its C-terminal
Including the heme
signature of p450s. The missing
seq has a stop codon.
Compare to briggsae
>CYP33C.b cb25.fpc0023 4
exons
Length = 495
Score = 1761 (619.9 bits), Expect = 6.6e-185, P = 6.6e-185
Identities = 318/426 (74%), Positives = 375/426 (88%)
Query: 1
MLLLLFLSVLFLALFYEFHWKRRNYPAGPLPLPVIGNMWSMMRNNSGVECFRQWTKDFGD 60
M+LLL + L L LF+E +WKRRNYP
GPLPLPVIGNM +M+ G E FRQWTK+FGD
Sbjct: 1
MILLLLFTTLSLWLFHELYWKRRNYPNGPLPLPVIGNMVPIMKAKPGYEAFRQWTKEFGD 60
Query: 61
VYTFWFGTKPYIVVSSYKRLKEAFILDGDTFADKIRQPFQDQFRGGNYGVVDTNGHVWST 120
V+TFW GTKPYIVVSSYKRLKE FILDGDT+ADK
QPFQ+QFRGG YGV+DTNGHVWST
Sbjct: 61
VFTFWLGTKPYIVVSSYKRLKETFILDGDTYADKAYQPFQEQFRGGQYGVIDTNGHVWST 120
Query: 121
HRRFALSSFRDFGLGKNLLQEKMLIEVQDMFAKFDANLGKEQNLPVVLYNAAANVINQLI 180
HRRFAL++FRDFGLGK+L+Q+K+LIEV ++F KFD N+GKEQ +P V YNA ANVINQLI
Sbjct: 121
HRRFALTTFRDFGLGKDLMQQKILIEVDEIFRKFDENIGKEQEIPGVFYNAGANVINQLI 180
Query: 181 FGYRFDKEREGELKKLKALMEFQETAFTTFKVYVQFFAPAIGKHLPGKSVEDLLAEFTVD
240
FGYRFD+E++ ELKKLKALMEFQETAFTTFKV VQFFAP +G+ LPGKSVE+LLAEFTVD
Sbjct: 181
FGYRFDEEKQEELKKLKALMEFQETAFTTFKVQVQFFAPIVGRMLPGKSVEELLAEFTVD 240
Query: 241
FYKFFNHQIEEHRSKIDFDSEESLDYAEAYLKEQRKQEAQGEFELFSTKQLSNTCFDLWF 300
FYKFF+HQIEEHRSKIDFDSEE+LDYAEAYLKEQ+K+E++G+ ELF +QLSN CFDLW
Sbjct: 241
FYKFFDHQIEEHRSKIDFDSEENLDYAEAYLKEQKKKESEGDMELFGNRQLSNMCFDLWV 300
Query: 301
AGLSTTHITLTWIVGHVLNYPDVQRKLHKELDEVIGSDRLITNDDKNNLPYLNAVINESQ 360
AGLSTTH TL+WI+ +VLN+ +VQ+ +
ELDEVIGSDRLI+ DKNNLP++NAVINESQ
Sbjct: 301
AGLSTTHTTLSWIIAYVLNHSEVQKTMQIELDEVIGSDRLISTGDKNNLPFMNAVINESQ 360
Query: 361
RCANIVPINQIHSTSRDTVINGITVKKGTGVIPQISAIMLDDKVFPDPYAFNPERFLDAN 420
RCANIVP+NQIH S+DT+ING+
VKKGTG+IPQIS ++LD+ FPDPY FNPERF+D
+
Sbjct: 361
RCANIVPLNQIHCVSKDTMINGVLVKKGTGIIPQISTVLLDETTFPDPYKFNPERFIDEH 420
Query: 421 GKLRKI 426
GKL+K+
Sbjct: 421 GKLKKV 426
>CYP33C.b cb25.fpc0023 4
exons
723577 MILLLLFTTLSLWLFHELYWKRRNYPNGPLPLPVIGNMVPIMKAKPGYEAFRQWTK
723744
723745 EFGDVFTFWL (1)
723774
724349
GTKPYIVVSSYKRLKETFILDGDTYADKAYQP 724444
724445
FQEQFRGGQYGVIDTNGHVWSTHRRFALTTFRDFGLGKDLMQQKILIEVDEIFRKFDENI 724624
724625
GKEQEIPGVFYNAGANVINQLIFGYRFDEEKQEELKKLKALMEFQETAFTTFKVQVQFFA 724804
724805 PIVGRMLPGKSVEEL (2)
724849
724904
LAEFTVDFYKFFDHQIEEHRSKIDFDSEENLDYAEA 725009
725010
YLKEQKKKESEGDMELFGNRQLSNMCFDLWVAGLSTTHTTLSWIIAYVLNHSEVQKTMQI 725189
725190
ELDEVIGSDRLISTGDKNNLPFMNAVINESQRCANIVPLNQIHCVSKDTMINGVLVKKGT 725369
725370
GIIPQISTVLLDETTFPDPYKFNPERFIDEHGKLKKV EELCAFSVGKRQCLGEGLARMEM 725549
725550 FLFISNFFNRYQ (0) 725585
725633 VTPGSSGPPSLEKETMFNATPRKIRAVLTKRYL* 725734
CYP31A3 and pseudogene
CYP31A4P
>CYP31A3 T16C6 486 aa
this sequence revised according to Y17G9.contig 61
Length = 488
Score = 1164 (409.7 bits),
Expect = 1.2e-121, P = 1.2e-121
Identities = 242/332 (72%), Positives = 263/332 (79%)
Query: 1
MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60
MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI
Sbjct: 1
MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60
Query: 61
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS
Sbjct: 61
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120
Query: 121
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD
Sbjct: 121
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180
Query: 181
IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNS---------FIYNLYD 231
IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL+
+++
Sbjct: 181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK
240
Query: 232
SFIIKKVNSILF--FRTEDGRTHEKCLRILHDFTKKVIVERKEALQEND-YKMEGR-LAF 287
I+++ ++ ++ E GR L +L + K ++ + E D + EG
Sbjct: 241
KVIVERKEALQENDYKME-GRL--AFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTS 297
Query: 288
LDLLLEMVKSGQMDETD--VQAEVDTFMFEGHDTT 320
L+ + G E
VQAE+D M + D T
Sbjct: 298
TGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVT 332
Score = 1394 (490.7 bits), Expect = 5.1e-146, P = 5.1e-146
Identities = 274/295 (92%), Positives = 277/295 (93%)
Query: 217 NNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGRTHEKCLRILHDFTKKVIVERKEALQE
276
NN +W N I K+ N+ L TEDGRTHEKCLRILHDFTKKVIVERKEALQE
Sbjct: 198
NNEYVWAVHTIN---KLISKRTNNPLI--TEDGRTHEKCLRILHDFTKKVIVERKEALQE 252
Query: 277
NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE 336
NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE
Sbjct: 253
NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE 312
Query: 337 VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV
396
VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV
Sbjct: 313
VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV 372
Query: 397 NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR
456
NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR
Sbjct: 373
NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR 432
Query: 457
FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP 511
FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP
Sbjct: 433
FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP 487
Y17G9B.3; CE24183. remove cyan seq
MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKP 50
DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL
100
NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE
150
QSKILVQKMCSLGAEEEVDVLSVITLCTLDIICETSMGKAIGAQLAENNE
200
YVWAVHTINKLISKRTNNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGR 250
THEKCLRILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQM
300
DETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMG
350
DDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGVNIPK
400
GVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSR
450
NCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHM
500
KLTRRRPIVSP
511
with cyan seq removed we
get
>CYP31A3 T16C6 486 aa
this sequence revised according to Y17G9.contig 61
Length = 488
add the green seq to the model
Score = 2521 (887.4 bits), Expect = 1.9e-265, P = 1.9e-265
Identities = 486/495 (98%), Positives = 487/495 (98%)
Query: 1
MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60
MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI
Sbjct: 1
MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60
Query: 61
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS
Sbjct: 61
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120
Query: 121
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD
Sbjct: 121
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180
Query: 181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYNLTEDGRTHEKCL 240
IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL +TEDGRTHEKCL
Sbjct: 181
IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL--------ITEDGRTHEKCL 232
Query: 241
RILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEG 300
RILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEG
Sbjct: 233
RILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEG 292
Query: 301
HDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRL 360
HDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRL
Sbjct: 293
HDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRL 352
Query: 361
FPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIA 420
FPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIA
Sbjct: 353
FPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIA 412
Query: 421
RKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVT 480
RKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVT
Sbjct: 413
RKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVT 472
Query: 481 PIHMKLTRRRPIVSP 495
PIHMKLTRRRPIVSP
Sbjct: 473 PIHMKLTRRRPIVSP 487
Compare to 31A2
>CYP31A2 F22B3 Z68336
487 aa
Length = 488
Score = 2517 (886.0 bits), Expect = 5.1e-265, P = 5.1e-265
Identities = 478/488 (97%), Positives = 484/488 (99%)
Query: 1
MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60
MGVIIPAVLLA AT+IAWLLYKHLRMRQ LKHLNQPRSYPIVGHGL+TKPDPEGFMNQVI
Sbjct: 1
MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60
Query: 61
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS
Sbjct: 61
GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120
Query: 121
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180
QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQK+C LGA+EEVDVLSVITLCTLD
Sbjct: 121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEVDVLSVITLCTLD
180
Query: 181
IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240
IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK
Sbjct: 181
IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240
Query: 241
KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL 300
KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL
Sbjct: 241
KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL 300
Query: 301
MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT 360
MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT
Sbjct: 301
MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT 360
Query: 361 RELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIP 420
RELSDDQVIGGVNIPKGVTFLLNLYLVHRDP+QWKDPDVFDPDRFLPENSI RKSFAFIP
Sbjct: 361
RELSDDQVIGGVNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFLPENSIGRKSFAFIP 420
Query: 421
FSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTR 480
FSAGSRNCIGQRFALMEEKVIMAHLLRNFN+KAVELMHEVRPKMEIIVRPVTPIHMKLTR
Sbjct: 421
FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 480
Query: 481 RRPIVSP* 488
RRPIVSP*
Sbjct: 481 RRPIVSP* 488
I had the ends of my genes
mixed up. The pseudogene should be
downstream of CYP31A3
Not in the last intron.
>CYP31A3 T16C6 486 aa
this sequence revised according to Y17G9.contig 61
Length = 488
Plus Strand HSPs:
Score = 551 (194.0 bits), Expect = 1.9e-69, Sum P(2) =
1.9e-69
Identities = 107/113 (94%), Positives = 111/113 (98%), Frame
= +3
Query: 13203
VPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKS 13382
VPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKS
Sbjct: 356
VPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKS 415
Query: 13383
FAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVKH*KLKLI
13541
FAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEV+
K+++I
Sbjct: 416 FAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRP-KMEII
467
Score = 146 (51.4 bits),
Expect = 1.9e-69, Sum P(2) = 1.9e-69
Identities = 29/30 (96%), Positives = 30/30 (100%), Frame =
+1
Query: 13567
QVRPKMEIIVRPVTPIHMKLTRRRPIVSP* 13656 end of CYP31A3
+VRPKMEIIVRPVTPIHMKLTRRRPIVSP*
Sbjct: 459
EVRPKMEIIVRPVTPIHMKLTRRRPIVSP* 488
Score = 165 (58.1 bits),
Expect = 2.3e-11, P = 2.3e-11
Identities = 40/63 (63%), Positives = 44/63 (69%), Frame = +2
This is the pseudogene
CYP31A4P
Query: 15233 MTHLLRNFSVKDVELVHEMIFESL*IKFLIKLVQVRPKMEIIVCPVSPIHMKLTRRRPII
15412
M HLLRNF+VK VEL+HE
VRPKMEIIV PV+PIHMKLTRRRPI+
Sbjct: 442 MAHLLRNFNVKAVELMHE----------------VRPKMEIIVRPVTPIHMKLTRRRPIV
485
Query: 15413 SP* 15421
SP*
Sbjct: 486 SP* 488
AC006720
19620 bp
DNA
linear INV
07-JUL-2003
DEFINITION Caenorhabditis elegans cosmid Y17G9B,
complete sequence.
13201 ctgttccaat tattacaaga gaattgagtg
atgatcaagt gattggaggt gttaatattc
13261 caaaaggagt cacttttctg ctcaacctgt accttgttca
tcgtgatcct tcgcaatgga
13321 aggatcctga tgtattcgat ccggatcgtt
tccttcctga aaattcaatt gctcgcaagt
13381 ctttcgcctt cattccattc tcagctggaa
gtcgtaactg cattggccaa cgttttgcac
13441 tcatggagga gaaggtcatc atggctcatt
tgttgagaaa tttcaatgta aaagctgtcg
13501
agttgatgca tgaggtgaaa cattgaaaat taaaattgat aaattaaatt ctaactgaaa
13561 attgttcagg ttcgtccgaa aatggagatt
atcgttcgcc cagtcactcc aattcacatg
13621 aagcttacca gacgccgtcc aatcgtctct
ccataatttc aaaatagcca attttattct
13681 tgatgattat atatttttaa tgcatgataa
attttaattt tcatacttta ccttgcaaat
13741 gtgatgaaca aattcttatt tcggggagtg
aactattcct ggaacaaaaa cgatttgatt
13801 ttttaagctc tataggctct acactaaatt
ctgtgaatac gtactgcaaa tatatcaaaa
13861 ctaaattttt ctcattgcaa aaagtctgaa
tttattgatt tttgaatgaa gttatagcta
13921 gtaatattga ttctcatgga acagctcata
ttgacaaatg tgacaaattt tgatacggta
13981 tttttaaagg cgcagcaaaa aattttctct
atggtcccgc aacgtacaca gtttcactag
14041 attttgacat gccgctttaa aggcacgtgg
atttatttaa atgggtaaca cggcccggca
14101 agtggtacat ccatgcaaat gcgctctact
gataatttga gtgtagacca ggtttgggcg
14161 cgtgataacg aaaaaagctt tggtccaaaa
aatttagaat ttagtttcgg acatttttta
14221 tatgcatcac aaaaaaactg gaccaaccgt
ttttgagata cacgcgccca aacgtccagg
14281 tatacggtag acaaattgcg tacaggtacc
acttctcggg ccgtgggtaa tatgggaaca
14341 aacaattcta agaatgcgta ctgggcgaca
tatttggcgc gcaaaatatc ttgtagcgaa
14401 aactacagta attcttttaa tgaccactgt
agctcttgtg tcgatttacg ggcccacttt
14461 cgaaatgagt ttctttttga atagtgataa
caaatttctc attttccttc gttattttgt
14521 attattttct cacgttttgc ttaattttaa
tattccataa atggttttaa ttcattcaga
14581 aattcaggcc cgtaaatcga caaaagagct
acagtagtca tctatagaat tactgtattt
14641 ttcgctccga gatattttgc gcgtcaaata
tattgcgtag tacgcattct cagaaattaa
14701 tgttcccgta ataccaaagt catacaggaa
aagtgaattt ttcgaaattc aaaatttgaa
14761 ccgttgatat tttcttgttg atgttttgca
aacaacactc cagcgcttga gcatattgcc
14821 atcaaacacc aatgagaacg aaaaaggttg
aatcgaacaa ttcgaagaat tttctatttt
14881 aagagaaaag atctctaacc agtgccttct cgctccatta
tcctttggaa catgtgcgtg
14941 tctctgcacg ctttagacaa agaaaaatat
gtataatatc tctaagatga gaattagaga
15001 attggggaag gggggtcaat tggatcggac
cacccaatct tcccatctgc tcattctcca
15061 tttgtgacca tggacaacgg tgtgtctatt
ccattccaac tgctcaaatc acctcaaaaa
15121 tcgattagtt atgttcttca aagaatgact
atcggagatc tgtaagtaaa atgtgaccaa
15181 atataattat ttacagacct gcaagtggaa
ctattcgcta gacattcact agatgactca
15241 tttgttgaga aatttcagtg taaaagatgt
tgagttggtg catgaaatga tatttgaaag
15301 tttataaatt aaattcttaa taaaacttgt
tcaggttcgt ccaaaaatgg aaataatcgt
15361 ttgcccggtc tctccaatcc acatgaagct
caccagacgc cgtccaatca tatctccatg
15421 atttgaaatt aatttgaaat ttcgaacgat
tttatatttt tattgcttga taaatttgag
15481 ttttctgtaa aaagtttgaa acttcaacct
tgaaattccc gttactggtc gagtccaaat
Y80D3A.5
>CYP42A1 CM08B12 M89401, AL020988 Y80D3 contig 01341 and
00427 504 aa
Length = 505
Score = 2290 (806.1 bits), Expect = 5.8e-241, P = 5.8e-241
Identities = 440/450 (97%), Positives = 440/450 (97%)
Query: 1
MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS 60
MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS
Sbjct: 1
MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS 60
Query: 61 QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL
120
QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL
Sbjct: 61
QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL 120
Query: 121 LIS-KPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT
179
LIS KPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT
Sbjct: 121 LISRKPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT
180
Query: 180
LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE 239
LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE
Sbjct: 181
LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE 240
Query: 240 HDECVK--ILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGE 297
HDECVK ILHEFTSKAIYARK QLLAQETAEGRRRMAFLDLMLDMNSKGE
Sbjct: 241 HDECVKVIILHEFTSKAIYARK----------QLLAQETAEGRRRMAFLDLMLDMNSKGE 290
Query: 298
LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE 357
LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE
Sbjct: 291
LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE 350
Query: 358
DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW 417
DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW
Sbjct: 351
DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW 410
Query: 418
DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 450
DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI
Sbjct: 411
DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 443
Y80D3A.5; CE23108.
MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFH 50
FSPEEFFEQSQGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSP
100
KMLNKPFLYGFLSAWIGDGLLISKPDKWRPRRKLLTPTFHYDILKDFVEV
150
YNRHGRTLLSKFEAQAGTGEYSDVFHTITLCTLDVICEAALGTSINAQKD
200
PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVK ILHEF 250
TSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM
300
EGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEA
350
DRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSG
400
TAVVMVPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCI
450
ANIDNMGVKLTEIT
end wrong
>gi|25815101|emb|AL132853.2|CEY80D3A Download subject sequence spanning the
HSP Caenorhabditis elegans YAC Y80D3A, complete sequence
Length =
102495
The genomic seq supports
the yellow regions for Y80D3A.5 so my seq is wrong there
Score = 102 bits
(253), Expect = 7e-23
Identities = 45/47 (95%), Positives = 47/47 (100%)
Frame = +1
Query: 1
PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVKIL 47
PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVK++
Sbjct: 58066
PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVKVI 58206
Score = 109 bits
(272), Expect = 5e-25
Identities = 55/56 (98%), Positives = 56/56 (100%)
Frame = +3
Query: 45
KILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM 100
+ILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM
Sbjct: 58620
QILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM 58787
>gi|275296|gb|M89401.1|M89401 UniGene info CEL08B12 Chris Martin
sorted cDNA library Caenorhabditis elegans
cDNA
clone cm08b12 5' similar to cytochrome P450
homologous peptide.
Length = 528
Score = 26.6 bits (57), Expect(2) = 2e-11
Identities = 12/12 (100%), Positives = 12/12 (100%)
Frame = +2
Query: 60 AKVDAAGGVEQL 71
AKVDAAGGVEQL
Sbjct: 2 AKVDAAGGVEQL 37
Score = 56.2 bits (134), Expect(2) = 2e-11
Identities = 27/28 (96%), Positives = 27/28 (96%)
Frame = +3
Query: 73 AQETAEGRRRMAFLDLMLDMNSKGELPM 100
AQETA
GRRRMAFLDLMLDMNSKGELPM
Sbjct: 42 AQETAXGRRRMAFLDLMLDMNSKGELPM 125
>gi|25815101|emb|AL132853.2|CEY80D3A Download subject sequence spanning the
HSP Caenorhabditis elegans YAC Y80D3A, complete sequence
Length =
102495
Score = 226 bits
(577), Expect = 5e-60
Identities = 107/107 (100%), Positives = 107/107 (100%)
Frame = +3
Query: 1
VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM 60
VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM
Sbjct: 60489
VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM 60668
Query: 61 VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE
107
VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE
Sbjct: 60669 VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE
60809
Score = 119 bits
(299), Expect = 9e-28
Identities = 60/61 (98%), Positives = 60/61 (98%)
Frame = +3
Query: 106
GERFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSI 165
G RFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSI
Sbjct: 61968 GMRFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSI
62147
Query: 166 Y 166
Y
Sbjct: 62148 Y 62150
>gi|30733169|gb|CB391459.1|CB391459 UniGene info OSTF152C3_1 AD-wrmcDNA
Caenorhabditis elegans cDNA.
Length = 153
Score = 99.4 bits (246), Expect = 4e-22
Identities = 50/50 (100%), Positives = 50/50 (100%)
Frame = +3
cDNAs for the end of the
protein
Query: 117
ILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY 166
ILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY
Sbjct: 3
ILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY 152
>gi|30733181|gb|CB391471.1|CB391471 UniGene info OSTF152C3_2 AD-wrmcDNA
Caenorhabditis elegans cDNA.
Length = 142
Score = 89.7 bits (221), Expect = 3e-19
Identities = 45/46 (97%), Positives = 46/46 (100%)
Frame = +3
Query: 108 RFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELK
153
RFAMM+EKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELK
Sbjct: 3
RFAMMDEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELK 140
>CYP42A1 CM08B12
M89401, AL020988 Y80D3 contig 01341 and 00427 504 aa
MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQ
SQGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSA
WIGDGLLISRKPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGT
GEYSDVFHTITLCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHY
FSDTIFNLIGPGKEHDECVKVIILHEFTSKAIYARKQLLAQETAEGRRRMAFLDLML
DMNSKGELPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDE
VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTA
VVMVPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIG
(1)
RFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY*
>gi|22417578|emb|CAAC01000109.1| Download subject sequence spanning the
HSP Caenorhabditis
briggsae contig cb25.fpc4175 from assembly cb25.agp8,
whole genome shotgun sequence
Length =
116228
Score = 223 bits
(567), Expect = 5e-57
Identities = 103/107 (96%), Positives = 107/107 (100%)
Frame = +2
Query: 1
VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM 60
VLGEADRP+SYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQ+RGHTLPSGTAVVM
Sbjct: 66992
VLGEADRPISYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQIRGHTLPSGTAVVM 67171
Query: 61
VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE 107
VPSMVHKDPRYW+DPEIFNPERFI+GELKHPYAYIPFSAGSRNCIGE
Sbjct: 67172
VPSMVHKDPRYWEDPEIFNPERFISGELKHPYAYIPFSAGSRNCIGE 67312
Score = 110 bits
(275), Expect = 4e-23
Identities = 55/66 (83%), Positives = 60/66 (90%)
Frame = +2
Query: 97
FSAGSRNCIGERFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEK 156
FS + N G
RFAMMEEKCILAI+LKNLKVKAKLRTD+MRVAAELIIRPL+GNELKFE+
Sbjct: 68207
FSPKTSNFPGMRFAMMEEKCILAILLKNLKVKAKLRTDQMRVAAELIIRPLFGNELKFER 68386
Query: 157 REFGDY 162
REFGDY
Sbjct: 68387 REFGDY 68404
CYP25A4
C36A4.6; CE03074. use this seq
MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNI 50
IKDRVSRLGYNDTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIF
100
IQNFSNFSDRMTPDIFGMNQLNQSLLQNTYATGWKHTRSAIAPIFSTGKM
150
KAMHETLVSKIDIFLEVLKEKSSSGQKWDIFENFQSLSLDIIGKCAFAID
200
SNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELSWLWRFLYKFS
250
DLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV
300
IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGLN
350
YDSIHNMKYLDCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLK
400
VQPYTIHRNPANWESPDEFQPERFENWEEKSSSLKWIPFGVGPRYCVGMR
450
FAEMEFKTTIAKLLDTFELSLVPGDPPMIPETNGVIFRPRSPVRLNLKLR
500
I
>CYP25A4 C36A4.6 Z66495 495 aa
Length = 496
Score = 2587 (910.7 bits), Expect = 1.9e-272, P = 1.9e-272
Identities = 493/501 (98%), Positives = 493/501 (98%)
Query: 1
MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRVSRLGY 60
MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRV
Sbjct: 1
MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRV----- 55
Query: 61 NDTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIFGMNQ
120
DTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIFGMNQ
Sbjct: 56 -DTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIFGMNQ
114
Query: 121
LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDI 180
LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDI
Sbjct: 115 LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDI
174
Query: 181
FENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELS 240
FENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELS
Sbjct: 175
FENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELS 234
Query: 241
WLWRFLYKFSDLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV 300
WLWRFLYKFSDLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV
Sbjct: 235
WLWRFLYKFSDLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV 294
Query: 301
IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGLNYDSIHNMKYL 360
IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGL DSIHNMKYL
Sbjct: 295
IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGLYNDSIHNMKYL 354
Query: 361 DCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQ
420
DCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQ
Sbjct: 355
DCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQ 414
Query: 421
PERFENWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIP 480
PERFENWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIP
Sbjct: 415
PERFENWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIP 474
Query: 481 ETNGVIFRPRSPVRLNLKLRI 501
ETNGVIFRPRSPVRLNLKLRI
Sbjct: 475 ETNGVIFRPRSPVRLNLKLRI 495
CYP25A3
added 7 aa to the seq
>CYP25A3 C36A4.3 Z66495 496 aa
MAFLILTSILVSLVSFIIYVILARKERFRLRGKIGLSGPEPHWLMGNLKQIIER
KAKLGYDDSYDWYNKLHKQFGETFGIYFGTQLNINITNEEDIKEVFIKNFSNFSDRTPPPI
IEDNKLKESLLQNTYESGWKHTRSAIAPIFSTGKMKAMHETIHSKVDLFLEIL
KEKASSGQKWDIYDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFITN
IDIRHSKIISTSFLFPELSKLWKVLYRFTDLAKAEIPLVEGLADVYERRRGGEG
SDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLS
KYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFSNR
RCLKDITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLK
WIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELKQFEGEADLIPDCNGVIMRPKD
PVRLHLKPRN*
CYP34A3
C41G6.1; CE15699.
MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGG 50
IVAGFQVFKKQYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFT
100
TGVMNYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEY
150
RYR LLKNSR incorrect seq here
KNGVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMK
200
VYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPVEYIYSLVQKEIQ
250
ERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAIDLF
300
DLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDR
350
ARTPYLMAVLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLS
400
MIHTDEELFEDHTRFDPERFIENSTLEKKLIPFNLGKRSCPGESLARAEL
450
YLIIGNLVLDFDFEAVGIKPEIKTSTPFGIMKRPPNYISD 490
>CYP34A3 C41G6.1 Z81047 492 aa also
T13C10 Z81591
Length = 493
Score = 2526 (889.2 bits), Expect = 5.7e-266, P = 5.7e-266
Identities = 490/492 (99%), Positives = 490/492 (99%)
Query: 1
MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIVAGFQVFKK 60
MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIVAGFQVFKK
Sbjct: 1
MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIVAGFQVFKK 60
Query: 61
QYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVMNYIREGRGIIGSNGAF 120
QYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVMNYIREGRGIIGSNGAF
Sbjct: 61
QYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVMNYIREGRGIIGSNGAF 120
Query: 121
WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRY--RLLKNSRKNGVIEVNAATMFDLLVGS 178
WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRY RLLKNSRKNGVIEVNAATMFDLLVGS
Sbjct: 121 WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYNPRLLKNSRKNGVIEVNAATMFDLLVGS
180
Query: 179
IINRMLVSKRFEQGNLEFETMKVYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPV 238
IINRMLVSKRFEQGNLEFETMKVYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPV
Sbjct: 181
IINRMLVSKRFEQGNLEFETMKVYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPV 240
Query: 239
EYIYSLVQKEIQERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAID 298
EYIYSLVQKEIQERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAID
Sbjct: 241
EYIYSLVQKEIQERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAID 300
Query: 299
LFDLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMA 358
LFDLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMA
Sbjct: 301
LFDLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMA 360
Query: 359 VLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPE
418
VLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPE
Sbjct: 361
VLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPE 420
Query: 419
RFIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKTSTPF 478
RFIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKTSTPF
Sbjct: 421
RFIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKTSTPF 480
Query: 479 GIMKRPPNYISD 490
GIMKRPPNYISD
Sbjct: 481 GIMKRPPNYISD 492
>CYP34A3 C41G6.1 Z81047 494 aa also T13C10 Z81591
This gene has a frameshift IQDFSKTHG revised 3/20/04
MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIV
AGFQVFKKQYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVM
NYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYR (2)
IQDFSKTHG
(frameshift)
KNGVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMKVYLTKA
LEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPVEYIYSLVQKEIQERTTSIENG
SHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAIDLFDLWLAGQDTSSTT
LLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMAVLNEIQR
IASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPER
FIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKT
STPFGIMKRPPNYISD*
compare to CYP34A5 B0213.10
>gi|17557253|ref|NM_071698.1| LocusLink infoUniGene info
Caenorhabditis elegans cytochrome p450 2c2 family member (5E430),
mRNA
Length =
1500
Score = 192 bits
(488), Expect = 4e-50
Identities = 91/105 (86%), Positives = 101/105 (96%)
Frame = +1
Query: 1
NYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYRIQDFSKTHGKN 60
+YIREGRGI+GSNG
FWQEHRRFALTTLRNFG+G+NIMED+IMDEYRYRIQDFSKTHGKN
Sbjct: 313
DYIREGRGIVGSNGDFWQEHRRFALTTLRNFGVGRNIMEDKIMDEYRYRIQDFSKTHGKN 492
Query: 61
GVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMKVYLTKA 105
GVIEVNA TMFDLLVGSIINRMLVS+RFEQG+ +FE +K+YLTKA
Sbjct: 493
GVIEVNATTMFDLLVGSIINRMLVSERFEQGDQDFEKLKMYLTKA 627
>gi|6435485|emb|Z81047.2|CEC41G6 Download subject sequence spanning the
HSP Caenorhabditis elegans cosmid C41G6, complete sequence
Length =
31437
Score = 74.7
bits (182), Expect = 1e-14
Identities =
36/41 (87%), Positives = 37/41 (90%), Gaps = 1/41 (2%)
Frame = +3
Query: 1 WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYR-IQDFSKT
40
WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYR
+ F KT
Sbjct: 21507 WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYRCFKIFQKT
21629
Score = 28.5 bits (62), Expect(2) = 4e-07
Identities =
11/12 (91%), Positives = 12/12 (100%)
Frame = +1
Query: 32 YRIQDFSKTHGK 43
+RIQDFSKTHGK
Sbjct: 21844 FRIQDFSKTHGK 21879
Score = 40.8
bits (94), Expect(2) = 4e-07
Identities =
20/23 (86%), Positives = 20/23 (86%)
Frame = +3
Query: 39 KTHGKNGVIEVNAATMFDLLVGS 61
K KNGVIEVNAATMFDLLVGS
Sbjct: 21864 KNSRKNGVIEVNAATMFDLLVGS 21932
CYP33C1
>CYP33C1 C45H4.2 502 aa
Length = 503
Score = 2603 (916.3 bits), Expect = 3.9e-274, P = 3.9e-274
Identities = 496/502 (98%), Positives = 497/502 (99%)
Query: 1
MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFERWTKKYGD 60
MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFERWTKKYGD
Sbjct: 1
MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFERWTKKYGD 60
Query: 61
VYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSRGGEYGIIDSNGAMWRE 120
VYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSRGGEYGIIDSNGAMWRE
Sbjct: 61
VYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSRGGEYGIIDSNGAMWRE 120
Query: 121
HRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGSETDVPEVFDHAVANVVNQLL 180
HRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGSETDVPEVFDHAVANVVNQLL
Sbjct: 121
HRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGSETDVPEVFDHAVANVVNQLL 180
Query: 181
FGYRFMGPKENEYQELKHIIDSPAEIFGKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDT 240
FGYRFMGPKENEYQELKHIIDSPAEIFGKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDT
Sbjct: 181
FGYRFMGPKENEYQELKHIIDSPAEIFGKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDT 240
Query: 241
TLAFFNKQIEAHRHRIDFEDLNSESTDFVETFLKEQKRRESE-----ANSKNLNFSNIQL 295
TLAFFNKQIEAHRHRIDFEDLNSESTDFVETFLKEQKRRESE
ANSKNLNFSNIQL
Sbjct: 241
TLAFFNKQIEAHRHRIDFEDLNSESTDFVETFLKEQKRRESEGDSETANSKNLNFSNIQL 300
Query: 296 LNVCIDLWFAGLNTTTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPY
355
LNVCIDLWFAGLNTTTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPY
Sbjct: 301
LNVCIDLWFAGLNTTTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPY 360
Query: 356
FNASINESQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF 415
FNASINESQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF
Sbjct: 361
FNASINESQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF 420
Query: 416
NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYKIHPSKEGLPS 475
NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRY+IHPSKEGLPS
Sbjct: 421
NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYQIHPSKEGLPS 480
Query: 476 MAKGSGPVVAPRLFTAILTRRF 497
MAKGSGPVVAPRLFTAILTRRF
Sbjct: 481 MAKGSGPVVAPRLFTAILTRRF 502
compare to cDNA
>gi|30738516|gb|CB396805.1|CB396805 UniGene info OSTR179D10_1 AD-wrmcDNA
Caenorhabditis elegans cDNA.
Length = 592
Score = 85.5 bits (210), Expect = 4e-18
Identities = 43/53 (81%), Positives = 45/53 (84%)
Frame = -3
remove the ANSKNLN fragment Keep GDSET
Query: 1 NSESTDFVETFLKEQKRRESEGDSETANSKNLNFSNIQLLNVCIDLWFAGLNT
53
NSESTDFVETFLKEQKRRESEGDSET FSN+QL NVC+DLWFAGLNT
Sbjct: 320
NSESTDFVETFLKEQKRRESEGDSET-------FSNVQLSNVCMDLWFAGLNT 183
>CYP33C1 C45H4.2 502 aa revised 3/20/04 small deletion
MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFER
WTKKYGDVYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSR
GGEYGIIDSNGAMWREHRRFALSTMRDFGLGKNLMQENILMEVQDVFARLD
AKLGSETDVPEVFDHAVANVVNQLLFGYRFMGPKENEYQELKHIIDSPAEIF
GKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDTTLAFFNKQIEAHRHRIDFEDL
NSESTDFVETFLKEQKRRESEGDSETFSNIQLLNVCIDLWFAGLNT
TTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPYFNASINE
SQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF
NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYQ IHPS
KEGLPSMAKGSGPVVAPRLFTAILTRRF*
CYP33E1
>CYP33E1 C49C8.4 U61945 494 aa
Length = 495
Score = 2605 (917.0 bits), Expect = 2.4e-274, P = 2.4e-274
Identities = 493/494 (99%), Positives = 493/494 (99%)
Query: 1
MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYEKLKDKYG 60
MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYEKLKDKYG
Sbjct: 1
MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYEKLKDKYG 60
Query: 61
PVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEMRQGPYGIIESHGDRWI 120
PVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEMRQGPYGIIESHGDRWI
Sbjct: 61
PVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEMRQGPYGIIESHGDRWI 120
Query: 121
QQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRKSMEDVDMQNIFDASVGSVINNML 180
QQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRKSMEDVDMQNIFDASVGSVINNML
Sbjct: 121
QQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRKSMEDVDMQNIFDASVGSVINNML 180
Query: 181
FGYRYDETNIEEFLELKNRMNKHFKLAAEPMGGLIGMNPWLGHLPFFKGYKNVIMHNWMG 240
FGYRYDETNIEEFLELKNRMNKHFKLAAEPMGGLIGMNPWLGHLPFFKGYKNVIMHNWMG
Sbjct: 181
FGYRYDETNIEEFLELKNRMNKHFKLAAEPMGGLIGMNPWLGHLPFFKGYKNVIMHNWMG 240
Query: 241
LMEMFRKQATDRLASIDYDSDEYSDYVEAFLKERKKHENEQDFGGFEMEQLDSVCFDLWV 300
LMEMFRKQATDRLASIDYDSDEYSDYVEAFLKERKKHENEQDFGGF MEQLDSVCFDLWV
Sbjct: 241
LMEMFRKQATDRLASIDYDSDEYSDYVEAFLKERKKHENEQDFGGFRMEQLDSVCFDLWV 300
Query: 301
AGMETTSNTLNWALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQ 360
AGMETTSNTLNWALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQ
Sbjct: 301
AGMETTSNTLNWALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQ 360
Query: 361
RLANLLPMNLSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESD 420
RLANLLPMNLSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESD
Sbjct: 361 RLANLLPMNLSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESD
420
Query: 421
GSLKKVEELVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVT 480
GSLKKVEELVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVT
Sbjct: 421
GSLKKVEELVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVT 480
Query: 481 MKAKNYRVSMKERY 494
MKAKNYRVSMKERY
Sbjct: 481 MKAKNYRVSMKERY 494
E is created at intron
joint
CYP35A4
>CYP35A4 C49G7 494 aa
Revised 3/19/04
Length = 494
Score = 2535 (892.4 bits), Expect = 6.3e-267, P = 6.3e-267
Identities = 492/494 (99%), Positives = 492/494 (99%)
Query: 1
MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVSTLDLFRK 60
MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVSTLDLFRK
Sbjct: 1 MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVSTLDLFRK
60
Query: 61
RYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQDIRKDMGIMVTNGDH 120
RYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQDIR MGIMVTNGDH
Sbjct: 61
RYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQDIRSIMGIMVTNGDH 120
Query: 121
WQHMRRFSLQTFRNMGVGKDIMETKIMQELDARCAEIDTSAMNGVTVTQASEFFELTVGS 180
WQHMRRFSLQTFRNMGVGKDIMETKIMQELDARCAEIDTSAMNGVTVTQASEFFELTVGS
Sbjct: 121
WQHMRRFSLQTFRNMGVGKDIMETKIMQELDARCAEIDTSAMNGVTVTQASEFFELTVGS 180
Query: 181
IINSILVGKRFDATTKHEFLKIIETMDASIETASFFDMMVPVWILKTFFKHRYDNLSDAF 240
IINSILVGKRFDATTKHEFLKIIETMDASIETASFFDMMVPVWILKTFFKHRYDNLSDAF
Sbjct: 181
IINSILVGKRFDATTKHEFLKIIETMDASIETASFFDMMVPVWILKTFFKHRYDNLSDAF 240
Query: 241 EVSKAFSAAEAIKRVDQIKSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIG
300
EVSKAFSAAEAIKRVDQIKSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIG
Sbjct: 241
EVSKAFSAAEAIKRVDQIKSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIG 300
Query: 301
DLWITGQETTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM 360
DLWITGQETTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM
Sbjct: 301
DLWITGQETTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM 360
Query: 361
IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFENPLKFDPER 420
IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFENPLKFDPER
Sbjct: 361
IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFENPLKFDPER 420
Query: 421
YIRDENLLQKVIPFGVGKRSCLGESLARAELYLIFGNLLLRYKFESSSKLSTTELAPYSI 480
YIRDENLLQKVIPFGVGKRSCLGESLARAELYLIFGNLLLRYKFESSSKLSTTELAPYSI
Sbjct: 421
YIRDENLLQKVIPFGVGKRSCLGESLARAELYLIFGNLLLRYKFESSSKLSTTELAPYSI 480
Query: 481 GKRPFKLEMKFVKI 494
GKRPFKLEMKFVKI
Sbjct: 481 GKRPFKLEMKFVKI 494
KD forms at intron joint
CYP33C9
>CYP33C9 C50H11 495 aa
Length = 497
Score = 2609 (918.4 bits), Expect = 9.1e-275, P = 9.1e-275
Identities = 492/496 (99%), Positives = 493/496 (99%)
Query: 1
MVLNIILVVVVAFLFHHLYWKRRNWPAGPTPLPLIGNLLSLRNPAPGYKAFARWTAKYGD
60
MVLNIILVVVVAFLFHHLYWKRRN+ GPTPLPLIGNLLSLRNPAPGYKAFARWTAKYGD
Sbjct: 1
MVLNIILVVVVAFLFHHLYWKRRNFFPGPTPLPLIGNLLSLRNPAPGYKAFARWTAKYGD
60
Query: 61
IYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESFRGGSYGVVETNGPFWRE 120
IYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESFRGGSYGVVETNGPFWRE
Sbjct: 61
IYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESFRGGSYGVVETNGPFWRE 120
Query: 121
HRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDKTIGEGVDLTDIFDRAVGNVINQML 180
HRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDKTIGEGVDLTDIFDRAVGNVINQML
Sbjct: 121
HRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDKTIGEGVDLTDIFDRAVGNVINQML 180
Query: 181
FGYRFDETRADEFRTIRAFFNFNSGEFASFSMRVQFFLPWMGYIMPGPTILDRFKKYQKG 240
FGYRFDETRADEFRTIRAFFNFNSGEFASFSMRVQFFLPWMGYIMPGPTILDRFKKYQKG
Sbjct: 181 FGYRFDETRADEFRTIRAFFNFNSGEFASFSMRVQFFLPWMGYIMPGPTILDRFKKYQKG
240
Query: 241
FTEFFGTQIENHKKEIDFELEENSDYVEAFLKEQRKREASGDFESFSTKQLSNMCLDLWF 300
FTEFFGTQIENHKKEIDFELEENSDYVEAFLKEQRKREASGDFES STKQLSNMCLDLWF
Sbjct: 241
FTEFFGTQIENHKKEIDFELEENSDYVEAFLKEQRKREASGDFESSSTKQLSNMCLDLWF 300
Query: 301
AALMTTSNTMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ 360
AALMTTSNTMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ
Sbjct: 301
AALMTTSNTMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ 360
Query: 361
RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIFKPERFLDDD 420
RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIFKPERFLDDD
Sbjct: 361
RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIFKPERFLDDD 420
Query: 421 GKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVVPDANGPPIIDKAVLGGM
480
GKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVVPDANGPPIIDKAVLGGM
Sbjct: 421
GKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVVPDANGPPIIDKAVLGGM 480
Query: 481 HTKEFKAILQRRHVSE 496
HTKEFKAILQRRHVSE
Sbjct: 481 HTKEFKAILQRRHVSE 496
correct seq at two exon
boundaries
CYP43A1
>CYP43A1 E03E2.1 485 aa
Length = 485
Score = 2143 (754.4 bits), Expect = 9.1e-247, Sum P(2) =
9.1e-247
Identities = 406/406 (100%), Positives = 406/406 (100%)
Query: 101
PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNS 160
PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNS
Sbjct: 80
PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNS 139
Query: 161
LSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTAEKIAYIFPETKLIFKNSFVG 220
LSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTAEKIAYIFPETKLIFKNSFVG
Sbjct: 140
LSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTAEKIAYIFPETKLIFKNSFVG 199
Query: 221 HFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTENDHYSLLGFFFEHHNEKKLIEK
280
HFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTENDHYSLLGFFFEHHNEKKLIEK
Sbjct: 200
HFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTENDHYSLLGFFFEHHNEKKLIEK 259
Query: 281
AEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEA 340
AEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEA
Sbjct: 260
AEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEA 319
Query: 341
EIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV 400
EIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV
Sbjct: 320
EIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV 379
Query: 401
VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTA 460
VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTA
Sbjct: 380
VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTA 439
Query: 461
FRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR 506
FRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR
Sbjct: 440
FRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR 485
Score = 225 (79.2 bits), Expect = 9.1e-247, Sum P(2) =
9.1e-247
Identities = 43/43 (100%), Positives = 43/43 (100%)
Query: 58
MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI 100
MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI
Sbjct: 1
MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI 43
>cb25.fpc3857
ACC=CAAC01000068 CYP43
Length = 474
Score = 1243 (437.6 bits), Expect = 3.8e-193, Sum P(2) =
3.8e-193
Identities = 243/363 (66%), Positives = 290/363 (79%)
Query: 152
SEFMDHVNSLSDGQ-SVVIN---HSHSLFQNH--TSYV-LARKAFGNFAELQKSTAEKIA 204
S +DH +SL S V+ + H QNH +++ +
AFG ++ QKS EKI
Sbjct: 109
SVVIDHSHSLFQNHTSYVLARCAYGHKE-QNHRVNNFLSVFSDAFGALSDFQKSMTEKIT 167
Query: 205
YIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTEND--HY 262
Y FPE K IFKN+
FL+S QQKFLDYLL+LIS FQSR+ ++NNN ++ + Y
Sbjct: 168
YFFPEMKSIFKNNLFAQFLQSTNQQKFLDYLLNLISKFQSRRVIENNNNEESSDPEAGKY 227
Query: 263 SLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTL
322
SLL FFFEHH+EKK++EKAEG+IDMKKVKVEKSISY+EITAQCKFISVAGFDTT+NTLTL
Sbjct: 228
SLLEFFFEHHHEKKIVEKAEGRIDMKKVKVEKSISYQEITAQCKFISVAGFDTTANTLTL 287
Query: 323
LFNFLANNPDVQDKIYEAEIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRIC 382
LFNFLA+NP +Q++IY +EIK ++FETV SL +LQNCIFETLRLFPHASPLQ RIC
Sbjct: 288
LFNFLAHNPAIQEQIYNSEIKGNKSSMNFETVCSLPILQNCIFETLRLFPHASPLQMRIC 347
Query: 383
TEPFKIGKYQFLENVQIVVNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVG 442
T P IG+Y+F EN+Q+V+NPWGPH DR
IWG+DV+CF+PSRFE+L+EQQRKAFMPFGVG
Sbjct: 348
TAPITIGQYKFDENMQVVINPWGPHRDRVIWGDDVNCFKPSRFESLSEQQRKAFMPFGVG 407
Query: 443
PRQCVGMRFALLEMKTTAFRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLV 502
PRQCVGMRFALLE+KTTAFRMLQKY V +
PV DRHGK V MTVRDTGT+WPTDKLGLV
Sbjct: 408
PRQCVGMRFALLELKTTAFRMLQKYVVRSTQPVIDRHGKLVNMTVRDTGTVWPTDKLGLV 467
Query: 503 LKQR 506
L +R
Sbjct: 468 LTKR 471
Score = 618 (217.5 bits), Expect = 3.8e-193, Sum P(2) =
3.8e-193
Identities = 113/134 (84%), Positives = 131/134 (97%)
Query: 58
MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRIPEILSDDPITSDNIHMF 117
MCS++GD+TFS+LRGATPV++T++V+LIHAISTE FDCFHSRIPE LSDDPIT++NIHMF
Sbjct: 1
MCSVVGDKTFSMLRGATPVIVTNDVNLIHAISTESFDCFHSRIPEALSDDPITAENIHMF 60
Query: 118
AAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQ 177
AAKGERWKRLRT+TSYGLSTVKLKLLFPT++TCVSEF+DHVNSLSDGQSVVI+HSHSLFQ
Sbjct: 61
AAKGERWKRLRTITSYGLSTVKLKLLFPTIETCVSEFLDHVNSLSDGQSVVIDHSHSLFQ 120
Query: 178 NHTSYVLARKAFGN 191
NHTSYVLAR A+G+
Sbjct: 121 NHTSYVLARCAYGH 134
>CYP43A1 E03E2.1 485 aa green missed in CE29747
MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI
CICCSILIVHQGSKM
KNYCNTAKILKLCIFYNNYLQ PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGL
STVKLKLLFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQNHTSYVLARKAFG
NFAELQKSTAEKIAYIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKN
VDNNNGICCTENDHYSLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEEITA
QCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEAEIKNQPEQISFETVSSLRLLQ
NCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIVVNPWGPHHDREIWGNDV
DCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTAFRMLQKYSVF
TNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR
E03E2.1; CE29747. extra seq
MILLILLPVAIFTYVFLR
()
NFKLYYTLHKCGLNPRFDIFGIKGLLWLDSSA 50
AHENFTR
MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI 100
PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKL
LFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTA
EKIAYIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKNVDN 250
NNGICCTENDHYSLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEE 300
ITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEAEIKNQPEQIS 350
FETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV 400
VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMR 450
FALLEMKTTAFRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLG 500
LVLKQR
506
TGAATCAACAACTGTACTTGTCTGCAATTTTTTCAGTTCAATGATTCTACTTTTTCTTCT
GCCGACTGCATTATTAATTTATGTTTTAATAAGGTAAAGCTAACTAAATGTCAAGACACG
GCTCTCAGAATAGATTTTCAGAAATCTCAAAATATATTTCACACTCTATAGAAATGGACT
GAACCCCCAATTTGACATATTTGGAATCAAGGGGCTCCGATGGTTGGATAGTTCTAATGC
TCATGAAAACTTTACACAAATGTGTTCTGTTGTTGGTGATAAAACATTTTCTATGCTCCG
CGGTGCAACACCGGTCATTGTGACCAATGATGTTAATTTGATACATGCAATTTCGACGGA
AAGTTTTGATTGTTTCCATTCGCGAATTGTGAGTTTTATTTAGAATAGAAAAACAATACT
GGAAAATTCATTAAATTGGAAAATATATAATTCCAGCCAGAAGCTTTATCTGATGACCCG
ATCACAGCCGAAAATATCCACATGTTTGCTGCCAAAGGTGAGCGCTGGAAACGTCTTCGA
ACAATAACGAGTTATGGATTATCCACAGTTAAGTTAAAATTGGTATGTCCCATGACAGTT
TTGTTGTCCTTTTTATCTAATCATGTTTGTTCAGTTGTTCCCAACCATTGAGACTTGTGT
TTCTGAATTTTTGGATCACGTGAACAGCCTGTCCGATGGTCAAAGTGTTGTCATCGATCA
TTCTCATAGGTTTGTATAATCATCTTCTATTTATAGATTTGACTTTTTTCAGTCTCTTTC
AAAATCATACATCTTACGTTCTAGCTCGATGTGCTTACGGTCACAAAGAGCAAAATCATC
>EMBOSS_001_1
*INNCTCLQFFQFNDSTFSSADCIINLCFNKVKLTKCQDTALRIDFQKSQNIFHTL*KWT
EPPI*HIWNQGAPMVG*F*CS*KLYTNVFCCW**NIFYAPRCNTGHCDQ*C*FDTCNFDG
KF*LFPFANCEFYLE*KNNTGKFIKLENI*FQPEALSDDPITAENIHMFAAKGERWKRLR
TITSYGLSTVKLKLVCPMTVLLSFLSNHVCSVVPNH*DLCF*IFGSREQPVRWSKCCHRS
FS*VCIIIFYL*I*LFSVSFKIIHLTF*LDVLTVTKSKII
>EMBOSS_001_2
ESTTVLVCNFFSSMILLFLLPTALLIYVLIR*S*LNVKTRLSE*IFRNLKIYFTLYRNGL
NPQFDIFGIKGLRWLDSSNAHENFTQMCSVVGDKTFSMLRGATPVIVTNDVNLIHAISTE
SFDCFHSRIVSFI*NRKTILENSLNWKIYNSSQKLYLMTRSQPKISTCLLPKVSAGNVFE
Q*RVMDYPQLS*NWYVP*QFCCPFYLIMFVQLFPTIETCVSEFLDHVNSLSDGQSVVIDH
SHRFV*SSSIYRFDFFQSLSKSYILRSSSMCLRSQRAKS
>EMBOSS_001_3
NQQLYLSAIFSVQ*FYFFFCRLHY*FMF**GKAN*MSRHGSQNRFSEISKYISHSIEMD*
TPNLTYLESRGSDGWIVLMLMKTLHKCVLLLVIKHFLCSAVQHRSL*PMMLI*YMQFRRK
VLIVSIREL*VLFRIEKQYWKIH*IGKYIIPARSFI**PDHSRKYPHVCCQR*ALETSSN
NNELWIIHS*VKIGMSHDSFVVLFI*SCLFSCSQPLRLVFLNFWIT*TACPMVKVLSSII
LIGLYNHLLFIDLTFFSLFQNHTSYVLARCAYGHKEQNH
GAGTCAACAATTTTCTTTCGGTGTTTTCTGATGCGTTCGGGGCATTGTCAGATTTTCAAA
AATCAATGACTGAGAAAATTACTTGTGAGTCACTAAAACATAATTCATCTAATTGAGAAA
TCTATTTGCAGATTTTTTCCCAGAAATGAAGTCAATTTTCAAAAACAACTTATTTGCCCA
ATTTTTACAAAGTACAAATCAACAAAAATTCTTGGATTATTTACTGAATCTGATCTCAAA
ATTTCAATCCCGAAGAGTTATCGAAAACAATAATAATGAAGAATCAAGCGACCCCGAAGC
TGGCAAGTACAGTTTATTGGAGTTTTTCTTCGAACATCATCATGAAAAAAAGATTGTGGA
AAAGGCGGAAGGGCGGATTGACATGAAGAAAGTGAAAGTCGAGAAAAGTATTTCTTATCA
GGTGTGAGTCTCGGTTATCTTGTAGTCTAAAAAACGCAGTACATTAGAGTTTCTGCTGGC
TACAATGTTCATTTGTTTTGCAAATATTGAAAATTGTTTTGCAAAATATTCAATGAAAGA
AAACCGAAACTAATTTATTGCAATACTGGAGAATGTAAACATTTACTATCGTTTGTTTTT
TTCAATAGAAAAATAGACATTGGAAACTTTGGAGTCCCAAAAAGTGATCCAAATATTTTT
GAATATCGGAAAATGTTAGAGCATACTAAACAAAATTGGAGCTTCGGTTTTATCGTCGAG
AACCAAACTCTTATTACAGCTTCAGAAAAGTTAAATAAATCGAATTAGAAAAGGTTTTTT
TGAGTGATACCGGTACACTACTGGACAAAAATTTATTCTCGATTTTTTTAAAAGAATATT
GAGTTGGCAGGTCTAAAAATTATTTTACTGCTCTGAAAACTTTGTTTTTCAATGTAGACA
TAAAAAGAGATTAACTTAGGTAATGAAATTTGTCTTTAGTACATTCTGAAATTTTCAGAC
AAATCTGAAATATTTAGACTA
>CYP37A1 F01D5 Z81493 516 aa also Y39G8
Z92851
Length = 517
Score = 2596 (913.8 bits), Expect = 2.2e-273, P = 2.2e-273
Identities = 503/524 (95%), Positives = 505/524 (96%)
Query: 1
MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGTTFQFKMDPVE 60
MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGTTFQFKMDPVE
Sbjct: 1
MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGTTFQFKMDPVE 60
Query: 61
FALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKVQTRNMSGAQNFHIFQKVLES 120
FALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKVQTRNMSGAQNFHIFQKVLES
Sbjct: 61
FALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKVQTRNMSGAQNFHIFQKVLES 120
Query: 121
NALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLIDFQVVFNSQSMILL 180
NALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLID
++ Q ILL
Sbjct: 121 NALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLIDEFANWDFQ--ILL 178
Query: 181
EQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAF 240
EQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAF
Sbjct: 179
EQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAF 238
Query: 241
KYQRMPWLWIKPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK 300
KYQRMPWLWIKPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK
Sbjct: 239
KYQRMPWLWIKPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK 298
Query: 301 KAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANCPEYQKKCHEE
360
KAFLDMLIE EGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANCPEYQKKCHEE
Sbjct: 299 KAFLDMLIE---EGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANCPEYQKKCHEE
355
Query: 361 LDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMRPSVPQMARSVEEEVEIDGKILPKGCS
420
LDEIF E DLKKMKYLEKCVKEALRMRPSVPQMARSVEEEVEIDGKILPKGCS
Sbjct: 356 LDEIF---GEEFDSFDLKKMKYLEKCVKEALRMRPSVPQMARSVEEEVEIDGKILPKGCS
412
Query: 421
VMISPAFIQNNPRTFPNHEVFDPERFNEDEISKRHAYAYIPFSAGPRNCIGQKFAMQEEK 480
VMISPAFIQNNPRTFPNHEVFDPERFNEDEISKRHAYAYIPFSAGPRNCIGQKFAMQEEK
Sbjct: 413
VMISPAFIQNNPRTFPNHEVFDPERFNEDEISKRHAYAYIPFSAGPRNCIGQKFAMQEEK 472
Query: 481
TVISWVLRRFHIHTDIGLLENMPLPETITRPSLGFPLKFTVRQQ 524
TVISWVLRRFHIHTDIGLLENMPLPETITRPSLGFPLKFTVRQQ
Sbjct: 473
TVISWVLRRFHIHTDIGLLENMPLPETITRPSLGFPLKFTVRQQ 516
compare to EST
>gi|14837585|dbj|AU205355.1|AU205355 UniGene info AU205355 unpublished
oligo-capped cDNA library, stage L4
Caenorhabditis elegans cDNA clone yk852g07 5'.
Length = 603
Score = 183 bits
(465), Expect = 6e-48
Identities = 94/104 (90%), Positives = 95/104 (91%), Gaps =
2/104 (1%)
Frame = +1
Query: 11
QKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLID--EFANW 68
+KVLESNALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLID N
Sbjct: 208
KKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLIDFQVVFNS 387
Query: 69
DFQILLEQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTV 112
ILLEQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTV
Sbjct: 388
QSMILLEQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTV 519
>gi|18321101|dbj|BJ153116.1|BJ153116 BJ153116 unpublished oligo-capped
cDNA library, C. elegans L1 stage
Caenorhabditis elegans cDNA clone yk1315e09 3'.
Length = 725
Score = 243 bits
(620), Expect = 2e-65
Identities = 124/136 (91%), Positives = 124/136 (91%), Gaps =
6/136 (4%)
Frame = -1
Query: 1
KKVIDRKLREHDETDGMVVVEEESKKKAFLDMLIE---EGGLGYEDIREEVDTFMFEGHD 57
KKVIDRKLREHDETDGMVVVEEESKKKAFLDMLIE EGGLGYEDIREEVDTFMFEGHD
Sbjct: 623
KKVIDRKLREHDETDGMVVVEEESKKKAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHD 444
Query: 58
TTSAGIGWSLWCLANCPEYQKKCHEELDEIF---GEEFDSFDLKKMKYLEKCVKEALRMR 114
TTSAGIGWSLWCLANCPEYQKKCHEELDEIF E DLKKMKYLEKCVKEALRMR
Sbjct: 443
TTSAGIGWSLWCLANCPEYQKKCHEELDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMR 264
Query: 115 PSVPQMARSVEEEVEI
130
PSVPQMARSVEEEVEI
Sbjct: 263 PSVPQMARSVEEEVEI
216
This is the correct seq
UNIPROT
MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGT 50
TFQFKMDPVEFALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKV
100
QTRNMSGAQNFHIFQKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWR
150
GRRKMMTPSFHFNVLIDFQVVFNSQSMILLEQIENAAKKTDDSTIDAFPY
200
IKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAFKYQRMPWLWI
250
KPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK
300
KAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANC
350
PEYQKKCHEELDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMRPSVPQM
400
ARSVEEEVEIDGKILPKGCSVMISPAFIQNNPRTFPNHEVFDPERFNEDE
450
ISKRHAYAYIPFSAGPRNCIGQKFAMQEEKTVISWVLRRFHIHTDIGLLE
500
NMPLPETITRPSLGFPLKFTVRQQ
CYP13B1
F02C12.5a; CE09177.
F02C12.5b; CE09178.
F02C12.5c; CE23627.
MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNK 50
PRGYLIHKWTQKFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARK
100
STNHIHGNLECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQVHDV
150
MEDSAINMVDLMAKHEDGKPFNIHAYFQEFTYDVISRLAMGQPNSELFNN
200
SGVEIVKSIFMRTHRVLPWYFTVLFPQFEHLVKRMFYNHAAVQGGDIEKL
250
LLICKKTVESRIQEREENAKLGFENAENDFIDMFLNYYSEQVEDIEFGST
300
VEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEV
350
DTVVGSENVSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGD
400
IYIDKGVKIEADVMSLHRSKEIWGENADDFVPERWLEPSSRHTMSWIPFG
450
AGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEVRLTLRFFFIFKSS
500
QIQKELNILGCTTTSPEAVTLYLKPRI
527
>CYP13B1 F02C12.5 Z54269 (C-terminal) C29F7 Z92827 N-terminal 511 aa
Length = 511
Score = 2512 (884.3 bits), Expect = 7.3e-276, Sum P(2) =
7.3e-276
Identities = 483/491 (98%), Positives = 484/491 (98%)
Query: 1
MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLIHKWT 60
MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLIHKWT
Sbjct: 1
MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLIHKWT 60
Query: 61 QKFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNLECSKSEPRINL
120
Q FGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNLECSKSEPRINL
Sbjct: 61 QVFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNLECSKSEPRINL
120
Query: 121 FTSRGARWKRLRALASPGFSVKALKQV--HDVMEDSAINMVDLMAKHEDGKPFNIHAYFQ 178
FTSRGARWKRLRALASPGFSVKALKQV HDVMEDSAINMVDLMAKHEDGKPFNIH YFQ
Sbjct: 121 FTSRGARWKRLRALASPGFSVKALKQVYVHDVMEDSAINMVDLMAKHEDGKPFNIHRYFQ 180
Query: 179 EFTYDVISRLAMGQPNSELFNNSGVEIVKSIFMRTHRVLPWYFTVLFPQFEHLVKRMFYN
238
EFTYDVISRLAMGQPNSELFNNSGVEIVK IFMRTHRVLPWYFTVLFPQFEHLVKRMFYN
Sbjct: 181 EFTYDVISRLAMGQPNSELFNNSGVEIVK-IFMRTHRVLPWYFTVLFPQFEHLVKRMFYN
239
Query: 239 HAAVQGGDIEKLLLICKKTVESRIQEREENAKLGFENAENDFIDMFLNYYSEQVEDIEFG
298
HAAVQGGDIEKLLLICKKTVESRIQERE NAKLGFENAENDFIDMFLNYYSEQVEDIEFG
Sbjct: 240 HAAVQGGDIEKLLLICKKTVESRIQERE-NAKLGFENAENDFIDMFLNYYSEQVEDIEFG
298
Query: 299
STVEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEVDTVVGSEN 358
STVEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEVDTVVGSEN
Sbjct: 299
STVEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEVDTVVGSEN 358
Query: 359 VSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGDIYIDKGVKIEADVMSLHR
418
VSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGDIYIDKGVKIEADVMSLHR
Sbjct: 359
VSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGDIYIDKGVKIEADVMSLHR 418
Query: 419
SKEIWGENADDFVPERWLEPSSRHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYD 478
SKEIWGENADDFVPERWLEPSSRHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYD
Sbjct: 419
SKEIWGENADDFVPERWLEPSSRHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYD 478
Query: 479 LVAGVETEVRLTL 491
LVAGVETE L +
Sbjct: 479 LVAGVETEKELNI 491
Score = 131 (46.1 bits), Expect = 7.3e-276, Sum P(2) =
7.3e-276
Identities = 27/44 (61%), Positives = 33/44 (75%)
Query: 484 ETEVRLTLRFFFIFKSSQIQKELNILGCTTTSPEAVTLYLKPRI 527
+T + LR + + + +KELNILGCTTTSPEAVTLYLKPRI
Sbjct: 467 KTALAHLLRRYDLVAGVETEKELNILGCTTTSPEAVTLYLKPRI 510
compare to ests
>gi|30744632|gb|CB402905.1|CB402905 UniGene info OSTF221B8_1 AD-wrmcDNA
Caenorhabditis elegans cDNA.
Length = 556
Score = 192 bits
(489), Expect = 4e-50
Identities = 97/100 (97%), Positives = 97/100 (97%)
Frame = +1
Query: 1
ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQVYVHDVMEDSAINMVDLMAKHED 60
ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQV HDVMEDSAINMVDLMAKHED
Sbjct: 262
ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQV--HDVMEDSAINMVDLMAKHED 435
Query: 61
GKPFNIHRYFQEFTYDVISRLAMGQPNSELFNNSGVEIVK 100
GKPFNIH YFQEFTYDVISRLAMGQPNSELFNNSGVEIVK
Sbjct: 436
GKPFNIHAYFQEFTYDVISRLAMGQPNSELFNNSGVEIVK 555
>gi|14853552|dbj|AU215395.1|AU215395 UniGene info AU215395 unpublished
oligo-capped cDNA library, stage L1
Caenorhabditis elegans cDNA clone yk825b05 3'.
Length = 623
Score = 145 bits
(365), Expect = 4e-36
Identities = 70/70 (100%), Positives = 70/70 (100%)
Frame = -3
F02C12.5a; CE09177.
F02C12.5b; CE09178.
F02C12.5c; CE23627.
These sequences fail to remove a short intron see below.
Query: 1
RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCTTTSPE 60
RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCTTTSPE
Sbjct: 291 RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCTTTSPE
112
Query: 61 AVTLYLKPRI 70
AVTLYLKPRI
Sbjct: 111 AVTLYLKPRI 82
CYP14A5
F08F3.7; CE09262. 100%
MSVFIVALSVFIISYVISFYWKVRKYPKGPFPLPFFGNLLQFPADNIQEH 50
LDKLSKTYGPCFTVWTPLPAVVLTDYEHIKEAFVTQGDAFVNRAQRLPEI
100
LFQPHPNTGVVFSSGDNWKIQRRTALKILRDFGLGRNLMEEQVMRSVHEM
150
LAQLEHISDKKNVDMYWPIQLCVGNVINESLFGYHYKYEDAGRFEKFVKV
200
VDRHLKIAQGNASLLVSAFPWLRHLPVIGNLGYHSIKNNIKSYQQFIEEE
250
VTSQLKNYDGESEPENFVHAYMQQMKQTGNPNLDMTNLCASVLDFWLAGM
300
ETASNSLRWHLAFMMKYPEVQDKVRNEIFENIGTARLPSMSDKQNMPYTQ
350
AVIHEVQRCSNMVPILATHMNTEDVLVKEHNIPTGTLLFAQIWSVLKNDP
400
VFEENSKFNPDRYLMPDGKTLNKTVLERTIPFSVGKRNCVGEGLARMELF
450
LIFSALIQKYEFIPKTNVDLKPVYGGVITVKPYLCELVPQNA
CYP35D1
>CYP35D1 F14H3.10 498 aa
Length = 499
Score = 2595 (913.5 bits), Expect = 2.8e-273, P = 2.8e-273
Identities = 498/499 (99%), Positives = 498/499 (99%)
add the extra K
Query: 1
MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYYRK 60
MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYYRK
Sbjct: 1
MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYYRK 60
Query: 61 KYGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGLLIANGEN
120
YGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGLLIANGEN
Sbjct: 61 -YGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGLLIANGEN
119
Query: 121
WAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIVDVKFFDLTVGS 180
WAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIVDVKFFDLTVGS
Sbjct: 120 WAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIVDVKFFDLTVGS
179
Query: 181
VINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFLPSRFKLTWDSRHQ 240
VINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFLPSRFKLTWDSRHQ
Sbjct: 180
VINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFLPSRFKLTWDSRHQ 239
Query: 241
IMDHVMKGVEERIRDIESGAYKIDPKKPNDVVDAFLSKMKKEEEIAGGQHPYYNLKSLKL 300