48
Tetrahymena P450 genes
This file is being updated to complete these genes. Currently 37 are
complete (magenta). There are very few short introns
present.
There are four full length pseudogenes and one solo exon
pseudogene.
Note: this is the first time CYP names have used four
digit numbers for families.
Blast server
including Tetrahymena P450s
sequence
alignment of 47 Tetrahymena P450s
D. Nelson April 23, 2004
>CYP5001A1 8254555 596115 bp 4 contigs 32% to CYP689A1 This gene's N-term
is very
hard to detect. I have assumed one intron, but this may
not be true.
MPIQHNSQPKEVQTCEQMMKQLENNGFIRLKSYRPFQEEDLFKYDDELYSVKQ
FKSRYPNAKGFICYSDSSQKVPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGD
SLLFNLSDIWRMKRKVYGQMFHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGY
NASQYDLQSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNITC
(2)
YSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFL
105041
DQICQDIIIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLN 105217
105218
YLEAVLKETLRHYGPTTTILPKVVMERHSFMNDKVFLEEGTIVSLAIQELNYDPQHYENP 105397
105398 FKFYPERWFDENLKQKEKKIFMPFSYGSHKCIGDKLALPVMKMLYITLNEYFSIDIDPSF
105577
105578 KLKLVALIQYTPSTQIPFLITKKKNIQSNNEVYFS*
105685
ATTCTAGGAGAATGGTCTCTTCATCTATTTTTCGGATATAATGCTAGCTAATATGATTTA
F F
G Y N
A S Q
Y D L
CAATCTAAATTAAAAGCAGCTTATAACAATGCTCGCTCTGCCATACTCCTTTTACAGCCC
Q S K
L K A
A Y N
N A R
S A I
L L L
Q P
P Y
S F Y
S P
CAAAATATAATAAACTGTCTTGACCCTTTAAAGGAAGTCCTCCTTGAATATCTATACATC
Q N I
I N C1 L D P L
K E V0 L L E Y
L Y I
K I
Q Q T V L T L
Q R K
S S L
N I Y
T S
ACTTCTGATTAGATAAAAAGCGATCCTGATTACTAAAGCAAACCTTAAAATAACATCACA
T S D
Q I K
S D P
D Y Q
S K P
Q N N
I T
L L
I R2 Q K
A1 I L I
T K A1 N L K I
T S H
TGGTATAAAATTTAAAAATTAATTAAAAACAATAAAATAAATGTTTTTTTTAGTTAGTTA
W2 Y K I
Q K L
I K N
N K I
N V0 F F
S1 Q2 L
G I
K F K
N Q L
K T I
K Q M
F F L
V S2 Y
TTCATTCATTCATTCATTTTACTTACTAATAATTTTTTTGAAATTTAAAAACAGGTTTTT
F I H
S F I
L L T
N N F
F E I
Q K Q
V0
S F
I H S
F Y L
L I I
F L K
F K N
R2 F L
GCTAAATAGCAAAGAAGTTTAATGGAACAAAGATTTCCTTGATTAAATTTGCTAAGATAT
L N
S2 K E1 V1 Q W
N K D1 F L D Q
I C Q
D I
TATCATCTTTTTCTTCGCTGGAAGAGATACGATGGCTCATACTCTTTAAATGATGTTTTA
F A G
R D T
TTATTTATGTATTTACCCTGAATATAAAAAGAAGATAGACGAAGAAATAAATAGCCTAAA
CGGAGATTATTCTGTTTAAAATATTTCAAATTTAAATTATTTAGAAGCTGTTTTAAAAGA
5 POSSIBLE
INTRON LOCATIONS THAT MAKE SENSE.
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL
QSKLKAAYNNARSAILLLQPQNIIN (1) SILITKANLKITSH
GIKFKNQLKTIKQMFFLVSYSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL
QSKLKAAYNNARSAILLLQPQNIIN (1) SNLKITSH
GIKFKNQLKTIKQMFFLVSYSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
MY CHOICE
BASED ON COMPARISON TO PARAMECIUM P450S
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGY
NASQYDL
QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNITC
(2) YSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL
QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNIT
WYKIQKLIKNNKINVFF (1) KVQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL
QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNIT
WYKIQKLIKNNKINVFF (1) IQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
>CYP5002A1 8254654 510086 bp 17 contigs 30% to CYP696C1
MLTELFYTVILLIGVLFYLTFLRPLYKLLMLKIKHGKEILIKFEPIIGIHAYLKKSEKIHGDPMKLLKLEMAKN
PKLKAVACSLGFGVLITSVDPDMNKSFLIDNYQKSVKLAPIFNYKILKNGIIFSE
GEQWKKQRKMISDSFHFDSKYYFFSQKFNLLTLIMFKKVQNR (1)
VSVDSQLFFTKIKGEIGIRVFFGPDFGDHQIDGKNISEELKE (0)
IVDQTIVYSYTSIFFQLK
RIFLGDELAMKFLTAKEKKLLSRSDRINQIAMEQIEKKIKYYEGQPSVQSDYLIDV
292370 LVRNYLTKQDPNLTKEQ (0)
ICQQCITMFIAALDNTGCIIA 292543
292544
WIIFCLANYKEEGDKVRQEINSVFPTQETLDNIQFDDLKNLNYTSAFIDECFRHYTSSMG 292723
292724 VFLRRVVEPFKSGNFEIQK (1)
GDIIQSLWH
292894
PTQFNPDWFENPDKFDVNRFLNGKEKQGNSYAFTPFSIGPRNCIGKQMSLVEVKVSLIYF 293073
293074 VKKFNCIRNSVPLKTTASIVYKPVDQNLIKISFCL*
293181
AATATTTAACTATAAAATACTTAAAAATGGAATAATCTTTTCAGAAGGAGAACAGTGGAA
W K
GAAATAGCGCAAAATGATATCAGATTCCTTCCATTTCGATAGTAAATACTATTTTTTCAG
K Q R
K M I
S D S
F H F
D S1 K Y
Y F F
S1
TCAAAAATTTAATTTATTAACTCTCATTATGTTCAAAAAGGTTTAAAATCGAGGTTTTAA
Q K
F N L
L T L
I M2 F K
K V0 Q N
R G1 F K2
GTTATTGCATAAGTAACTAAAGAAACAATTGACAGTCTCAGCAAATATGTAGGAAAGGAT
L L
H K2 Q L
K K Q
L T V0 S A N M2 Q E R I
TACTTCCCTTTATTTGATGAAATGATGAAGCATTCAGGTAATTACTTTAATTAACATTTT
T S
L Y L
M K *
AGTTTCAGTTGATTCTCAACTTTTTTTTACTAAAATTAAAGGCGAAATAGGTATTAGGGT
V1 S V1 D K I
K G E
I G I
R V
TTTCTTTGGACCAGATTTTGGTGATCACCAAATAGATGGTAAGAATATATCTGAAGAGCT
F F
G P D
F G D
H Q I
D G K
N I S
E E L
TAAGGAGGTAAATGATTAATTGATTCATTTATTTATTTATTTCGTAGGTATATTATAATT
K
E V
TGTATATGATTTATCTTAACATCAAGAATGAAAACATATTTATATAAATGCAAAAATAAA
K H I
Y I N
A K I
K
ATTAAAGATTGTAGATCAGACTATTGTTTACAGCTATACTTCTATTTTCTTCTAATTAAA
L K
I V D
Q T I
V Y S
Y T S
I F F
Q L K
GAGAATCTTTTTGGGAGATGAATTGGCAATGAAGTTTTTAACTGCGAAAGAAAAGAAACT
R I
F L G
D E L
A M K
F L T
A K E
K K L
TTTGTCTAGGAGTGATAGGATAAATTAAATTGCTATGGAATAAATAGAAAAAAAAATAAA
ATACTATGAAGGTTAACCTAGTGTACAATCAGACTACCTTATAGATGTATTAGTGAGAAA
TTATCTCACAAAATAAGACCCAAATCTAACAAAAGAATAAGTAATCCATTTTATATTAAG
D P N
L T K
E Q V
I H F
I L S
CTTTTTTTAAAAATAAATGTCTTCATTTATATCTAATTAGATATGCTAATAGTGCATAAC
F
F Q K
Q M S
S F I
S N Q
I C
TATGTTTATTGCAGCTTTAGACAACACTGGATGCATAATAGCTTGGATAATTTTTTGTTT
GGCTAACTATAAAGAAGAAGGAGATAAAGTAAGATAAGAAATCAACTCCGTCTTTCCAAC
ATAGGAGACCTTGGATAACATCTAATTTGATGATTTAAAAAATCTTAATTATACTTCAGC
ATTTATTGATGAATGTTTCAGGCACTATACAAGTTCTATGGGCGTTTTTCTCCGTAGAGT
H Y
T S S
M G V
F L R
R V
AGTTGAACCCTTTAAGTCTGGAAACTTTGAAATATAAAAAGGTACTGATTAATCTATCAT
V E P F
K S G
N F E
I Q K
G
TAATAAATATTAATTATTTTAATTTAAAACTAAACTAATTAACTAAAAATTTTTAAATTA
AATAAAGGAGATATTATTTAAAGTTTATGGCATCCTACCTAATTTAATCCTGACTGGTTT
K G D I
I Q S
GAAAATCCTGATAAATTTGATGTGAATAGATTCTTAAATGGAAAAGAAAAGTAAGGAAAC
>CYP5003A1 8254752 1171415 bp INTRONS 1, 3 AND 4 HAVE UNDEFINED
BOUNDARIES 29% TO CYP694C3
MILLIIGLLIFSLFSYFAYLIFVKPYVRNKWYVKQYGKICKPIYYPILGNIKFVRESFKKYG
DSLQWIRELLKNDNDVQFLIGYNNIHIVDAQLAKDITLNNLQNIRKDMGMQFGDLLFKQG
IVFSEGEKWKKQRQILSHSFSFDVLKNRVSKINNIVKEMINKIQIEE
(1)
GKPTKIVTECTKITAEVVLRSFFGERFSQQQHKGINLGIYLAQL
LLSVFDQTRRLRMLIPSFFFPKFMSKIFADEKYYEVQKNCIEFRQLIEKEVNLRIENYDQ
NKTQKDFLDILVDEYKLNYSKNDGKVEEYITKESIIHQFIVFFFA
(1)
GMDTTGNLTGIMLYW (2)
VSKRKDIYEKLVQEIKSVFGQNKDPEINDEQLKKLNYCHMFIQECLRYHCPAMLL
(2)
FTRRAERDFYIGDNILVQKGMQVNISLHGVLRRE
262363
KYFQNPDEFIPERFSEENKKNINHLAFIPFSSGPKNCIGQHMALIEAKIILVQFILNFDI 262184
262183 TNNENVPLKMDSNSLYCPVNDELLFLKRKQ* 262091
AATCTCTTTCAGCTCTACGTGTGAACCTTAAAAATTTACAAAAGTTAATTTTAATAGACT
D1 R2 E A1 R R T
F R2 L F
K C F
N I K
I S Q
L L
S0
AAAAATATTATTTATTTACAATAACATTGCTGGGCAGTGGTATCTGAGACATTCTTAGAT
F I
N N I
Q L2 L M2 A P
AAACATGTGGCAATAATTTAATTTCTTTAGTTATTCGTCATTGATTTCAGGATCTTTATT
TTACCCAAATACACTTTTTATTTCTTAAACTAATTTCTCATATATATCTTTTCTTTTAGA
R
K S
AACCCTATTTGATAATAATTTACATTTAATTATTATTTTTATTATTTTTATAATATCTAC
V R2 W
CAATAAAGCATGATGCCAGTCAAATTTCCAGTTGTATCCATACCTAAATTTTAAATGAGT
CAAAAAAGGATCTTAAAACAACTTCTGCCGTTATCTT
AGTGCATTCAGTTACAATTTTAG
A T I
K0
TTGGTTTTCCTATAAAAAAATAATTAAATTATTTACATACGATATTGTTTTAACTTTATA
P
K G1 I
TTCATCAAATCAGATTAATTATAAATAACTAAATTACCTTCTTCAATTTATATTTTATTA
V L
N G E
E I Q
I K N
I
ATCATTTCTTTAACAATATTATTGATTTTAGATACTCTATTTTTCAACACATCAAAAGAG
>CYP5004A1 8254638 979488 bp 2 contigs 31% to CYP694C3 COMPLETE, NO
INTRONS
747540
MIILIASLLTILTFCILLLIFSPKIKLLHLAKKYKDTEVLFSPVGGIVNLYQQSFEKY 747367
747366
GDALYFIRDLIQKKPNTKAILYNLGVDIVVNFQDAELVKEVHQSYSDYYEKLKGILTLQY 747187
747186
LFDKGMLMTDGDEWQKQRKLLANTFLFQNIKSRLPVTKQVVQEIYSSKQLDLSKPINALH 747007
747006
MSEKVTSDVIFQTFFGISFRGKQINGVEIQKELSNVLIEGFNYIQSSFYYKLKMVILK 746833
746832
EKAVGFKPFKGEEDLLKRMKNLRDTTEEVIQNRLKQIKSGNLDISQSELFLDILLRDYI 746656
746655
QKNSESEKDIQQMIDQFMTFFFGGTDTTADLVALTLYHLSHNQNFQTKLRQEIKSIINNF 746476
746475 EELNYDNLNQMNYLQCVIKESLRIHPPAVGVLPRVCIKDHKVGQIEMKKGMLMDTHFIGV
746296
746295
LNNPKYYDNPQDFNPDRWNDSKKMESAPFSPFGIGKRSCIGQHLGMMNAKVIICFLVMNY 746116
746115 LVKPNHQQKLRMSLNTVYTPFNDDLVYFEKIN* 746017
>CYP5005A1 8254815a 909308 bp 16 contigs 10 genes all in (+) 29% to
CYP694C2
MILVSYIALGA
241470
IVLLIAIFIYNLILYPQIRLNLMKQKHGDKIKTFFHIGEGIFKEYYTNLTQKKDSVAFIR 241649
241650
GIHQKNIKALAYNLGTRICLNFVDPELVKQVHQNPEAFRKNDASLAITFLFGNSILFA 241823
241824
KGKSWQRQRQFLGKSFHFEEIKNYLPLIKEICDKVFERVSEKLTQNDQLEINAVKICEQV 242003
242004 TSEVVFRAFFGSTTQNLVITRQDGSQIPIADELIQAIMNSFQLLKKDKIALFKYLIFGRN
242183
242184
STKFLRTQGEQSILNRLIAIKESCLQVVQQRKNQLQNDPTQAKKNFLDQYLYDMITNKQ 242360
242361
SEVNNEEIIDNFLGLFFAGTDTTGNMTGVALYYLSRYPDIQKQAREEVIQILSQNSNKRN 242540
242541
HSELFSQLTFENLQNMNLINSILKESLRLIPPAIEVFPRVAIHNMKIGEFQIKKGDLVST 242720
242721 YFIYNQSNPELYQDPEIFNPQRWMNVKD 242804 (2)
242867 SQNNFNFTPFSLGPRNCIGQHLAMIEGKCMLACILLNFDILPNYQQEVVKELRVIY
243034
243035 GFQKDNLIYFKQRKQNN* 243088
ATCCAGAAATATTTAATCCTCAAAGGTGGATGAATGTAAAAGAGTAAGTCTTTTATATTT
W M
N V K
E2
ATTATATTCTATATTTATAATGTATCTTCTTAATATAAATCAAAGCTCATAGAATAATTT
S2
S Q0 N N F
CAACTTTACTCCGTTTTCACTAGGCCCTAGAAATTGCATCGGTTAGCATCTTGCTATGAT
N
F T P
F S
>CYP5005A2 8254815b 909308 bp 16 contigs 10 genes all in (+) 30% to
CYP694C1
244655 MLLNSIIISFVIILIIYKFLLYPKLRLIQIKRKYGDKIKVIHHYGPGLLVEYKK
244816
244817
SYTQNKDSLSFLRQMAYQNGKSIQAYAFNIGTNVGYSFVDPELIKQVHQNHDAFQKIDV 244993
244994
SQAIAYLFSNSLLFATGKSWQKQRQFLGKSFHYDEIKNYFPSIKDICLKVSDRISEELI 245170
245171
QNPNKEIKVVKTCEQITSEVVFRAFFGATSENIFISTENGSTKIPISQELVSVIMDSFRI 245350
245351
LQQSKLIQLKSQLLKRKTFNIFPLKEEKKLLNRLITLKQACKNIVLKRK 245497
245498
EELVSDPSLYKKNFIDLYLKEIIQNSNNEITIEEIVDNFCALFFAGTDTTGNMTGAALYF 245677
245678
LSLNPKIQQEAREEVMQIIKQKANSSDLKDIYNYLSFEDLTKLDLINSILKESLRLLPPASGV 245866
245867 LPRISNRDIKIGQFEVKKGDLVNTHFIYHQSNPEVYENQDQFNPYRWMKGKE
246043 (2)
246053 QNNAFNFTPFSLGPRNCIGQHLAMIEGKCILINFLINFDIMPIPQKQV
246229
246230 QYEMKVIYGLQPDDLVYFRKRNN* 246301
TAGATGGATGAAAGGAAAAGAGTAAAACATTTTTATATAATTATAAAAAAATTAAATAAT
W M K G
K E
TTTTAAATAAATATAAATTAAAAGGTAAAATAATGCATTCAATTTTACTCCTTTTTCGTT
Q N N
A F N
F T P
F S L
AGGACCTAGAAACTGTATAGGATAACATTTGGCGATGATAGAAGGTAAGTGTATTCTTAT
>CYP5005A3 8254815c 909308 bp 16 contigs 10 genes all in (+) 31% to
CYP694C1
247451 MIFVAIYTILGILALIFAIALYNLVFYPYLRLKIMKQKYGDKIQVFFNIGDGILKKYKNDLE
247636
247637
DKGDSLAFIRGMHQKNLKAIGFNLGAKVGLSFVDPDLIKQVHQNHDSFHKIDAAMAIT 247810
247811
FLFGNSILYAKGKDWQRQRQFLGKSFHFEEIKNYLPLIKEVCQNTFQNVNKQLALKEQTE 247990
247991 IQAVKICEQITSEVVFRVFFGSTSQNLDIKKEDGTRIPIAHELVETIMNSFQLLQNDKIA
248170
248171
LIKWLLFKRNSTKFLRTQGEESILNRLITVKQTCLQVVIKRRDELFKDPSQAKKNF 248338
248339
LDQYLIDMIQNKNSKVTYDEIVDNFSGLFFAGTDTTGNMTGVALYYLSLYPEIQQQAREE 248518
248519 VIKVLSQKQKEKKTDQLF
248573 NQLTFDDLSNLDLINSILKESLRLVPPANEVFPRIADHDMKIGDFQIEKGDLVNTHFIYN
248752
248753 QSNPEYYPNPDIFDPYRWMNAKD 248836 (2)
248875 QQNTFHFTPFSLGPRNCIGQHLAMIEGKCMLASILLQFEILPNHSAKIVKEVKVIYGLKN
DNIIFFKKLKAN* 249093
CCCTGAATATTATCCAAATCCAGATATTTTTGATCCTTATAGATGGATGAACGCAAAAGA
R W M
N A K
E
GTAAATTTTATTTTTATATTCATTCTTAAAAAATAATATTTAACTAAAAATAGTTAATAG
S Q Q
AACACATTTCACTTTACTCCATTCTCTTTAGGTCCCAGAAATTGCATAGGTTAACATCTA
N T
F H F
T P F
S L
>CYP5005A4 8254815d 909308 bp 16 contigs 10 genes all in (+) 29% to CYP696C1
249796 MFVENINLIPVLFILGILFILYKMILYPKIRLMQLQKKYGESIKVVHHYGTGLLMEYKKGFQQLND
249993
249994
SMAFIRKMSKENQSNVEAFAFNIGPKIGISFVYPELIKQVHKNHDAFQKIDVSQAVVYLF 250173
250174
SNSLIYATGKQWQRQRQFLGKSFHFEEIKNYLPCIKDICIKTSNQISEDIRINPQKEIQV 250353
250354 VKICEKVTSEVVFRVFFGSTQENILIEREDGQKLSISEELVSIVMDSFRILQQDKLLLIK
250533
250534
SLILKRYSMKIFPLREEKSLHNRLIQLKKACETIVLQRKKQLQLDPALYKNNFLDLYL 250707
250708
KEMIQNSNTQITIDEIIANFCGLFFAGTDTTGNMTGVALYYLSLNPQIQKEAREEVIQI 250884
250885
ISKKNSNLDLKDQFNYITFEDLSSMNLINSILKESLRLIPPAIGVFPRYANRDIKI 251052
251053 GQFELKKGDLVNTHFIYNQSNPSIFQNPEQFDPKRWMNGND
(2)
251242 LQFAFSFTPFSLGPRNCIGQHLAMIEGKCMLANFLLKYDILPNKSQNIGLEMKIIYGLSPDNLVYFKQR*
251451
TACAATTAGTCAAATCCTTCAATATTTTAAAATCCAGAACAGTTTGATCCTAAAAGATGG
R W
ATGAATGGAAATGAGTAATTAATTAATTAATTAATCATTTAATTTTTTATTATTTATTTA
M N
G N E
CAAAAATTTATTAAATATAGTTTGTAATTTGCGTTTAGCTTTACACCATTTTCATTAGGC
L Q F
A F S
F T P
F S L
>CYP5005A5P 8254815e 909308 bp 16 contigs 10 genes all in (+) (+) 29% to
CYP694C1
one in
frame stop and a frameshift, stop codon missing at end, may be a pseudogene
252767 MLKILITFLGLFLFLLVFVVYKVFIYPYLRLSTFSKKHGKKVKVYFNFGVGLL
252925
252926
KEYKQSLIQYKDSLAFFRNLHQKSLQAIAFCFGTKVGLIFVDSNLIKQVHQNHEAFM 253096
253097
KIDASMAIIYLFGDSLIYAKGKEWQRQRKFLGKSFHFEEIKNYLPSIKETSIKIFGEIKD 253276
253277 QLKQSDQMEIQVVKTCESVTSEVVFRVFFGQTSQNLKNTKEDGSQISVAQELVSTITNSF
253456
253457 QILQNDKISLIK*IFFERNTIKYFPTKNEQNLQKRLIEIKKQCMQVVEKRRIELIE 253624
253625
VIKSAKNNFLHQYLKEMIVNEKSKISNEEIIDNFLALFFAGTDTTGNMTRVALYYLSLYP 253804
253805 DIQNKAREEIIKLASSRINSTNPVDLFNSLTFEDIQNLNFLNSILKESLRLIPPAIEVFP
253984
253985 RLAIQDIQIGD 254017 (fs)
YEVKKGDFINTYFIYNFSNPEIYPEPDNFDPSRWMKQID
(2)
254199 QQNTFNFTPFSLGPRNCIGQHLAMIEGKSMLAYILLNFEKFPNKNQEVVKEMKIIYGFQKDNLVYFKNRSQ
254411
RKKSGNIQLTQQFYIQIKKYLFLLCLKFQYVFFTTKKQKPSILQKFIKEKQNKFKHQKQTRFYFQHSKQKEVEINMI
LIISIIILQIKKQNMRKNISYKQKQFKQKNFLSLIELLFLSFKLKFFIFQK*
ATTTTGATCCTTCAAGATGGATGAAACAAATAGAGTATTTATTTAATATTCATAAATAAT
W M K
Q I E2
TAGTTAACATTAATGTATTTTTTTAATGAATAAAAAGTTAGTAGAATACATTTAATTTCA
Q Q N
T F N
F T
CTCCCTTCTCATTAGGACCAAGAAATTGCATTGGATAGCATCTAGCCATGATTGAAGGAA
P
F S L
>CYP5005A6 8254815f 909308 bp 16 contigs 10 genes all in (+) 30% to
CYP696C1
285979 MLIIFQIIFAIIASILTLAIYSLVLYPFFWFILMKYKYGDKIKVFFNIGQGMLKEYKKGLIE
286164
286165
KNDGVAFIRNMHQKNLKAMAFNYGSKVGFSFLDPELIKQVHQNHEYFIKMDATMAVTF 286338
286339
LFGNSILYAKGKEWQKQRQFLGKSFHFDEIKSYFPQIKEICQNTFRNTYQKLTNEEQINI 286518
286519 QAVKICETVTSEVIFRTFFGKTSQNLNITKKDGSQILLAFELVEVVKSAFQMLT
286680
286681
IDKIALIKWIFLQRNSARYFRTEKEEDLFQRLTAIKECCLQIVQKRRDELIKEPTQAKK 286857
286858
IFLDQYLIDTMQNQSSQVTDDEIIDNFCGMFFAGTDTTANMTGVALYYLSIYPEIQRQAR 287037
287038 EEIIKILSSKSNDKNPDSLFSQFSIEDL 287121
SNLDLINSILKESMRLIPPAIQVFPRIACQDIKIGEFEIKKGQLVTTNFVYN
QSNPEIFPKPDNFDPFRWMNEDN 287346
(2)
TQSNTFNFTPFSQGPRNCIGQHLAMIEGKCMLVYILLLFDILPNPSVLVAKEVKV
287569
287570 IYGFQNDNLIYFKKRQNII* 287629
region
around intron in CYP5006A6
TCAAATCCTGAAATATTTCCTAAGCCAGATAATTTTGATCCATTTAGATGGATGAATGAA
W M N
E
GATAAGTAAGTTTAATTAATATTTATATTTTTAGAAAATATTTATCATTAATTTAACAAA
D K2 Q V0
TAGCACTTAGAGTAATACATTTAACTTTACTCCTTTCTCTTAAGGCCCAAGAAATTGTAT
S2 T Q0 S2 N T F
N F T
P F
>CYP5005A7P 8254815g 909308 bp 16 contigs 10 genes all in (+) 30% to
CYP694C1
one in
frame stop codon same as CYP5005A5,
might be a pseudogene
MLKILFTFLGLFL
289100
LLLVFVVYKVFIYPYLRLSTFSKKHGKKVKVYFNFGVGLLKEYKQSLSQYKDSLAFFR 289273
289274
NLHQKSLQAIAFCFGTKVGLIFVDSNLIKQVHQNHEAFMKIDASMAIIYLFGDSLLYA 289447
289448
KGKEWQRQRQFLGKSFHFEEIKNYLPSIKETSMKLFGEIKEQLKQSDQIEIQVVKTCESV 289627
289628 TSEVSFRVFFGQTSQNLKITKEDGSQISVAQELVSTITNSFQILQNDKISLIK*IF 289795
289796
FERNTIKYFPTKNEQNLQKRLIEIKKQCMQVVEKRRIELIEVIKSAKNNFLHQYLK 289963
289964
EMIVNEKSKISNEEIIDNFLALFFAGTDTTGNMTRVALYYLSLYPDIQNKAREEIIKLAS 290143
290144 SRTKSTNPVDLFNSLTFE 290197
290198 DIQNLNFLNSVLKESLRLIPPAIEVFPRVAIQDIQIGDFEVKKGDFINTFFIYNFS
290365
290366 NPEIYPEPNNFDPSRWMKQID 290428 (2)
290492 QQNTFNFTPFSLGPRNCIGQHLAMIEGKSMLAYILLNFEIFPNKNQEVVKEMK
IIYGFQKDNLVYFKNRSQ* 290707
region
around intron in CYP5006A7
TCCTTCAAGATGGATGAAACAAATAGAGTATTTATTTAATATTCATAAATAATTAGTTAA
R W
M K Q
I E
CATTAATGTATTTTTTTAATGAATAAAAAGTTAGTAGAATACATTTAATTTCACTCCCTT
S2 Q0 Q0 N T F N F T P
F
CTCATTAGGACCAAGAAATTGCATTGGATAGCATCTAGCCATGATTGAAGGAAAATCTAT
>CYP5005A8 8254815h 909308 bp 16 contigs 10 genes all in (+) 30% to
CYP694C1
381326 MISLLHLIFGVLLIPIILIIYKVILYPLTRLLYLKYVHLDKVRIFYTYGQGFFPQFKKNLLEKK
381517
381518
DSLAFVREMYSKKLQTALFSIGTKVGFIFLDPELIKQVHQNADAFKKNDGSLAITYLF 381691
381692
SNSLLFVKGKEWQRQRQFLAKSFNFQDIKNYLPIFKDVSSKIFGLIRNNLEISGTQEIEI 381871
381872 VKTCEKVTSEAVFRVFFGQTSQNLMVTSKDGSSTLLAHELVAVVVDSFHMLLHDKLLLI
382048
382049
KFTMLGKNSINILPTQSELKLLNRLKEVKRVCLEIVEKRRNELLKDQSQFKNNFLDQYL 382225