48
Tetrahymena P450 genes
This file is being updated to complete these genes. Currently 37 are
complete (magenta). There are very few short introns present.
There are four full length pseudogenes and one solo exon
pseudogene.
Note: this is the first time CYP names have used four
digit numbers for families.
Blast server
including Tetrahymena P450s
sequence
alignment of 47 Tetrahymena P450s
D. Nelson April 23, 2004
>CYP5001A1 8254555 596115 bp 4 contigs 32% to CYP689A1 This gene's N-term
is very
hard to
detect. I have assumed one intron,
but this may not be true.
MPIQHNSQPKEVQTCEQMMKQLENNGFIRLKSYRPFQEEDLFKYDDELYSVKQ
FKSRYPNAKGFICYSDSSQKVPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGD
SLLFNLSDIWRMKRKVYGQMFHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGY
NASQYDLQSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNITC
(2)
YSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFL
105041
DQICQDIIIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLN 105217
105218 YLEAVLKETLRHYGPTTTILPKVVMERHSFMNDKVFLEEGTIVSLAIQELNYDPQHYENP
105397
105398
FKFYPERWFDENLKQKEKKIFMPFSYGSHKCIGDKLALPVMKMLYITLNEYFSIDIDPSF 105577
105578 KLKLVALIQYTPSTQIPFLITKKKNIQSNNEVYFS*
105685
ATTCTAGGAGAATGGTCTCTTCATCTATTTTTCGGATATAATGCTAGCTAATATGATTTA
F F
G Y N
A S Q
Y D L
CAATCTAAATTAAAAGCAGCTTATAACAATGCTCGCTCTGCCATACTCCTTTTACAGCCC
Q S K
L K A
A Y N
N A R
S A I
L L L
Q P
P Y
S F Y
S P
CAAAATATAATAAACTGTCTTGACCCTTTAAAGGAAGTCCTCCTTGAATATCTATACATC
Q N I
I N C1 L D P L
K E V0 L L E Y
L Y I
K I Q
Q T V L
T L Q
R K S
S L N
I Y T
S
ACTTCTGATTAGATAAAAAGCGATCCTGATTACTAAAGCAAACCTTAAAATAACATCACA
T S D
Q I K
S D P
D Y Q
S K P
Q N N
I T
L L
I R2 Q K
A1 I L I
T K A1 N L K I
T S H
TGGTATAAAATTTAAAAATTAATTAAAAACAATAAAATAAATGTTTTTTTTAGTTAGTTA
W2 Y
K I Q
K L I
K N N
K I N
V0 F F S1 Q2 L
G I K
F K N
Q L K
T I K
Q M F
F L V
S2 Y
TTCATTCATTCATTCATTTTACTTACTAATAATTTTTTTGAAATTTAAAAACAGGTTTTT
F I H
S F I
L L T
N N F
F E I
Q K Q
V0
S F I
H S F
Y L L
I I F
L K F
K N R2 F L
GCTAAATAGCAAAGAAGTTTAATGGAACAAAGATTTCCTTGATTAAATTTGCTAAGATAT
L N S2 K E1 V1 Q W N
K D1 F L
D Q I
C Q D
I
TATCATCTTTTTCTTCGCTGGAAGAGATACGATGGCTCATACTCTTTAAATGATGTTTTA
F A G
R D T
TTATTTATGTATTTACCCTGAATATAAAAAGAAGATAGACGAAGAAATAAATAGCCTAAA
CGGAGATTATTCTGTTTAAAATATTTCAAATTTAAATTATTTAGAAGCTGTTTTAAAAGA
5 POSSIBLE
INTRON LOCATIONS THAT MAKE SENSE.
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL
QSKLKAAYNNARSAILLLQPQNIIN (1) SILITKANLKITSH
GIKFKNQLKTIKQMFFLVSYSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL
QSKLKAAYNNARSAILLLQPQNIIN (1) SNLKITSH
GIKFKNQLKTIKQMFFLVSYSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
MY CHOICE
BASED ON COMPARISON TO PARAMECIUM P450S
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGY
NASQYDL
QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNITC
(2) YSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL
QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNIT
WYKIQKLIKNNKINVFF (1) KVQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM
FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL
QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNIT
WYKIQKLIKNNKINVFF (1) IQWNKDFLDQICQDI
IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK
>CYP5002A1 8254654 510086 bp 17 contigs 30% to CYP696C1
MLTELFYTVILLIGVLFYLTFLRPLYKLLMLKIKHGKEILIKFEPIIGIHAYLKKSEKIHGDPMKLLKLEMAKN
PKLKAVACSLGFGVLITSVDPDMNKSFLIDNYQKSVKLAPIFNYKILKNGIIFSE
GEQWKKQRKMISDSFHFDSKYYFFSQKFNLLTLIMFKKVQNR (1)
VSVDSQLFFTKIKGEIGIRVFFGPDFGDHQIDGKNISEELKE (0)
IVDQTIVYSYTSIFFQLK
RIFLGDELAMKFLTAKEKKLLSRSDRINQIAMEQIEKKIKYYEGQPSVQSDYLIDV
292370 LVRNYLTKQDPNLTKEQ (0)
ICQQCITMFIAALDNTGCIIA 292543
292544 WIIFCLANYKEEGDKVRQEINSVFPTQETLDNIQFDDLKNLNYTSAFIDECFRHYTSSMG
292723
292724 VFLRRVVEPFKSGNFEIQK (1)
GDIIQSLWH
292894
PTQFNPDWFENPDKFDVNRFLNGKEKQGNSYAFTPFSIGPRNCIGKQMSLVEVKVSLIYF 293073
293074 VKKFNCIRNSVPLKTTASIVYKPVDQNLIKISFCL*
293181
AATATTTAACTATAAAATACTTAAAAATGGAATAATCTTTTCAGAAGGAGAACAGTGGAA
W K
GAAATAGCGCAAAATGATATCAGATTCCTTCCATTTCGATAGTAAATACTATTTTTTCAG
K Q R
K M I
S D S
F H F
D S1 K Y
Y F F
S1
TCAAAAATTTAATTTATTAACTCTCATTATGTTCAAAAAGGTTTAAAATCGAGGTTTTAA
Q K F
N L L
T L I
M2 F K K
V0 Q N R
G1 F K2
GTTATTGCATAAGTAACTAAAGAAACAATTGACAGTCTCAGCAAATATGTAGGAAAGGAT
L L H
K2 Q L K
K Q L
T V0 S A
N M2 Q E
R I
TACTTCCCTTTATTTGATGAAATGATGAAGCATTCAGGTAATTACTTTAATTAACATTTT
T S L
Y L M
K *
AGTTTCAGTTGATTCTCAACTTTTTTTTACTAAAATTAAAGGCGAAATAGGTATTAGGGT
V1 S V1 D
K I K
G E I
G I R
V
TTTCTTTGGACCAGATTTTGGTGATCACCAAATAGATGGTAAGAATATATCTGAAGAGCT
F F
G P D
F G D
H Q I
D G K
N I S
E E L
TAAGGAGGTAAATGATTAATTGATTCATTTATTTATTTATTTCGTAGGTATATTATAATT
K
E V
TGTATATGATTTATCTTAACATCAAGAATGAAAACATATTTATATAAATGCAAAAATAAA
K H I
Y I N
A K I
K
ATTAAAGATTGTAGATCAGACTATTGTTTACAGCTATACTTCTATTTTCTTCTAATTAAA
L K I
V D Q
T I V
Y S Y
T S I
F F Q
L K
GAGAATCTTTTTGGGAGATGAATTGGCAATGAAGTTTTTAACTGCGAAAGAAAAGAAACT
R I F
L G D
E L A
M K F
L T A
K E K
K L
TTTGTCTAGGAGTGATAGGATAAATTAAATTGCTATGGAATAAATAGAAAAAAAAATAAA
ATACTATGAAGGTTAACCTAGTGTACAATCAGACTACCTTATAGATGTATTAGTGAGAAA
TTATCTCACAAAATAAGACCCAAATCTAACAAAAGAATAAGTAATCCATTTTATATTAAG
D P N
L T K
E Q V
I H F
I L S
CTTTTTTTAAAAATAAATGTCTTCATTTATATCTAATTAGATATGCTAATAGTGCATAAC
F
F Q K
Q M S
S F I
S N Q
I C
TATGTTTATTGCAGCTTTAGACAACACTGGATGCATAATAGCTTGGATAATTTTTTGTTT
GGCTAACTATAAAGAAGAAGGAGATAAAGTAAGATAAGAAATCAACTCCGTCTTTCCAAC
ATAGGAGACCTTGGATAACATCTAATTTGATGATTTAAAAAATCTTAATTATACTTCAGC
ATTTATTGATGAATGTTTCAGGCACTATACAAGTTCTATGGGCGTTTTTCTCCGTAGAGT
H Y T
S S M
G V F
L R R
V
AGTTGAACCCTTTAAGTCTGGAAACTTTGAAATATAAAAAGGTACTGATTAATCTATCAT
V
E P F
K S G
N F E
I Q K
G
TAATAAATATTAATTATTTTAATTTAAAACTAAACTAATTAACTAAAAATTTTTAAATTA
AATAAAGGAGATATTATTTAAAGTTTATGGCATCCTACCTAATTTAATCCTGACTGGTTT
K G D I
I Q S
GAAAATCCTGATAAATTTGATGTGAATAGATTCTTAAATGGAAAAGAAAAGTAAGGAAAC
>CYP5003A1 8254752 1171415 bp INTRONS 1, 3 AND 4 HAVE UNDEFINED
BOUNDARIES 29% TO CYP694C3
MILLIIGLLIFSLFSYFAYLIFVKPYVRNKWYVKQYGKICKPIYYPILGNIKFVRESFKKYG
DSLQWIRELLKNDNDVQFLIGYNNIHIVDAQLAKDITLNNLQNIRKDMGMQFGDLLFKQG
IVFSEGEKWKKQRQILSHSFSFDVLKNRVSKINNIVKEMINKIQIEE
(1)
GKPTKIVTECTKITAEVVLRSFFGERFSQQQHKGINLGIYLAQL
LLSVFDQTRRLRMLIPSFFFPKFMSKIFADEKYYEVQKNCIEFRQLIEKEVNLRIENYDQ
NKTQKDFLDILVDEYKLNYSKNDGKVEEYITKESIIHQFIVFFFA
(1)
GMDTTGNLTGIMLYW (2)
VSKRKDIYEKLVQEIKSVFGQNKDPEINDEQLKKLNYCHMFIQECLRYHCPAMLL
(2)
FTRRAERDFYIGDNILVQKGMQVNISLHGVLRRE
262363 KYFQNPDEFIPERFSEENKKNINHLAFIPFSSGPKNCIGQHMALIEAKIILVQFILNFDI
262184
262183 TNNENVPLKMDSNSLYCPVNDELLFLKRKQ* 262091
AATCTCTTTCAGCTCTACGTGTGAACCTTAAAAATTTACAAAAGTTAATTTTAATAGACT
D1 R2 E A1 R R T
F R2 L F
K C F
N I K
I S Q
L L
S0
AAAAATATTATTTATTTACAATAACATTGCTGGGCAGTGGTATCTGAGACATTCTTAGAT
F I
N N I
Q L2 L M2 A P
AAACATGTGGCAATAATTTAATTTCTTTAGTTATTCGTCATTGATTTCAGGATCTTTATT
TTACCCAAATACACTTTTTATTTCTTAAACTAATTTCTCATATATATCTTTTCTTTTAGA
R K S
AACCCTATTTGATAATAATTTACATTTAATTATTATTTTTATTATTTTTATAATATCTAC
V R2
W
CAATAAAGCATGATGCCAGTCAAATTTCCAGTTGTATCCATACCTAAATTTTAAATGAGT
CAAAAAAGGATCTTAAAACAACTTCTGCCGTTATCTT
AGTGCATTCAGTTACAATTTTAG
A T I
K0
TTGGTTTTCCTATAAAAAAATAATTAAATTATTTACATACGATATTGTTTTAACTTTATA
P
K G1 I
TTCATCAAATCAGATTAATTATAAATAACTAAATTACCTTCTTCAATTTATATTTTATTA
V L N
G E E
I Q I
K N I
ATCATTTCTTTAACAATATTATTGATTTTAGATACTCTATTTTTCAACACATCAAAAGAG
>CYP5004A1 8254638 979488 bp 2 contigs 31% to CYP694C3 COMPLETE, NO
INTRONS
747540 MIILIASLLTILTFCILLLIFSPKIKLLHLAKKYKDTEVLFSPVGGIVNLYQQSFEKY
747367
747366
GDALYFIRDLIQKKPNTKAILYNLGVDIVVNFQDAELVKEVHQSYSDYYEKLKGILTLQY 747187
747186
LFDKGMLMTDGDEWQKQRKLLANTFLFQNIKSRLPVTKQVVQEIYSSKQLDLSKPINALH 747007
747006 MSEKVTSDVIFQTFFGISFRGKQINGVEIQKELSNVLIEGFNYIQSSFYYKLKMVILK
746833
746832
EKAVGFKPFKGEEDLLKRMKNLRDTTEEVIQNRLKQIKSGNLDISQSELFLDILLRDYI 746656
746655
QKNSESEKDIQQMIDQFMTFFFGGTDTTADLVALTLYHLSHNQNFQTKLRQEIKSIINNF 746476
746475
EELNYDNLNQMNYLQCVIKESLRIHPPAVGVLPRVCIKDHKVGQIEMKKGMLMDTHFIGV 746296
746295
LNNPKYYDNPQDFNPDRWNDSKKMESAPFSPFGIGKRSCIGQHLGMMNAKVIICFLVMNY 746116
746115 LVKPNHQQKLRMSLNTVYTPFNDDLVYFEKIN* 746017
>CYP5005A1 8254815a 909308 bp 16 contigs 10 genes all in (+) 29% to
CYP694C2
MILVSYIALGA
241470 IVLLIAIFIYNLILYPQIRLNLMKQKHGDKIKTFFHIGEGIFKEYYTNLTQKKDSVAFIR
241649
241650
GIHQKNIKALAYNLGTRICLNFVDPELVKQVHQNPEAFRKNDASLAITFLFGNSILFA 241823
241824
KGKSWQRQRQFLGKSFHFEEIKNYLPLIKEICDKVFERVSEKLTQNDQLEINAVKICEQV 242003
242004
TSEVVFRAFFGSTTQNLVITRQDGSQIPIADELIQAIMNSFQLLKKDKIALFKYLIFGRN 242183
242184
STKFLRTQGEQSILNRLIAIKESCLQVVQQRKNQLQNDPTQAKKNFLDQYLYDMITNKQ 242360
242361
SEVNNEEIIDNFLGLFFAGTDTTGNMTGVALYYLSRYPDIQKQAREEVIQILSQNSNKRN 242540
242541
HSELFSQLTFENLQNMNLINSILKESLRLIPPAIEVFPRVAIHNMKIGEFQIKKGDLVST 242720
242721 YFIYNQSNPELYQDPEIFNPQRWMNVKD 242804 (2)
242867
SQNNFNFTPFSLGPRNCIGQHLAMIEGKCMLACILLNFDILPNYQQEVVKELRVIY 243034
243035 GFQKDNLIYFKQRKQNN* 243088
ATCCAGAAATATTTAATCCTCAAAGGTGGATGAATGTAAAAGAGTAAGTCTTTTATATTT
W M N
V K E2
ATTATATTCTATATTTATAATGTATCTTCTTAATATAAATCAAAGCTCATAGAATAATTT
S2 S Q0 N N
F
CAACTTTACTCCGTTTTCACTAGGCCCTAGAAATTGCATCGGTTAGCATCTTGCTATGAT
N
F T P
F S
>CYP5005A2 8254815b 909308 bp 16 contigs 10 genes all in (+) 30% to
CYP694C1
244655
MLLNSIIISFVIILIIYKFLLYPKLRLIQIKRKYGDKIKVIHHYGPGLLVEYKK 244816
244817
SYTQNKDSLSFLRQMAYQNGKSIQAYAFNIGTNVGYSFVDPELIKQVHQNHDAFQKIDV 244993
244994
SQAIAYLFSNSLLFATGKSWQKQRQFLGKSFHYDEIKNYFPSIKDICLKVSDRISEELI 245170
245171 QNPNKEIKVVKTCEQITSEVVFRAFFGATSENIFISTENGSTKIPISQELVSVIMDSFRI
245350
245351
LQQSKLIQLKSQLLKRKTFNIFPLKEEKKLLNRLITLKQACKNIVLKRK 245497
245498
EELVSDPSLYKKNFIDLYLKEIIQNSNNEITIEEIVDNFCALFFAGTDTTGNMTGAALYF 245677
245678
LSLNPKIQQEAREEVMQIIKQKANSSDLKDIYNYLSFEDLTKLDLINSILKESLRLLPPASGV 245866
245867
LPRISNRDIKIGQFEVKKGDLVNTHFIYHQSNPEVYENQDQFNPYRWMKGKE 246043 (2)
246053
QNNAFNFTPFSLGPRNCIGQHLAMIEGKCILINFLINFDIMPIPQKQV 246229
246230 QYEMKVIYGLQPDDLVYFRKRNN* 246301
TAGATGGATGAAAGGAAAAGAGTAAAACATTTTTATATAATTATAAAAAAATTAAATAAT
W M K G
K E
TTTTAAATAAATATAAATTAAAAGGTAAAATAATGCATTCAATTTTACTCCTTTTTCGTT
Q N N
A F N
F T P
F S L
AGGACCTAGAAACTGTATAGGATAACATTTGGCGATGATAGAAGGTAAGTGTATTCTTAT
>CYP5005A3 8254815c 909308 bp 16 contigs 10 genes all in (+) 31% to
CYP694C1
247451
MIFVAIYTILGILALIFAIALYNLVFYPYLRLKIMKQKYGDKIQVFFNIGDGILKKYKNDLE 247636
247637
DKGDSLAFIRGMHQKNLKAIGFNLGAKVGLSFVDPDLIKQVHQNHDSFHKIDAAMAIT 247810
247811
FLFGNSILYAKGKDWQRQRQFLGKSFHFEEIKNYLPLIKEVCQNTFQNVNKQLALKEQTE 247990
247991 IQAVKICEQITSEVVFRVFFGSTSQNLDIKKEDGTRIPIAHELVETIMNSFQLLQNDKIA
248170
248171
LIKWLLFKRNSTKFLRTQGEESILNRLITVKQTCLQVVIKRRDELFKDPSQAKKNF 248338
248339
LDQYLIDMIQNKNSKVTYDEIVDNFSGLFFAGTDTTGNMTGVALYYLSLYPEIQQQAREE 248518
248519 VIKVLSQKQKEKKTDQLF
248573
NQLTFDDLSNLDLINSILKESLRLVPPANEVFPRIADHDMKIGDFQIEKGDLVNTHFIYN 248752
248753 QSNPEYYPNPDIFDPYRWMNAKD 248836 (2)
248875
QQNTFHFTPFSLGPRNCIGQHLAMIEGKCMLASILLQFEILPNHSAKIVKEVKVIYGLKN
DNIIFFKKLKAN* 249093
CCCTGAATATTATCCAAATCCAGATATTTTTGATCCTTATAGATGGATGAACGCAAAAGA
R W M
N A K
E
GTAAATTTTATTTTTATATTCATTCTTAAAAAATAATATTTAACTAAAAATAGTTAATAG
S Q Q
AACACATTTCACTTTACTCCATTCTCTTTAGGTCCCAGAAATTGCATAGGTTAACATCTA
N T
F H F
T P F
S L
>CYP5005A4 8254815d 909308 bp 16 contigs 10 genes all in (+) 29% to
CYP696C1
249796
MFVENINLIPVLFILGILFILYKMILYPKIRLMQLQKKYGESIKVVHHYGTGLLMEYKKGFQQLND 249993
249994 SMAFIRKMSKENQSNVEAFAFNIGPKIGISFVYPELIKQVHKNHDAFQKIDVSQAVVYLF
250173
250174
SNSLIYATGKQWQRQRQFLGKSFHFEEIKNYLPCIKDICIKTSNQISEDIRINPQKEIQV 250353
250354
VKICEKVTSEVVFRVFFGSTQENILIEREDGQKLSISEELVSIVMDSFRILQQDKLLLIK 250533
250534 SLILKRYSMKIFPLREEKSLHNRLIQLKKACETIVLQRKKQLQLDPALYKNNFLDLYL
250707
250708
KEMIQNSNTQITIDEIIANFCGLFFAGTDTTGNMTGVALYYLSLNPQIQKEAREEVIQI 250884
250885
ISKKNSNLDLKDQFNYITFEDLSSMNLINSILKESLRLIPPAIGVFPRYANRDIKI 251052
251053 GQFELKKGDLVNTHFIYNQSNPSIFQNPEQFDPKRWMNGND
(2)
251242 LQFAFSFTPFSLGPRNCIGQHLAMIEGKCMLANFLLKYDILPNKSQNIGLEMKIIYGLSPDNLVYFKQR*
251451
TACAATTAGTCAAATCCTTCAATATTTTAAAATCCAGAACAGTTTGATCCTAAAAGATGG
R W
ATGAATGGAAATGAGTAATTAATTAATTAATTAATCATTTAATTTTTTATTATTTATTTA
M N
G N E
CAAAAATTTATTAAATATAGTTTGTAATTTGCGTTTAGCTTTACACCATTTTCATTAGGC
L Q F
A F S
F T P
F S L
>CYP5005A5P 8254815e 909308 bp 16 contigs 10 genes all in (+) (+) 29% to
CYP694C1
one in frame
stop and a frameshift, stop codon missing at end, may be a pseudogene
252767
MLKILITFLGLFLFLLVFVVYKVFIYPYLRLSTFSKKHGKKVKVYFNFGVGLL 252925
252926
KEYKQSLIQYKDSLAFFRNLHQKSLQAIAFCFGTKVGLIFVDSNLIKQVHQNHEAFM 253096
253097 KIDASMAIIYLFGDSLIYAKGKEWQRQRKFLGKSFHFEEIKNYLPSIKETSIKIFGEIKD
253276
253277
QLKQSDQMEIQVVKTCESVTSEVVFRVFFGQTSQNLKNTKEDGSQISVAQELVSTITNSF 253456
253457 QILQNDKISLIK*IFFERNTIKYFPTKNEQNLQKRLIEIKKQCMQVVEKRRIELIE 253624
253625
VIKSAKNNFLHQYLKEMIVNEKSKISNEEIIDNFLALFFAGTDTTGNMTRVALYYLSLYP 253804
253805
DIQNKAREEIIKLASSRINSTNPVDLFNSLTFEDIQNLNFLNSILKESLRLIPPAIEVFP 253984
253985 RLAIQDIQIGD 254017 (fs)
YEVKKGDFINTYFIYNFSNPEIYPEPDNFDPSRWMKQID (2)
254199
QQNTFNFTPFSLGPRNCIGQHLAMIEGKSMLAYILLNFEKFPNKNQEVVKEMKIIYGFQKDNLVYFKNRSQ 254411
RKKSGNIQLTQQFYIQIKKYLFLLCLKFQYVFFTTKKQKPSILQKFIKEKQNKFKHQKQTRFYFQHSKQKEVEINMI
LIISIIILQIKKQNMRKNISYKQKQFKQKNFLSLIELLFLSFKLKFFIFQK*
ATTTTGATCCTTCAAGATGGATGAAACAAATAGAGTATTTATTTAATATTCATAAATAAT
W M K
Q I E2
TAGTTAACATTAATGTATTTTTTTAATGAATAAAAAGTTAGTAGAATACATTTAATTTCA
Q Q N
T F N
F T
CTCCCTTCTCATTAGGACCAAGAAATTGCATTGGATAGCATCTAGCCATGATTGAAGGAA
P
F S L
>CYP5005A6 8254815f 909308 bp 16 contigs 10 genes all in (+) 30% to
CYP696C1
285979 MLIIFQIIFAIIASILTLAIYSLVLYPFFWFILMKYKYGDKIKVFFNIGQGMLKEYKKGLIE
286164
286165
KNDGVAFIRNMHQKNLKAMAFNYGSKVGFSFLDPELIKQVHQNHEYFIKMDATMAVTF 286338
286339
LFGNSILYAKGKEWQKQRQFLGKSFHFDEIKSYFPQIKEICQNTFRNTYQKLTNEEQINI 286518
286519
QAVKICETVTSEVIFRTFFGKTSQNLNITKKDGSQILLAFELVEVVKSAFQMLT 286680
286681
IDKIALIKWIFLQRNSARYFRTEKEEDLFQRLTAIKECCLQIVQKRRDELIKEPTQAKK 286857
286858
IFLDQYLIDTMQNQSSQVTDDEIIDNFCGMFFAGTDTTANMTGVALYYLSIYPEIQRQAR 287037
287038 EEIIKILSSKSNDKNPDSLFSQFSIEDL 287121
SNLDLINSILKESMRLIPPAIQVFPRIACQDIKIGEFEIKKGQLVTTNFVYN
QSNPEIFPKPDNFDPFRWMNEDN 287346
(2)
TQSNTFNFTPFSQGPRNCIGQHLAMIEGKCMLVYILLLFDILPNPSVLVAKEVKV 287569
287570 IYGFQNDNLIYFKKRQNII* 287629
region
around intron in CYP5006A6
TCAAATCCTGAAATATTTCCTAAGCCAGATAATTTTGATCCATTTAGATGGATGAATGAA
W M N
E
GATAAGTAAGTTTAATTAATATTTATATTTTTAGAAAATATTTATCATTAATTTAACAAA
D K2 Q V0
TAGCACTTAGAGTAATACATTTAACTTTACTCCTTTCTCTTAAGGCCCAAGAAATTGTAT
S2 T Q0 S2 N T F
N F T
P F
>CYP5005A7P 8254815g 909308 bp 16 contigs 10 genes all in (+) 30% to
CYP694C1
one in
frame stop codon same as CYP5005A5,
might be a pseudogene
MLKILFTFLGLFL
289100
LLLVFVVYKVFIYPYLRLSTFSKKHGKKVKVYFNFGVGLLKEYKQSLSQYKDSLAFFR 289273
289274
NLHQKSLQAIAFCFGTKVGLIFVDSNLIKQVHQNHEAFMKIDASMAIIYLFGDSLLYA 289447
289448
KGKEWQRQRQFLGKSFHFEEIKNYLPSIKETSMKLFGEIKEQLKQSDQIEIQVVKTCESV 289627
289628
TSEVSFRVFFGQTSQNLKITKEDGSQISVAQELVSTITNSFQILQNDKISLIK*IF 289795
289796 FERNTIKYFPTKNEQNLQKRLIEIKKQCMQVVEKRRIELIEVIKSAKNNFLHQYLK
289963
289964
EMIVNEKSKISNEEIIDNFLALFFAGTDTTGNMTRVALYYLSLYPDIQNKAREEIIKLAS 290143
290144 SRTKSTNPVDLFNSLTFE 290197
290198
DIQNLNFLNSVLKESLRLIPPAIEVFPRVAIQDIQIGDFEVKKGDFINTFFIYNFS 290365
290366 NPEIYPEPNNFDPSRWMKQID 290428 (2)
290492 QQNTFNFTPFSLGPRNCIGQHLAMIEGKSMLAYILLNFEIFPNKNQEVVKEMK
IIYGFQKDNLVYFKNRSQ* 290707
region
around intron in CYP5006A7
TCCTTCAAGATGGATGAAACAAATAGAGTATTTATTTAATATTCATAAATAATTAGTTAA
R W
M K Q
I E
CATTAATGTATTTTTTTAATGAATAAAAAGTTAGTAGAATACATTTAATTTCACTCCCTT
S2 Q0 Q0 N T F
N F T
P F
CTCATTAGGACCAAGAAATTGCATTGGATAGCATCTAGCCATGATTGAAGGAAAATCTAT
>CYP5005A8 8254815h 909308 bp 16 contigs 10 genes all in (+) 30% to
CYP694C1
381326 MISLLHLIFGVLLIPIILIIYKVILYPLTRLLYLKYVHLDKVRIFYTYGQGFFPQFKKNLLEKK
381517
381518
DSLAFVREMYSKKLQTALFSIGTKVGFIFLDPELIKQVHQNADAFKKNDGSLAITYLF 381691
381692
SNSLLFVKGKEWQRQRQFLAKSFNFQDIKNYLPIFKDVSSKIFGLIRNNLEISGTQEIEI 381871
381872
VKTCEKVTSEAVFRVFFGQTSQNLMVTSKDGSSTLLAHELVAVVVDSFHMLLHDKLLLI 382048
382049
KFTMLGKNSINILPTQSELKLLNRLKEVKRVCLEIVEKRRNELLKDQSQFKNNFLDQYL 382225
382226
KETLINQNKLITDEEIIENFIGLFFAGTDTTGNMTGVALYYLSLYPEIQQKAREEIQKVLS 382408
382409
SKCNSNENLDELFNSLTFEDLQQLDLINSILKESLRLIPPAPSVFPRIAERDIKIGDFQL 382588
382589 KKGDFVNTYFIYNFYNPDQFSNPEVFDPYRWMNQNE
382702 (2)
382764
SQNVFNFTPFSLGPRNCIGQHFGMIEGKCMLIYALLNFDILPNKNQEVKKI
MGTIYGFEKDNLVYFKKRNINN* 382970
TTACAATTTCTACAATCCTGATTAATTTAGTAACCCAGAAGTCTTCGACCCATACAGGTG
R W
GATGAACCAAAATGAGTAAACTTTTTTATTTTAATTTTTTTGATTTAAGATCTATTTTAA
M
N Q N
E2
TTAAAAGGTCATAAAATGTATTTAACTTCACGCCTTTCTCCTTAGGCCCAAGAAATTGTA
S Q N
V F N
F T P
F S L
TAGGTCAGCATTTTGGTATGATAGAAGGAAAATGCATGCTTATCTATGCTTTGCTTAATT
>CYP5005A9 8254815i 909308 bp 16 contigs 10 genes all in (+) (+) 32% to
CYP694C1
MIGTLFYSILGI
392542
IILTFLVLLYKLVFYPKLRLAAIKSKYGDKVKTYFYVGEGLLAEYRNNLLKYNDSVYFM 392718
392719
RNLHQKNLKALAFNFSTKVCISFLDPMLIKQFHQNHDAFKKVDVSMAITYLFGDSIIF 392892
392893 ARGKQWQRQRQFLGKSFHFEEIKNYLPQIKQISENVFGNINLNGSENEEICAVQICQKVT
393072
393073
SEVVFQIFFGSTSQNLVITTQKGHQIKVAEELISAIVDSMQIFETDKIALLKWIF 393237
393238
LEKNSTKYFTTKSEKQLKDRLIAIKQACLNVIKKRNDQLSQDISLVKQNFLDLYLIEM 393411
393412 ITNQNTQITYEEIIDNYLTLFFAGTDTTGNMTGVALYYLSVNPQIQEQARDEILQKLSQK
393591
393592
TASKDPKYLFDSFSFDDLASLNLLNSILKESLRLIPPAIDVFPREAVQDIKIGEFEIKKG 393771
393772 DIVTSHFIYNQSNNEIFSNPDIFDPQRWMNGKE 393870
(2)
QQNAYNFTPFSLGPRNCIGQHLAMIEGKCMLVYILLNYQILPNTNQQLIRENNLV 394107
394108 YGFKKDNLIFFKKIIL* 394158
GATCCCTAAAGATGGATGAATGGCAAAGAGTAATTAATTATTACAATATTTCTTTAATTT
R W M
N G K
E2
TTATATTAATTAATATTTTATTCAATTTTTTTAAAAAAAAGGTAATAAAATGCATACAAT
Q Q N
A Y N
TTCACTCCTTTCTCATTAGGACCTAGAAACTGTATTGGCTAACACTTAGCTATGATAGAA
F T
P F S
L
>CYP5005A10 8254815j 909308 bp 16 contigs 10 genes all in (+) (+) 30% to
CYP696C1
404751
MIMAILITVFAVLTIIIVKLHFIPIIKIMILKRKHPNQIESIVHLGPSLVREE 404909
404910
KYNLKHYDDNIYSIRNIYKKNQDVKLILYKNAKGQLGYIFVDPELIKQVHQNPEVFQKQM 405089
405090
SNPIIKYLFQDSLLFKTGKEWQRQRQFLGKSFHFDEIKNYLQDIKQVTKEELQKVKLKIE 405269
405270
SQPDRSIEIVKTCQNITQEVVFRIFFGSTNLKVSKPDGKEIPIADEMISSIESFQVLFRS 405449
405450
NYLLFAKFVILGQNVLEVLPTNKEREIKQRFIDLKKACKLIISQRKEQL 405596
405597 IKNPTLYKYNFLDQYLKEVIVNNNQSITDEEIIQNSLSLFFAGTDTTGNLVGSALYYLSL
405776
405777 NLTIQNQAREEVLKVLSQKKLNENLEEKL
405864
ASLTLQDLSNMNFLNSILKETLRLIPPAPNVLSRICVQDFQIGDFFIKKGTPVGTYFISS 406043
406044 QMNPKLYPNPETFDPNRWMNAQE 406112 (2)
406173
QNSINFTPFSLGPRNCIGQHLAIIEAKCILSYILINYDILPNENQKIQFRSKFVYGIQPDNFITLKIR* 406379
TTGATCCAAATAGATGGATGAACGCACAAGAGTAATTAATATTTTATTTAATTAATTTAT
R W
M N A
Q E2
TCATTCAAATTAATAAAATTATATCTTATAGACAAAATTCAATAAATTTCACTCCATTTT
Q N
S I N
F T P
F
>CYP5005A11P 8254665d 877599 bp pseudogene fragment, no additional P450 seq. found
332385
QNNAFNFTPLSLGHRNCIGQYLAIIEEKYMLSNNFLNYNILLYPSEKLINNLWYLSRQFCLF* 332197
>CYP5005A12P 8254665a 877599 bp pseudogene N-term is missing, 80% to 8254665c
362819
LDPEIVKQVHQNHDAFLKMDISQAFI (fs)
FLFSNSLLCVSGKEKQRQRHF
(fs)
FLDKSFYFD
362652
362651
EIKNDFPIIKEVSLVVSDEITQDLNNSAQKMIQVVK (fs)
KICEKLTSEVAFRAFFSQNSKNIIIKRED
GSTILFSSEIVQLLIDSIRILQTKKLLLLKDVILKRKSQNIFPMKKEK
(fs)
TLFSYLNVY
(fs)
KYTFKQFIMQRKEELQKDPSLFIRNFLDLYLKEIPLNENKETIIDEIIINLCALIF
362121
362120
VGTDKIGNMTGVSLYYLSQNSKIQEEARDEVINILKSIIQSQDPQDLFKALTFEDLSQML 361941
361940
LVHSILKVSQRLIPPSE (fs)
361890
AVLGRYANKDIEIGEFLVKKGYLVNTHFIYSQSNPDLYQN 361771
361770
PEKFDFYRWMNQKE 361729 (NO INTRON BOUNDARY)
QINAFNFTPFSL (fs)
361640
LSRNCIGKYLAIEENCILTNYLLNYNIL 361557
PNPSQEPIKKTKKNVLFFSRLVSLF*
>CYP5005A13P 8254665b 877599 bp pseudogene N-term missing, several frameshifts
EIVKQVHQNHDAFLKMDISQAFVFLFSNSLLCVSVK
366568
366567
EQQRQINFLCQSFYFDEIKNDFPTIKQVSHVISDEITQDLNNSAQKMIQVVK 366412 (fs)
366412
ICQKLTSEVPFRALFSQTSKDITIKRQDESTILFSSEIVQLLIDLICILQTNKLLLLK 366239
366238
YVILKRKSQNIFPMKNETSFTCLNVYKVIMQRKEELQKDSSLFKRNLLDLYLKE 366077
366076
IPLNENKEITIDEIITNFCALFFAGTDKIGNMTGVSLYCLSQNPEIQEEARDEV 365915 (fs)
INILKFFVQSQD (fs)
365879
KTLKTYLKHLHSK
365840
IYQMPLVHSILKESLRLIPPAAAVLGRYANKDIEIGEFYVKKGYLFNTKLxx 365691 (fs)
SQRNPDLYQNPEKFDLYRWMNQKE (no intron boundary)
KINAFNFTPx
(fs and small deletion)
365545
PRNCIGKHLAIEVNCILANYLLNYNVLPNPSQKILKRQKKINGFFLDNLVYFELRK* 365375
>CYP5005A14 8254665c 877599 bp 4 genes 29% to CYP694C2 complete
368910
MISGFATFALGLLLTLLLFAFYIVILPNIKLNRIKRMYGDKIKIIYHFGVGLLGEYRKGLKNKNDSL 368710
368709 SFLREMALTEKSKVKAYSFNIGLKLGYCFLDP
ELIKQVHQNHDAFSKMDISQAFVFLFS 368533
368532
NSLLCVSGKEWQRQRHFLGKSFHFDEIKNYFPTIKEVSLVVSDEITQDLKNSSQKKMIQV 368353
368352
VKTCEKVTSEVAFRAFFGQTSKNITIKREDGSTVLASSEIVQLLTDSFRILQTNKLLLLK 368173
368172 YVILKRNSQNIFPMKEEKSLFARLNVFKDTCKQVIMQRKEELQKDPSLFKKNFLD
368008
368007
LYLKEILLNENKEITIDEIIDNFCALFFAGTDTTGNMTGVSLYYLSQNPKIQEEARDEV 367831
367830 INILKSRTQSQDPKDLFNSLAFEDLSQMP 367744
LVHSILKESLRLIPPAAAVLGRYANRDIEIGEFSVKKGDLVNTHFIYSQS 367594
367593 NPDLYPNPEKFDPYRWMNQKE 367531 (2)
367445
QKNAFNFTPFSLGPRNCIGQHLAIIEGKCMIANYLLNYNILPNPSQ 367308
367307 EPIKEAKTIYGLCPDNLVYFELRK* 367233
>CYP5005A15 8254543 195986 bp 3 contigs 28% to CYP694C2
82913
MSLLITIICIILLLLTCIAFKIVVYPQIQIFYFKNKYGKDVEVLYHFGSGLLQEFKQSLEKEG 82725
82724 DSFGRIRKIMQINPDAQAIIHNNASGKMAYSFIDPDLIKQVHQQPEYFEKQIAHPAIISL
82545
82544
FSKSILYLSGKKWQKQRQFFGKSFHFDEIRNYLPVIKQTSQDVFSSIKQSLNNSLQQEVQ 82365
82364
VVKTCEKVTSEVVVKVFFGSTSQNIIVTRSDGTKLPVAQELIEIMVDSFRVFRNSKFAFV 82185
82184 KFLLLRQRSFQILPTQSEKDLAQRIHNAREECNQIIQDRKQQLLNDPT
82041
82040
QFKSNFLDMYLKEIIINKNEDIKEEYIIDNFLALLFAGTDTTGNMTGAAFYYLSLNTDIQ 81861
81860 SRAREEIISILNRKGQQNNTEELYNSM 81780
81779
TFEDIANLNLINSILKESLRLIPPAAAVFPRRVIKDIKIGNFQLKKGDAINTQFICNHYN 81600
81599 PQIYKNPDIFDPNRWMEQKE (2)
81473 QNLFNFTPFSLGPRNCIGQHLAMIEGKCMIAYVLLNFEIIPNHEVQVKKEIKLTYGLLPDNLVYFKKRQQI*
81258
CCAATACAATTTCTAGGACCTAGTGAGAAAGGAGTGAAATTGAATAAGTTTTATCTTAAT
P T F
N F L
N Q
ATATTTAAATAATAATTAAAATACGAAAAACTTTTAGAAAAATAAATTTAAATTAATTAC
TCTTTTTATTCCATCCATCTGTTTGGATCAAAAATATCTGGATTTTTGTAAATCTAAGGA
M W
>CYP5005A16 8254716b 1191929 bp 2 genes 30% to CYP694C3
374097
MSFLAYFALIATVTVIYLAFRIVIFPKLKIKKLKNKYGDQVIVLYNFGSGLI 373942
373941
QEKKKSLKQQGDSLGTIRKLLWSNPKAKAIIHNNASGQIAYSFIDPELIKQVHQQPEFFV 373762
373761
KQVNHPAVLYLFSKSILCMQGKEWQKQRQFFGKSFHFEEIRNYLPIIKETSQSVFGSIKE 373582
373581
NLSDSSQIEIQVVKICESVTSEVVFKAFFGFNSQNLINGSQNSLVQEIVSIMTDSYRIFQ 373402
373401
FNKLAFIKFLIFRKRSYDFFPLQCEREIVQRLRKVKEECDKIVQQRKQQLLNNPSL 373234
373233 YKYNFLDQYLKEILLNKNQDIREENIIDNFLALLFAGTDTTGNMVGTSIFYLSQNPEI
373060
373059 QNKARDEVIEL 373027 LRKNHQCNKMQ
372993
DLYEQMTFEDITNFNYLNSVLKESLRLIPPAVNVFPRKVIKDLQIGKFQLKKGDAVNTNF 372814
372813 ICGHYNPQVFKNPEQFDPSRWMNANE 372721 (2)
372681
QNQFSFTPFSLGPRNCIGQHLAMIEGKCMLAYILLNFEIIPNLKVKVRKEAKMSYGLI 372508
372507 PDNLVFFKKNCKNYD* 372460
>CYP5005A17 8254693 27987 bp 1 contigs 31% to CYP694C2 complete
3973
MLVESLLYITLFFALGIIFLIFRAIIYPKIRLLQLKKKYGDKIQIVHHYGTGLPM 3809
3808
EYRKGFVERNDSMAFVRKMAAEKTNKMQVFGFNIGYNVGLSFLDPELIKQVHQSHDAFQ 3632
3631
KINISQAFNYLFSDSLFYATGKNWQRQRQFLGKSFHFDEIRNYLPSIKELCSKVTDQLN 3455
3454
KDLKANPQQEIQVINVCEKITSEVVFRVFFGSTQENMMITREDGSQLHLSEELVSLIMDS 3275
3274
FRILQQNKLLLLKNLILKSYSFKIFPLKEEKWLLNRLVQVKAACEKVVLKRKEELQ 3107
3106
QNLDQYKNNFLDLYLKEMIENKNSGITYDEIIANFCGLFFAGTDTTGNMTGVALYYLSL 2930
2929 NPQIQKEARDEVIRVISKKNQDIKGNDLFSSLQFED
2821
LSNMNLCNSILKESLRLIPPAFGVFPRYVTRDIKIGQYELKKGDLVNTHFIYNQSNPQTF 2642
2641 TNPDKFDPYRWMNGNE 2594 (2)
SQNAFN
2521
FTPFSLGPRNCIGQHLAMIEGKCMLANFLLNYDILPNESQKVQLEMKIIYGFSQDNLVYFKKRQQ* 2324
>CYP5005A18 8254583a 93101 bp 2 genes 29% to CYP694C1
74438
MIVLIYSSLIIFATIIAIGFYIFVIYPTMRFLVMKYKYGHKVKVFFSIGTGIL 74596
74597
KFYNQSLIQNKDSIAFIRNMHQKGLQAIAFNLGTKVGLSFLDPELQKQVHQSYQSFR 74767
74768
RIDAIAAMTYLLGNSMIYAADKEWKRQRLFLGKFFHFDEIKNYYPTIQETTKQVLQGVNE 74947
74948
QLNQKEQIEIKAVYIIQKITSEATFKMFFGLTTESLMITRKDGTQIAFSDELALLIRHSF 75127
75128
QLLQTDKIALLKWILLKRKSTDYFSTKGEQDLMDRLKNLRQGFIKIVSKRKEELSQ 75295
75296
DPSKAKNNFLDYYLTEMVQNQYSGITYDEIVDNFTGIFLAGTDTTSNMVGSALYYLSINP 75475
75476 EIQQKAREEVVQVVSSKQDFKKHDQLASQLRFEDITHFNFLNCILKESLRLIPPVIQITS
75655
75656
RIANHNMKIGEFDIKQGDLVTTNFAYNLSNPDIFPNPELFNPQRWMTANN 75805 (2)
75861
YSETFSFTPFSQGPRNCIGQHLAMIEGKSILASILLQFEILPNPNTEVIMDM 76016
76017 KMMYGFQNDNLIYFKKLKQ* 76076
AAATTTTGCATATAACCTATCAAATCCTGATATATTCCCAAATCCTGAATTATTTAATCC
TTAAAGGTGGATGACTGCAAATAAGTAAAATAATTTGTCTTTTTATAAGTATTTTATTTA
R W M T
A N K
AATCTAAATTTTATAATAGTTATTCCGAAACATTTAGCTTTACTCCTTTCTCACAAGGTC
Y S E
T F S
F T P
F S Q
CTAGAAATTGTATTGGCTAGCATCTTGCTATGATAGAAGGAAAGTCTATTCTTGCTTCTA
>CYP5005A19 8254583b 93101 bp 2 genes 29% to CYP694C1
78030
MIILFYIILGIIATILAIGVYILVIYPIIRFSEMRRKYGDKIKVYFSIGKGIFNFYQQD 78206
78207
LIKKNDSLAFIRSMHHENLKAFAFNYGTKVGLSFVEPEHQRQLFQNIHSFSKADGPI 78377
78378
AITFLFGHSIAFAPEKEWRRQRQFLSKSFHFEEIKNYLPLIKQSCEKIVKKLDTKLLQND 78557
78558
QIEVKVVGISQEVTSEISYKIFFGSASENQMKITKKDGTQIPISEEIVQIIIDSFTLLKT 78737
78738
DKLALIKWILLKRNSVNYFPTKGEKEILIRLQALKQVQIDVVTKRNEELIKDPSQA 78905
78906 KKNFLDQYLIEMIQDKNSGITYDEIISNFAAIYFAGTDTSGNMVGVALYYLSRFPEIQQQ
79085
79086
ARAEIIKVLSSKKEIMKTEFSQLEFEDIQNLNFIHCILKESLRIMGPSISSQIKQADH 79259
79260
NIKIGEFDIKKGDLVTAHYFYNHTNPSVFENPDDFKPERWLDSNN 79394 (2)
LKQQSNFSPFSQGPRSCIGQHLAIIEGKCLLASILLQFEIIPNT 79578
79579 SQQVIRELKSIYTFQNDNLVFFKKIKQINQKTEMMI*
79692
ACAATCACACTAATCCTTCAGTTTTTGAAAATCCAGACGATTTCAAACCTGAAAGATGGT
R W L
TGGACTCAAATAAGTATAATATTTTTTTCATATTTTTTATAATTTATTAATTATTTTTAA
D
S N K2
AATAGTCTAAAGCAACAATCAAACTTTTCTCCTTTCTCACAAGGGCCAAGAAGTTGCATT
L K
Q Q S
N F S
P F S
Q
>CYP5005A20 8254626 31828 bp 2 contigs 30% to CYP694C1 complete
27801
MIFLIYISLGIIASIIAFVSYILILYPIFRFSQMKRKYGDKVKVFFSIGQGILKFYNQDLAN 27986
27987
KNDSIAFIRNMHEPNLKAIAFNFGTKVGFSFVDPELQKQVLQNPHSYSKVDGPMAITF 28160
28161
LFANSIAYATEKEWKRQRLFLGKSFHFEEIKNYFPLIIETCQKTIKKIDQKLIERETINI 28340
28341
QILKTCQEITSEVSFKVFFGSNNENLTIITRKDGSSTTISNELVQVLIDSYYLFQTDKVA 28520
28521
LIKWILLKRKSTSFFLTKGEEQLLNRLKALRQACTAIVSKRKEELTQDRLLAKKN 28685
28686 FLDQYIIEMAQNENSGITYDEIIDNFSAIYLAGTDTTSNMAGVALYYLSLYPEIQQQARE
28865
28866
EVIKVLSVKLKENKTDQLFSLLAFEDLQNLNLINSILKESLRLMPPAIQSQTKFANQDIK 29045
29046 IGEFDVKKGDLVTNHFSYNLSNPEVFPNPDVFNPNRWM
29159
AANI (2)
HNEMANFTPFSQGPRNCIG
QHLAMIEGKCILASLLLQYEILPNPAEKVVRQMRIVYGFQYDNLVYFKKINV*
29441
ATTTATCAAATCCTGAAGTATTCCCTAACCCAGATGTCTTTAATCCTAATAGATGGATGG
R W M
A
CTGCAAATATGTAAATTTTTTTATTTCATTTTAAAGTATTAAAGTATTCATTTTATATAA
A
N M2
ATAGTCACAACGAAATGGCAAACTTTACTCCCTTTTCATAAGGGCCAAGGAATTGCATTG
H N
E M
>CYP5006A1 8254798 983259 bp 23 contigs 28% to CYP689A1 no introns
753493
MSIIQLFIGFIIGTIIYQTVIKTLYLYYVYKRRYGKDIIVLFYPVIGWYYFVNQ 753654
753655 SFKKYGDSHQFYKTLIKENPNAKAILALSGFTNIITIIDTNWVKKCLTNQHLFFKKDGPF
753834
753835
GFDLLFGEGLLFKNKQEWKNQRAFLQGNFHFNALMERFQMFKDCNLQFCSEVPSNGEYVK 754014
754015
LDYYNEAAKVSSDIISYSFMGRSINEIKSNQKSFSIELINLVKDCFVYRITNPF 754176
754177
FMIKASVFGPRKSTTSFNSSTEKGLLSRIKSINNCLYNIIEEREQQLKK 754323
KYKNHNEMVPANFLDMYLKEIWKQEEENKESKNKITLT
754438
RKEIVNQYITVIFAGTDTTSHLIGNILFELSRNPDVYQRLQKEIDDNILSFENMKYEDLN 754617
754618
NLPYLRLVIKETQRLYPAIFYLFGRFMDKSYTDEDLFISKDLEVQIRLVSQQEKQSGLFS 754797
754798
DIQKFNPDRYNSTDKEDPYDFIPFSAGPRNCIGQYLALIETKVAIVYALKKFHLKKRDDY 754977
754978 ELKMFQEFVYSPVEKDLFYIKRK* 755049
>CYP5007A1 8254605 384764 bp 4 contigs 29% to CYP694C2
93611
MIKYIVYLVALILTLLSLIILRQAYKSLRLCLIYRKKYGNDVKILFYPILGIGYFMNQ 93438
93437
SFEKHDNSYEWMKRIVRENPNVKAILILSGLFDIIILQDEKWVKNFSQDHILYYKN 93270
93269 DGNFGYTFAFEKGLFYSSFGDWKRQRIFLNSSFHFESLKSYVPIIKEQTKKLLANLKEED
93090
93089
GFVKLNIFQELQKITSEIITISYFGKSLVNVKCKSGIELYKENNDIIQDSYIYR 92928
92927
FTSPYFILKACIFGAKKASMIYHNQAEKKLIARMKDIFETYEKIIDEELEKIKMSKI 92757
92756 DPMDIQPKTMIELYLKEYLIQQ 92691
QNRATMNPNDIFE
92651 KKEIIHQFLTFYFAGSETTSHLVAMTFYELTKNQQIYDKLMKEINENAKDFDKVEHSDLS
92472
92471
KFQYLDIVFKETGRLHNAIAFTSPRVTDQDYIKEDMYLNKDLLVCHRLTQYSQQCEVLKD 92292
92291
SDEYIPERYSRNTKGFSSFDVIPFSTGPRNCIGQHLALIKSKFLVIYFLKNFELKELPGY 92112
92111 KLRTMQKLSNHPIDEEIVLIKRKKQ* 92034
>CYP5007B1 8254600b 816542 bp 2 genes 30% to CYP694A1 50% to
8254600a complete
505285
MIFNLLIGFGLIILLILLRITYKTARLYLIYKNKYGNQVKILFYPILGLAYFMQNSLKLHDDGY 505476
505477
QFLKQIVKENPKVKAIIILSGFMDIFIANDNEWIKYLSQDQKQFYKDDAFFGYDSLH 505647
505648 GEGMPFCDYNRWKKQRLFLNSSFHFDSLKQRVPILKDEINNFCSFLKPSDGFVKINCYNQ
505827
505828
LKLITSEFFTRTFFGRPFKDVKCKNGKNISLEICDICSDGYTLRISSLYFIIKA 505989
505990
AIFGCAKASHILQSQAEKKIMNRIRDLYSIVEGVVDEKISEMSKMKDLIQYQPTN 506154
506155
FSELYVKEYLLQQKNIQNYKKEDIISKVELVQQYLTFFFAGTDTTAQLTSMCLYEISKNPSI
KEKIFSELKQCGKNYEQLESSDLQKLPYLDAIIRECNRLYSVVAFLFPRRTDTNYQRGDL
YIDKDLLATNRLIQYADEKEFFNESEKFIPERFLNQKNVQQLSPYDMIPFSSGERICIGQ
NMAIMEAKFLILYILTHFEIAEVPNYKLRMSQFMINQPVDLEIIQIKRI*
>CYP5007C1 8254600a 816542 bp 6 contigs 2 genes 30% to CYP689A1 complete
508388 MINLIKISFYLFLTVFSILIAIQVWKTLRIYLIYKKKYGNQIEFMFFPLLGLIYFM
508555
508556
QKSLKENDDGAHFIKKVCREKPHVKAIIILSGLFNYFIAIDNEWIKFLSQDQDRFYKN 508729
508730
DGIFAFSKMYGDSLLFTWSKMWKRQRALLNSSFHYDSLKARVPIIKDEAKNFCNFLKPE 508906
508907 QGFVKVDMFDETQKITSEIITRTFFNREFKNKFISNGKSLTQEIGDITQEAYNY
509068
509069
RLSSPYFILKSSFLGNTGASFILNSPTEKEYLQRMDQILKIIEQDVDQSIQEVIST 509236
509237 GIDIFEYQPRNFIEIYIKEYLIQQ 509308
KNLKSIPQADIITKKEIVQQYLTFYFAGTETTAHLISMTLYALA 509440
509441
NNKQVYEKLMKEIKENIKQYDEFQHNNLLQLNYLDLVIKESTRLYVAAPFIFPRVTNQDY 509620
509621
KRENLFIRKDLLVMYKIAQYSEDQSQVFKDSDLFIPERFLNGKINNPYEMIPFSSGSRNC 509800
509801
IGQHMAILEAKYILIHILMNFEIEPVPNYKLSIIQKVTNAPLDQKLVLIKRK* 509959
>CYP5008A1 8254716a 1191929 bp 2 genes 37% to CYP694C1 = EST seq 5
and seq 4
MIGYIAIALVLLFV
414374
YRFIIKNLMIYFTYKSKFGDKVVFMFYPLLGNFGISRKSFRQYGDSLHILKTIQRTKPDV 414553
414554
KAVMVFNGFIIQIIIIDTQLQKVFLQNTKPYYKVEGPFGARDAFGQGLVVAEEKVWTRQ 414730
414731
RNFLSNSFHFNALKNRVLSSRRLPRSSWATLPSDGKTPITIIEELQNITSE
VVIQTFFGENLKGMTVNGLQPSVEISKIIGDGFSYKANSFAYFLKLMVFGQEKASRVLNT
TFEKNFLKRVENYNQFIEGIVDKRLSELEKLTDTSKVDENFLNLYLLEYIKQQKALKENPKIYADYEIIP
415274
KREIVHQFTTFFFAGMDTTANQTGICLWVLAQHPELQQKIRAEIDSVIQTFDDLKHEDLN 415453
415454
KLEYFNAFFKESLRVYPTAPQVIPRVSARDHMVDDFPIPKGAFVSNLTIQYNEQKFPLLC 415633
415634 KDIDTFNPDRFLDKNIIQDHFSFIPYSAGPRNCIGQHLALIEAKIMIAYILKNYVVLPNE
415813
415814 EHQKVRFNHLFLYTSYERDIIKLKKLTA* 415900
>CYP5008A2 8254176 54804 bp 4 contigs 34% to
CYP694C2
32112
MLGYLLIPLFLYFVYRFIIINLIIYFRYKIKFGDKVAFLFYPLLGGLGISRKSYR 31948
31947 KYGDSLHILKQIQRNKPDMKAVVIFNGFNIQITLIDTGLLKVFLQNTKNYYKAEGPFGAR
31768
31767
YVFGLGLVKSEGKIWARQRSLLSNTFHFNALKDRVPVIKEITKEYLQKLDSYKDSAVPI 31591
31590
IEELQNITSEVIIQTFFGENLRDKKVNDLQPSVEISKIILEGFKYKANSFPYFLKLMVF 31414
31413
GQEKAFRVLNTQFEKDFIKRVENFNQFIEEIIEIRLNQLQNTYETSQVDENFLNLYLL 31240
31239 EYIKQQKALKENSKQFADYELIS
31170
KREIAHQFSTFFFAGMDTTANQTGICLWLLAKYPEIQKRVSDEINQAVNNFDQLKHEDLS 30991
30990
KLDYFNAFFKESLRMYPTAPHIIPRVSACDHFVEDFFVPKGAIINGVTIQHNEQKYPSLC 30811
30810
KEIDTFNPDRFLNQNVIQDHFSFIPFSAGPRNCIGQHLALIEAKIIIIYMLKNYVIIPND 30631
30630 ELENVEFNHLFVYTSIQRNLIKLKKIKSE* 30541
>CYP5009A1 8254437 547543 bp 2 contigs 29% to CYP691A1
118190
MITTILIALTFIGIALLLFKAFIQPLYRISFYTKQGLKQKFFVPFLGL 118047
118046
LYQMIKDMNGKQKDAMYTLKHLPQNIKEQNLDGFVANLGSTPVIYLLDATLIGEFLQKQ 117870
117869 TEYYQKSEFATKMSILLFGKGVVFSEGEEWKTKRKLLSNTFHYNLIQDQVKLVLNVTEKI
117690
117689
FEQQDQNSVCFLQTVEKIGGDVGLMSFLGADINNYTLFGQEPTAGINQYINQLM 117528
117527
SYMKNPLTVLMGGYLFNNQVRGSDKKVKQGSQEIKNLLKKVVLDVIDRTKNQMNDGKQNV 117348
117347 DTSIISLMLKNGQLKTEEEIDELVILTLNFYFAAKDTVSRLVANLMYYIGKNDHIYNKLE
117168
117167
KEILSFKDEEINIENMKKYNYLEAVIKESLRVCGPSPFLIQRTALQDHNL 117018 (1)
116968
GKYKIQKGTDVNCVFIYNFYNEKYFENPFEFNPERWLDQAQLEKIKINPFSYLPFSGGS 116792
116791
RNCIGQYFAMMEIKAIMIYFMRTYEKFQIPENFQYHIIIKTTIETKDHLTCSLKKR 116624
CAGTTCCTTTCTATATTTTGTATTTACCTATTTATAAAATTTAAAATAAGAATTTTTATG
Y K G
I
AAAATATTCAGTTTACCTAAATTATGATCTTAAAGTGCTGTCCTCTGAATTAAAAATGGA
G1 L N H
D
>CYP5010A1 8254362 275164 bp 1 contigs 26% to CYP694C2
80265 MGLLQLLVLVCLVFIILLFLKYYVLTKIAMRYYSKQGIKEYKYYPFTGIASEWIDQSLDD
80086
80085
SLKTIKYIYFDDKFKSQDAILSNFLDKALLIFINPNLINEFLQNQQNYVKFENAL 79921
79920
TGFKQFLGNGIAFEEGEKWKSKRRFVSQLFKFDVVVNYLSSIRKVTQMMLNRVEDSNFQ 79744
79743
VQKQFEALNSAISVQIFMGANIDEYKIDDKNSYDAIDQMCSDIFYHYQN 79597
79596 ITTAFLGEKFVKLKLRQSDKVICDKVQYVRQQMKNIILDTINREKINIQQNKESEN
79429
79428
ISIIALLLKQGHQIQEEKDIIDVLELTFSLYFASRDTLASVLTIMLYYLLKNPDCFQTVE 79249
79248
KEILSLPQNYTFSDIQKLDFLQACFKETIRINTSAPIILMREAKVDHKIGNYKIQKG 79078
79077
TFVNVGLISNQLNERYFKDPLTFNPNRWFDQSTENVIKENPFVYLPFSGGMRNCIGQHLA 78898
78897 NLQAKIVLIEFIKKYQ 78850
QPELPKDFVLELVIKQNYQIKNPFILNLQKRQNN*
78745
>CYP5010A2 8254542 59795 bp 2 contigs 27% to CYP691A1 complete
8155 MTILDFFGIS
8125
LLTFVIFLVLKYFVFTKMAMRFYANQGINEYKYYPISGMAREWMDKSSNDNLKTVKYIK 7949
7948 FDDKYKGEDAILSNLLDRALLIFIEPNLINEFLQNQQNYVKFEAPLIGFKQFLGNGI
7778
7777
ALEEGEKWKIQRKFISKLFKFDVVVNYLSSIRRVSKTMLDRIEDSNFQVLKQFEAINSAI 7598
7597
SVQIFMGANIDDYKIDGKNPYNAIDTMCAEIFHHYMSIIPIIFGEKFVK 7451
7450
LKLRKSDRIICEKVQYVRQQMKNIILDTIKREKFHIEQNKEQENINIIALLIKEGH 7283
7282
KMEDEQEITDILNLTFSLYFASRDTLASVLTMMLYYLIKNPDYFKQVEKEILNLPPNYTF 7103
7102
SDIQKLDFLQACLKETIRINTSAPILFTREAKVDHKIGNYNIQKGTYVNVGLISNQL 6932
6931
DEKYFKDPLNFNPNRWLDHSTENMIKENPFVYLPFSGGMRNCIGQHLANLQAKIVFAEFI 6752
6751 KKYQLPELPNDFVLEFTLKQNYQIKNPLILNLKKRQSN*
6635
>CYP5010A3 8254220 4918 bp 2 contigs 27% to CYP694C2 complete
1829 MAILELIGAS
1799
LLAFIIFLVIKYYVLTKMAMRFYANQGIKEYKYYPVTGMAKEWMDYSSSDNLKTIKYI 1626
1625
KFDDKFKGEDAILSNLLDRALLIFTNPNLINEFLQNQQNYIKFEAPVVGFKQFLGNG 1455
1454 IALEEGEKWKSKRKFISKLFKFDVVVNYLNSIRKISKMMLDRVENDSNFSVQKSFEAINS
1275
1274
AISVQIFMGANIDDYKIDGKNPYDAIDTMCFEIFQHYLNLIPGILGERFVKLK 1116
1115
LRKSDRIICEKVQYVRQQMKNIILDTIKREKSHAEQNKESENINIIALLIKE 960
959
GHKLDEEKEIIDILELTFSLYFASRDTLASVLTMMLYYLIKNPDYFKLVEKEVLSLPQN 783
782
YSFSDIQKLDFLQACFKETIRINTSAPILFTREAKVDHKIGNYNIQKGTYVNVGLIS 612
611
NQIDETYFKDPLTFNPNRWLDHSAENMIKENPFVYLPFSGGMRNCIGQHLANLQAKIVLV 432
431
EFVKKYQIPQMPNDFVLEFVIKQNYQIKNPLILNLTKRQAY* 306
>CYP5010A4 8254598b 267927 bp 3 genes 27% to CYP694C2
267521 MAFLELIGASILAFIIFLVIKYYVLTKMAMRFYANQGIKEYKYYPVTGMAKEWMDYSSSDNLKTIKYI
267318
267317
KFDDKFKGEDAILSNLLDRALLIFTNPNLINEFLQNQQNYIKFEAPVVGFKQFLGNG 267147
267146
IALEEGEKWKSKRKFISKLFKFDVVVNYLNSIRKISKMMLDRVENDSNFSVQKSFEAINS 266967
266966
AISVQIFMGANIDDYKIDGKNPYDAIDTMCFEIFQHYLNLIPGILGERFVKLK 266808
266807
LRKSDRIICEKVQYVRQQMKNIILDTIKREKSHAEQNKESENINIIALLIKE 266652
266651
GHKLDEEKEIIDILELTFSLYFASRDTLASVLTMMLYYLIKNPDYFKLVEKEVLSLPQN 266475
266474
YSFSDIQKLDFLQACFKETIRINTSAPILFTREAKVDHKIGNYNIQKGTYVNVGLIS 266304
266303 NQIDETYFKDPLTFNPNRWLDHSAENMIKENPFVYLPFSGGMRNCIGQHLANLQAKIVLV
266124
266123
EFVKKYQIPQMPNDFVLEFVIKQNYQIKNPLILNLTKRQAY* 265998
>CYP5010A5 8254598a 267927 bp 2 genes 27% to CYP694C2 1aa diff
to 8254598b 3aa
diffs to 8254220
258120 MAFLELIGASILAFIIFLVIKYYVLTKMAMRFYANQGIKEYKYYPVTGMAKEWMDYSSSDNLKTIKYI
257917
257916
KFDDKFKGEDAILSNLLDRALLIFTNPNLINEFLQNQQNYIKFEAPVVGFKQFLGNG 257746
257745
IALEEGEKWKSKRKFISKLFKFDVVVNYLNSIRKISKMMLDRVENDSNFSVQKSFEAINS 257566
257565
AISVQIFMGANIDDYKIDGKNPYDAIDTMCFEIFQHYLNLIPGILGERFVKLK 257407
257406
LRKSDRIICEKVQYVRQQMKNIILDTIKREKSHAEQNKESENINIIALLIKE 257251
257250
GHKLDEEKEIIDILELTFSLYFASRDTLASVLTMMLYYLIKNPDYFKLVEKEVLSLPQN 257074
257073
YSFSDIQKLDFLQACFKETIRINTSAPILFTREAKVDHKIGNYNIQKGTYVNVGLIS 256903
256902 NQIDETYFKDPLTFNPNRWLDHSAENIIKENPFVYLPFSGGMRNCIGQHLANLQAKIVLV
256723
256722
EFVKKYQIPQMPNDFVLEFVIKQNYQIKNPLILNLTKRQAY* 256597
>CYP5010B1 8254645 1076301 bp 3 contigs 27% to CYP696C1
1020997
MIFLIIQALVISFVLYLVGKFIVLPKLRMQFYRNQNIKEYNFFPFYGWFWDSILQK 1021164
1021165 TNKHKDCIHQIKHISESADFKDQDIILSNLNSKASLLIVNPDLISEFLQCQQHYIKYEL
1021341
1021342
PFHSLNKLIGTGLAKHYGDGWKKSRKILSNMFTFEQIALQLGSIRELSKRYISQQNEKSF 1021521
1021522
NIQKVFEMTAGASDLSIFLGFDMDEYKNKDGKTLGELIGSFANDVYLRDVGMF 1021680
1021681
SMIFGTKATELGLRKSDKDLNERGKNLRKFMYEIILECKKREQQPGYQPKYPSFI 1021845
1021846
HHLVQENLLNTEEELEDLLAISFNLFLAAKETTSKIAANVVYYAIKYPEQYKKIQEEIKT 1022025
1022026
YVPNDDYTFADLQKMNILQANVKECLRYDTSGPFLFQREVVQDHTLGKYNIKKGTLVNCG 1022205
1022206
YVVNFFNEKYFENPFSYQPERWLDSNNDSKMKNSHLVYIPFSGGARNCIGQHLAVMQIKA 1022385
1022386 MAIEFFKQFKLPEIPKDFDTEKIMTLSYGFKNDLLLNLERI*
1022511
>CYP5010C1 8253873a 25773 bp 1 contigs 3 genes 26% to CYP694C3 complete
9357
MLLELIVLAIFSIFLYYFAYYVVLPKYRMQFYKNQGLKEYEFRPVYGWLTDITLMKRNQ 9533
9534
WNDCLYYIKRLSQNPDYKDQDIILSNLADQALLTLINPELINEFLQRQNEYIKYQLPVQG 9713
9714
IEDFLGNGLSRQYGEKWKNGRKVISNMFTFENITAQLQRVKEISRKYINNEDTEKFNIK 9890
9891 YVFEKISGSQNLQVLLGAEVEKYKDSEGRHLGECVQQLSNDICALYISPFAMI
10049
10050
FGQKILKWNLREADRQIQKRMKDMRELMTKILIDCVNHEKDPSYEAKFPSYI 10205
10206
TFLLRAGYLQTDEEINDLRATAFSIFLGAKESTSKLANMTIYNLIKHPEQYKLVYDEIQ 10382
10383
KYVSHDDYNYLDIQKLNHLQANIKETQRTDPSSAFLRLRVAQNDHTLGKYQIKKGSIINV 10562
10563
GYLANFYNEKYFRDPFTYNPSRWFDKQEDLKIKENHFVFIPFSAGSRNCIGQHLATMQIK 10742
10743 VMIVEFIKKFHLPQIPPDFDDEKTMIQSYGFKNPLIVKLQKRS*
10874
>CYP5010C2 8254686 541833 bp 6 contigs 28% to CYP694C2
314867
MIFQLISFAICVVIFYYAAYYFLIPKYRMQFYKNQGIKEYKFVPIYGWLKDILFLKKN 314694
314693
QWNDSLYQFKRISQDPEFKDEDILLGNVANQAFLTLIKPELINEFLQRQNEYEKYSVP 314520
314519 LRGVMNFLGNGLARIYGDKWKKSRKVISNMFTFENITAQVGRIKEVSRKYINNENPEKF
314343
314342
NIKYVFEKISGSQDLQVLLGAEVEKYKDSQGRHLGECIEQLSSDICIHY 314196
314195
ISPFAMIFGQKILTWNLRESDRYISKRMKDMRQFMTQILTDCVNREKDPTYQAK 314034
314033
YPSYITFLLKAGYLQKEEEINELLATSFSLFMAAKDTTSKLTSMAIYNLIKHPEQYKLV 313857
313856 FEEIQKYVSHDDYDYNDLQKLSHLHANLKETLRMDPSVPFILSRVAQADHMLGKYKIKKG
313677
313676
TIINIGYLCNLYNEKYFKDPFNYNPSRWLDNEEDMKMKENHFVFIPFSAGPRNCIGQHLA 313497
313496
MMQMKVMIVEFIKKFYLPQIPKDYDDEKILTQSYGFKNPLTVKLQVRN* 313350
>CYP5011A1 8253811 495369 bp 4 contigs 28% to CYP694C3 complete INTRONS?
369901
MIFVAIKILFFLILCLSFFKMVIYPVANLIRYRIQGVSIIKYYPISGLWGILQKDFKKYGDSLRFFN (2)
VEKKNIQEVKLYKQIGGNIALYLIDPEMIKEFY
SLNCYQKSPFIVKMFSRVLGNGFLFDEISHNRKRNIFSKLFHFDNLKQVVPKIEQITQNW
VIQKCPLVKKTVQRIDMIDLSQHISGDIITQQFFGINMNNQRYKEKTIPQNLYHLNTATC
NQMRTIRYILTGTYTFDKGWCRFDRVVNEDIKNIKQILIDALYSRINKQ (1)
NKNSIIDQIIKNGYILDKKSSDMSKQEKEELTKSNKILPEELLEEYLTFYLAGMETT (1)
368711 GHTVGFCFYFYCKYPH 368661 (0)
KLRKEINQHFSHDQPITYEKISQLNYLDCFIKEIFRFYGPTSTLMYRI (0)
368353 ATQDHKIKDIEIKKGSMVNLAILTNFYNSKIFKNPEEFNPDRWLEKNQINSYAYLPFSAG
368174
368173
RRNCIGQHLGLLTIKIVFSVILRDFEVTLTNPDYQMDRLQKFVHGPSSPIYCNFERLKQKN* 367988
POSSIBLE
INTRONS
NQMRTIRYILTGTYTFDKGWCRFDRVVNEDIKNIKQILIDALYSRINKQ (1)
NTENSIIDQIIKNGYILDKKSSDMSKQEKEELTKSNKILPEELLEEYLTFYLAGMETT
(1)
MY CHOICE
NQMRTIRYILTGTYTFDKGWCRFDRVVNEDIKNIKQILIDALYSRINKQ (1)
NKNSIIDQIIKNGYILDKKSSDMSKQEKEELTKSNKILPEELLEEYLTFYLAGMETT
(1)
NQMRTIRYILTGTYTFDKGWCRFDRVVNEDIKNIKQILID
ALYSRINKQSQQQKQSQSQEDFQVDTENSIIDQIIKN (1)
ELTKSNKILPEELLEEYLTFYLAGMETT (1)
CCATTCCAGCTAAATAAAAGGTCAAGTACTCTTCTAAAAGTTCTTCTGGTAATATTTTGT
Y0 E1 E1 L L E1 E1 P L I K
N
TTGACTTAGTTAGCTCTTCCTTTTCCTGTTTACTCATATCTGAGGATTTTTTATCTAATA
S0 K T L0 E1 E0
K E0 Q K S1 M D1 S S K K
D1 L I
TATAACCATTTTTGATGATCTGATCGATGATTGAATTTTCTGTATCTACTTAAAAATCTT
Y
G1 N K I
I0 Q D I
I S N
E1 T D1 V0 Q F
D E
CTTAGCTTTAAGATTATTTTTATTGTTAACTTTATTTATTTATCCTTGAGTACAAAGCAT
Q
S Q S
Q K Q
Q Q S1
CAATTAGAATTTGTTTAATATTTTTTATGTCTTCATTAACTACTCTGTCAAATCTACACC
AACCTTTATCAAATGTGTAAGTTCCTGTTAGAATATATCTGATTGTTCTCATTTAATTGC
AAGTAGCTGTATTTAAATGATATAGGTTTTGGGGAATGGTTTTCTCTTTGTAACGTTAAT
TATTCATATTTATACCAAAAAACTATTATGTGATAATATCTCCACTGATGTGTTAAGATA
AATCAATCATATCAATTCTTTAAACTGTTTTTTTAACTAATGGACATTTTTATATAACCC
AATTTTAAGTTATCTACTCAATCTTTGGCACCACTTGCTTCAAATTGTCAAAATGAAAAA
GCTTAGAAAAAATATTTCGTTTTCTATTATGGGAAATTTCGTCAAATAAAAATCCATTTC
CTAATACTCTACTAAACATTTTCACTATGAAAGGACTTTTTTAGTAGCAGTTTAGAGAGT
AAAATTCTTTTATCATTTCAGGATCTATCAAGTAGAGAGCAATGTTTCCTCCTATTTATT
TATATAGTTTTACTTCTTATATGTTCTTTTTTTCCACACTTATTAATTTACCTATGTTTA
S2 I L K
G1 I N I
TAGCTAAGGATTTAGCATCAGGATGCTCAATTCCTACTTACTTAAAGAACCTAAGAGAGT
A1 L S K A
D
K2 F F R2 L S D
CACCATACTTTTTAAAATCTTTCTGTAAAATACCCCATAAACCTGATATAGGATAGTATT
G
Y K
>CYP5012A1 8254582b 1075194 bp 7 contigs 2 genes 27% to CYP694C3 complete
626541
MNILIQALLLIAVLLLLMILFFVNKYYIKTRSLMRYYENQGINNSYYFPLVGKYKRMLD 626717
SIKEHGDCMHYYKQLLTNQPDCKYTVTNL
(1)
AHNVCIYFYDLELLNQLYKCSDLVKHQFGV
626951
EIFKQVMAGLVFNEGEEHSRKRKILSQAFHYQLLQNMIPQLSQIYEEFFEKIRNKRVS 627124
627125
DCIMMGQEIVAESIVRLFFSNSFQGLKFKGKTMTEAVSGIIERVGSMAK 627271
627272 SASYALFGKICFSFGQMASVKKDLKQIKEIYKEIITNRINSTNLEELQHKKQKDLIDYL
627448
627449 IESKGISKIESEESITIDQIIEEFVTFQVAGLDSS (1)
GHTLGMVLYYLCIFPE (0)
LVEEIEQTIKTPDDLNTESIKKMKYLGAFVNEVLRYYSVADQIFPRT (0)
CNKDIQIGDLIIKK
627962
GTQVNVGIIENHYNPKYFKDPHQFNPMRWLDGSLDKLNSYAFNPFSSGKHNCIGQHFAQLEIKVGII 628162
628163 KFLQEFDLELDQDYKFILKFSFLTEPLNPIPITFKSKKQ*
628282
>CYP5012A2 8254582a 1075194 bp 7 contigs 2 genes 28% to CYP696C1 complete
and boundaries checked
MGLILQIIIFILTIISLLALYLVYKFYVKPKILMG
FYEKQGVKNLCFYPFIGKYSFVSRDIMLKGDGFYSIKDFLAQNP
GCKYSMTNL (1)
623793
AEFVCIQLYDPDIIKEFYNCKHIVKHKFGVEIFQEFMNGIVFAEGEEHTRKRKIISQAFHYQLLQS 623990
623991
MISPLSEIYDEFIDKVKDKKVDDCISMFQDISGESVLRLFFSKSFKDYKYRNKTMTETLS 624170
624171
DLINCAGQMAKSTEYAFLGKGALRFGKLSTLQKTTKDIKKIFKEVITNRIDS 624326
624327 TNLDDLKHKQRKDMIDYLIENQGIDKIESENKITIETIIEEFITFYVAGLDTT
624485 (1)
GHTLGMAMYYLSVYPE (0)
LSEEIKATIKSQADLTSENIKNMKYLQAFVNEVLRHYTIADQLFMRT
(0)
ATQDIQLGNLKIKKGTVLNVGNLE
624929
NHFDPKYFKDPHQFNPQRWLDGSLDKLPPYIFNPFSAGKNNCIGQHFAQINIKIGLI 625099
625100 KFIQSFDFEIDPNYKLILTFQFLIEPVQPIALTFKSKKL*
625219
>CYP5013A1 8254607c 557789 bp 5 contigs 3 genes 29% to CYP694C1 = EST seq
1
113493
MFIEILIIILFFAALRLVIIPYFKFLKYKKYGDGRFVPLLGELVEIKEAIK
KYEDVDYFVSHQCDENPDLRLYVVNL
113723 (1)
GSKIKLRL
113801 VDPDLMRDFFLNQQYYEKDTFYIGNVLRCAPQGIAFVEGEQ
WKKARKMFSQAFHFEYLT 113977
113978
SLAPLIEQIASKVFNQAMESSEILANYDPLVYSQKITGQVVIATFFGEQVNE 114133
114134
KKFRGMDLVSALTHMLNLLGEQSMSLQYFLFGADFFKLRLTKSQRYVDDIIKDFRSFMK 114310
114311
DLIEEKIESFQKELKEQGKISVPSILAQIVSTQEITEESKKYLFDDFTTFYAAGTDTT 114484
114485
GHLLAMTFYYITQNPELQKALQREVDANQDQSPQGLAQLNYLNAFLKETLRYYGPANF 114658
114659 TFDRI (0)
114752
ATKDHYLGDIPIAKGTIVVPISQSTHRNKKYYEDPHRYNPERWLDNKQKQIHNYAYMPFS 114931
114932
SGQRNCIGQHLAMIVARITLNKFVKMFEFSCDPNYKLVITQRFMSEPLDPIPLR 115093
TAAGATTATATGTTGTCAATCTTGGTAATAAAATTACTTAAAAGTATTAAATTTTACTAT
R
L Y V
V N L
G
GTAATCTTAAAAAATAGGCTCTAAGATAAAGCTTAGATTAGTAGATCCCGACTTAATGAG
S K I
K L R
L
TTTTGATAGAATTGTAAACAAACAATTTTAAAACAAAAAAAATATTTAAAAAAATGTGAT
F
D R I
V
TTTACAAATTAATTATTTTTTTAATAAAAAGGCAACTAAAGATCATTACTTAGGAGATAT
A T K
D
>CYP5013B1 8254607b 557789 bp 5 contigs 3 genes 26% to CYP694C1
note: the
name given previously was incorrect,
this seq
is not CYP5013C1. (3/9/2009, DRN)
117635 MIFYFILIALGIYCYYYLVKPFKIFNKYKKQGEGFFFPLIGEFLLQNSCFEKFGDVDYIY
117814
117815 KHINDNPANQKKMFVENI 117868 (1)
117921
GKNIKIKLTSIDMMKDFYSYLGDKLTIDTFYTQNFKRIFGNGLIFSQGER 118070
118071
WKNMRKMFSPAFHFEYLKNLSSSIEKIVDNALLEHINQDELQNVNIINLIFDMTSQV 118241
118242 VFFSFLGDNLQDIKFKGKSLAQAITHILSALSNQVMELGYLVFGAKYLKLGIF
118400
118401
KQHRELENYLQEFKSFFLDIIKSKKEELQKEFQETGDIQQNCVLSILIKENNIQNLSDSD 118580
118581
LLDNLMTFFIGGTDTTSHLIAMAIYYVCQNEHFYSMTQKEVDYYKQDKSKSMQDLPFM 118754
118755 NAVIKETLRIYGPGNKSFDQK (0)
VTEDIMINNIKIEKDIVLTSYVN 118934
118935 SVHRDPQIYKDPHTFRPQRWINQETNDLPSY 119027
AFIPFSSG
119052
KKNCIGQHLAMIEARIILFKFFDYFDIEPQKDYVLKIKYGLLPQPVNPLLFNLKTRQNNID* 119237
TTATTAAAAAGTTTATGTAACCAATATTGGTAAATTATTTATTTATTTATGAGCAAATGA
V T N
I G
AATTATTTTATTAAAAAATATATAAATAAATAGGTCCTTATATTAAATTAAGAATATTTG
P Y I
K L
CAAATATTCTCTTTGATAGAATCGTAATATTTTTGTTAATTTAAAATGTTGCAAACATAT
D R I
V
TAAGAAACCTAATACTGCAAAACATTATATAATTATTTTATTTTAATATAAAGGCTACAT
A T
S
CAGATCACGTTATTGGAGGCATACCTATTAAAAAAGGAATGATTATTACCCCTTTTGCTA
D
H V
>CYP5013C1 8254607a 557789 bp 5 contigs 3 genes 33% to CYP694A1
54% to
CYP5013C2
note: the
name given previously was incorrect,
this seq
is not CYP5013B1. (3/9/2009, DRN)
115472
MNIILILVLFLAAFIIVKLFILPYKLHKKYKKYGQGQFVPILGEILEFQQNLQKNQDMEYSLK 115660
115661 HQLDKDPYQKVYVTNI 115708 (1)
GPYIKLRIFEPEIIRDFFQNQHNYKKFE
115856
115857
FFTGNLMRFVNGIITAEGQDWKSSRKLFSQAFHFDYIQKLAPLIEKSTNTIIDEIIA 116027
116028
KNELQNFNSITCMQELTGRVVIASFFGESLENERLKGRTIVETLSYILNALARQTGT 116198
116199 LIYLIFGKKFFQLGLTKHSREINQLIEEFNSFLQNKIASKIEEIQNDIKQNGNTS
116363
116364
NFSILAQLVSTSQIDQISRQQLFEDFKAFYLAGMDTTGHLLGMTMYYLTQYKDVYSKLQQ 116543
116544 EIDTNQDQSIQCISQLPYLNAVIKESLRYYGPANILFDRI
116663 (0)
116754
ATSDHVIGGIPIKKGMIITPFAISMHRNQNIFENPHAYNPSRWLDKKISEINSFSYIPFS 116933
116934 SGQRNCIGQHLAQMQTKIILNKFIKKFNFSCPENYKLVMAFKFLSEPVDPLCLKLTLRQ*
117113
TTAAAAAAAGATGTTTGTTGAAAATATAGGTATTTAAAAATTTTAATTGATTTATTATAT
V E N
I G
TCATTCTCATATTTTATAAAGGAAAAAATATCAAAATAAAGCTGACATCCATTGATATGA
G K N
I K
AATCATTTGATCAAAAGGTATGATTTGCATTTTAAAGATTTTAGTAATTATATTAAAAAA
S
F D Q
K V
ATTAGGTTACAGAAGATATCATGATCAACAATATTAAAATTGAGAAAGATATAGTTCTTA
V T
E D I
>CYP5013C2 8254373 875719 bp 4 contigs 29% to CYP694C3
486130
MIFELILIAVALFAYFKIAKPYFSYLKYRKYGKGFYYPILGEMIEQEQDLKQHADADYSV 486309
486310 HHALDKDPDQKLFVTNL 486360 (1)
GTKVKLRL
486439
IEPEIIKDFFSKSQYYQKDQTFIQNITRFLKNGIVFSEGNTWKESRKLFSPAFHYEYIQ 486615
486616
KLTPLINDITDTIFNLAVKNQELKNFDPIAQIQEITGRVIIASFFGEVIEGEKFQGLTI 486792
486793
IQCLSHIINTLGNQTYSIMYFLFGSKYFELGVTEEHRKFNKFIAEFNKYLLQKI 486954
486955
DQQIEIMSNELQTKGYIQNPCILAQLISTHKIDEITRNQLFQDFKTFYIAGMDTTGHLLG 487134
487135
MTIYYVSQNKDIYTKLQSEIDSNTDQSAHGLIKNLPYLNAVIKETLRYYGPGNILFDRI 487311 (0)
487422
AIKDHELAGIPIKKGTIVTPYAMSMQRNSKYYQDPHKYNPSRWLEKQSSDLHPDANIPFS 487601
487602 AGQRKCIGEQLALLEARIILNKFIKMFDFTCPQDYKLMMNYKFLSEPVNPLPLQLTLRKQ*
487784
TATTCTGTTCATCATGCTTTAGATAAAGATCCAGATTAAAAATTGTTTGTAACAAATTTA
V T N
L
GGTAATTATTTTTTAATGAATATTCCTTACTGAATTTATTTTTGAATCTTATAAGGAACA
G G T
AAAGTCAAATTAAGATTAATTGAACCTGAAATAATTAAGGATTTCTTCTCAAAATCTCAG
K V
K L
AAAGAAACTCTTAGATATTATGGTCCTGGAAATATTTTATTTGATAGAATAGTAATTAAT
D R I
V0
TAATTACTTTTTTATTTATTTTATTATATATAATAAAATTAATAACTTTAAGAATTATCT
AAGTATTTGTTTATATTAAATAAAATATATCTATTAAAAAGGCTATTAAAGATCATGAGC
A L K
D H
>CYP5013D1 8254719 598405 bp 5 contigs 31% to CYP694A1 = EST seq 2 and
seq 3a
NO INTRONS
491991
MVSYFALAGLAIVLYILYVFIINPYLQYRKYLKWGKGSFYPFVGVFYGAGLRVK 492152
492153
QYKDVDHHLKHMYDDGSDPKIYVENNATGAIIKISDPEYIKEFVQLENKAYQKTTLLID 492329
492330
NIIRLVGQGIIFSEGPQWKKNRNVLSGVFHFEQLSKRVPSIEKITKEVYKRYIDSGNV 492503
492504 KNVDVIELFQEITAEVVIQTFFSNISKDKSFYGMSLSVALSYLINS
492641
492642
VGKQISTPFYFLFRTNFFKWGIRESDRKLNKQIKEFRQMIGDIINERIKEEEELEKRGE 492818
492819
QTTKEDLVYYLKKNNLLGVLSLDEIISEFMTFYVAGMDTTGHLCGMAIWFLTQHPEIKKK 492998
492999
LQEELDANTDYSQNGLLKLPYLNGVIQETQRLYGPAGQLFNRVALRDHMLKDIPIKKG 493172
493173 TIVKPSPCSVHRHPKYFEDPHSFKPERWFNKNTVTPYTFIPFNAGPRNCIGQHLALIEAR
493352
493353
IMINYFMKTFDFESDPNFEMVLNYAFLIEPVDKLRINLKLKENPLIYQ* 493499
>CYP5013E1 8254431 715652 bp 1 contigs 28% to 693A1 COMPLETE
603788 MIYIIITLILMSILYKYLVKPYSLYRKYKKYGDGYFFPLFGEAAEYYQNLQKHQDCEYS
603964
603965 FKHYFDKRKPEDNR 604006
604007 IIVSNM (1)
GTSVKINL
604112
VDPDLIKQFYQKSENYHMDHFFMSNLYRLFGTEEEDQKLHRNFRKQFAPAFN 604267
604268
FEYLQNLSSQITKITDNKINSLIQANNFKDIDPIAFSSEITGQVILLSFLGDTLENLS 604441
604442
FRNMKLPHALSYILNCLCGQMGEIGYILFGAKYTQKGYFKQHKEVEDFMQEFKNY 604606
604607
LRVILKQKIEIYRKEFEQTGNIKQTCILSLFIQNGEIDKIDLDDLLKYFLNFFLAGTETT 604786
604787
SNLVGMTIYYLTQNKEVQQKLQQEIDQNTDYSAEALSNLPYLNATIKETLRYQGPGNS 604960
604961 LLDRI (0)
605028 AVKEHYLEDIKIEKGVIVNVYMKAVHRDSRFYSDPHTYNPQRWLDSKENKSLHPFTYLPF
605207
605208
SAGQRNCIGQHLALIEARIILNSMIKNFNFKINEGYKLILDFKFSPQPLNPLQIHFDRR* 605381
CTCATTCAAGCATTATTTTGACAAAAGAAAGCCTGAAGATAATAGGATAATAGTTAGTAA
N R I
I V S
N
CATGGGTAACTTATATTAATTAAAAAATGTAATTATTAGTAATTAAATTTATTTTAATAA
M
G N L
Y Q L
K N V
I I S
N Q I
Y F N
N
TAAAATAGGGACTTCTGTCAAAATCAATTTAGTTGATCCAGATTTGATAAAATAGTTTTA
K
I G
TATAAAAGAAACACTTAGATATTAAGGTCCTGGCAACAGTCTATTGGATAGAATCGTATT
E T
L R Y
Q G P
G N S
L L D
R I V
L
ATTAAAAATTTATTTAATATTTATTAAATCATTTTCATTTAAAAAAGGCTGTTAAGGAAC
K A V
K E H
L
K I Y
L I F
I K S
F S F
K K G
ATTATCTTGAAGATATTAAAATCGAAAAAGGTGTAATTGTAAATGTATATATGAAAGCAG
Y
L E D
I K I
E K