48 Tetrahymena P450 genes

 

This file is being updated to complete these genes.  Currently 37 are

complete (magenta).  There are very few short introns present.

There are four full length pseudogenes and one solo exon pseudogene.

Note: this is the first time CYP names have used four digit numbers for families.

 

Blast server including Tetrahymena P450s

 

Tree of 47 Tetrahymena P450s

 

sequence alignment of 47 Tetrahymena P450s

 

D. Nelson April 23, 2004

 

>CYP5001A1 8254555 596115 bp 4 contigs 32% to CYP689A1 This gene's N-term is very

hard to detect.  I have assumed one intron, but this may not be true.

MPIQHNSQPKEVQTCEQMMKQLENNGFIRLKSYRPFQEEDLFKYDDELYSVKQ

FKSRYPNAKGFICYSDSSQKVPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGD

SLLFNLSDIWRMKRKVYGQMFHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGY

NASQYDLQSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNITC (2)

YSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFL

105041 DQICQDIIIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLN 105217

105218 YLEAVLKETLRHYGPTTTILPKVVMERHSFMNDKVFLEEGTIVSLAIQELNYDPQHYENP 105397

105398 FKFYPERWFDENLKQKEKKIFMPFSYGSHKCIGDKLALPVMKMLYITLNEYFSIDIDPSF 105577

105578 KLKLVALIQYTPSTQIPFLITKKKNIQSNNEVYFS* 105685

 

ATTCTAGGAGAATGGTCTCTTCATCTATTTTTCGGATATAATGCTAGCTAATATGATTTA

                           F  F  G  Y  N  A  S  Q  Y  D  L

CAATCTAAATTAAAAGCAGCTTATAACAATGCTCGCTCTGCCATACTCCTTTTACAGCCC

Q  S  K  L  K  A  A  Y  N  N  A  R  S  A  I  L  L  L  Q  P

                                        P  Y  S  F  Y  S  P

CAAAATATAATAAACTGTCTTGACCCTTTAAAGGAAGTCCTCCTTGAATATCTATACATC

Q  N  I  I  N  C1 L  D  P  L  K  E  V0 L  L  E  Y  L  Y  I

 K  I  Q  Q  T  V  L  T  L  Q  R  K  S  S  L  N  I  Y  T  S 

ACTTCTGATTAGATAAAAAGCGATCCTGATTACTAAAGCAAACCTTAAAATAACATCACA

T  S  D  Q  I  K  S  D  P  D  Y  Q  S  K  P  Q  N  N  I  T

 L  L  I  R2 Q  K  A1 I  L  I  T  K  A1 N  L  K  I  T  S  H

TGGTATAAAATTTAAAAATTAATTAAAAACAATAAAATAAATGTTTTTTTTAGTTAGTTA

W2 Y  K  I  Q  K  L  I  K  N  N  K  I  N  V0 F  F  S1 Q2 L 

 G  I  K  F  K  N  Q  L  K  T  I  K  Q  M  F  F  L  V  S2 Y

TTCATTCATTCATTCATTTTACTTACTAATAATTTTTTTGAAATTTAAAAACAGGTTTTT

F  I  H  S  F  I  L  L  T  N  N  F  F  E  I  Q  K  Q  V0

 S  F  I  H  S  F  Y  L  L  I  I  F  L  K  F  K  N  R2 F  L

GCTAAATAGCAAAGAAGTTTAATGGAACAAAGATTTCCTTGATTAAATTTGCTAAGATAT

 L  N  S2 K  E1 V1 Q  W  N  K  D1 F  L  D  Q  I  C  Q  D  I

TATCATCTTTTTCTTCGCTGGAAGAGATACGATGGCTCATACTCTTTAAATGATGTTTTA

             F  A  G  R  D  T

TTATTTATGTATTTACCCTGAATATAAAAAGAAGATAGACGAAGAAATAAATAGCCTAAA

CGGAGATTATTCTGTTTAAAATATTTCAAATTTAAATTATTTAGAAGCTGTTTTAAAAGA

 

5 POSSIBLE INTRON LOCATIONS THAT MAKE SENSE.

 

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL

QSKLKAAYNNARSAILLLQPQNIIN (1) SILITKANLKITSH

GIKFKNQLKTIKQMFFLVSYSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL

QSKLKAAYNNARSAILLLQPQNIIN (1) SNLKITSH

GIKFKNQLKTIKQMFFLVSYSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

MY CHOICE BASED ON COMPARISON TO PARAMECIUM P450S

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGY NASQYDL

QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNITC

 (2) YSFIHSFYLLIIFLKFKNRFLLNSKEVQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL

QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNIT

WYKIQKLIKNNKINVFF (1) KVQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

VPYNYIGICDADLIHEFMVEKEKYYERDVTLSYVARILGDSLLFNLSDIWRMKRKVYGQM

FHFYQYHQLVKLVREITTKNSYIIDMNSQKPVQLISHFEDILGEWSLHLFFGYNASQYDL

QSKLKAAYNNARSAILLLQPQNIINCLDPLKEVLLEYLYITSDQIKSDPDYQSKPQNNIT

WYKIQKLIKNNKINVFF (1) IQWNKDFLDQICQDI

IIFFFAGRDTMAHTLQMMFYYLCIYPEYKKKIDEEINSLNGDYSVQNISNLNYLEAVLK

 

>CYP5002A1 8254654 510086 bp 17 contigs 30% to CYP696C1

MLTELFYTVILLIGVLFYLTFLRPLYKLLMLKIKHGKEILIKFEPIIGIHAYLKKSEKIHGDPMKLLKLEMAKN

PKLKAVACSLGFGVLITSVDPDMNKSFLIDNYQKSVKLAPIFNYKILKNGIIFSE

GEQWKKQRKMISDSFHFDSKYYFFSQKFNLLTLIMFKKVQNR (1)

VSVDSQLFFTKIKGEIGIRVFFGPDFGDHQIDGKNISEELKE (0)

IVDQTIVYSYTSIFFQLK

RIFLGDELAMKFLTAKEKKLLSRSDRINQIAMEQIEKKIKYYEGQPSVQSDYLIDV

292370 LVRNYLTKQDPNLTKEQ (0)

ICQQCITMFIAALDNTGCIIA 292543

292544 WIIFCLANYKEEGDKVRQEINSVFPTQETLDNIQFDDLKNLNYTSAFIDECFRHYTSSMG 292723

292724 VFLRRVVEPFKSGNFEIQK (1)

       GDIIQSLWH

292894 PTQFNPDWFENPDKFDVNRFLNGKEKQGNSYAFTPFSIGPRNCIGKQMSLVEVKVSLIYF 293073

293074 VKKFNCIRNSVPLKTTASIVYKPVDQNLIKISFCL* 293181

 

AATATTTAACTATAAAATACTTAAAAATGGAATAATCTTTTCAGAAGGAGAACAGTGGAA

                                                       W  K

GAAATAGCGCAAAATGATATCAGATTCCTTCCATTTCGATAGTAAATACTATTTTTTCAG

 K  Q  R  K  M  I  S  D  S  F  H  F  D  S1 K  Y  Y  F  F  S1

TCAAAAATTTAATTTATTAACTCTCATTATGTTCAAAAAGGTTTAAAATCGAGGTTTTAA

 Q  K  F  N  L  L  T  L  I  M2 F  K  K  V0 Q  N  R  G1 F  K2

GTTATTGCATAAGTAACTAAAGAAACAATTGACAGTCTCAGCAAATATGTAGGAAAGGAT

 L  L  H  K2 Q  L  K  K  Q  L  T  V0 S  A  N  M2 Q  E  R  I 

TACTTCCCTTTATTTGATGAAATGATGAAGCATTCAGGTAATTACTTTAATTAACATTTT

 T  S  L  Y  L  M  K  *

AGTTTCAGTTGATTCTCAACTTTTTTTTACTAAAATTAAAGGCGAAATAGGTATTAGGGT

 V1 S  V1 D                    K  I  K  G  E  I  G  I  R  V

TTTCTTTGGACCAGATTTTGGTGATCACCAAATAGATGGTAAGAATATATCTGAAGAGCT

 F  F  G  P  D  F  G  D  H  Q  I  D  G  K  N  I  S  E  E  L

TAAGGAGGTAAATGATTAATTGATTCATTTATTTATTTATTTCGTAGGTATATTATAATT

 K  E  V

TGTATATGATTTATCTTAACATCAAGAATGAAAACATATTTATATAAATGCAAAAATAAA

                               K  H  I  Y  I  N  A  K  I  K

ATTAAAGATTGTAGATCAGACTATTGTTTACAGCTATACTTCTATTTTCTTCTAATTAAA

 L  K  I  V  D  Q  T  I  V  Y  S  Y  T  S  I  F  F  Q  L  K

GAGAATCTTTTTGGGAGATGAATTGGCAATGAAGTTTTTAACTGCGAAAGAAAAGAAACT

 R  I  F  L  G  D  E  L  A  M  K  F  L  T  A  K  E  K  K  L

TTTGTCTAGGAGTGATAGGATAAATTAAATTGCTATGGAATAAATAGAAAAAAAAATAAA

ATACTATGAAGGTTAACCTAGTGTACAATCAGACTACCTTATAGATGTATTAGTGAGAAA

TTATCTCACAAAATAAGACCCAAATCTAACAAAAGAATAAGTAATCCATTTTATATTAAG

                D  P  N  L  T  K  E  Q  V  I  H  F  I  L  S

CTTTTTTTAAAAATAAATGTCTTCATTTATATCTAATTAGATATGCTAATAGTGCATAAC

 F  F  Q  K  Q  M  S  S  F  I  S  N  Q  I  C

TATGTTTATTGCAGCTTTAGACAACACTGGATGCATAATAGCTTGGATAATTTTTTGTTT

GGCTAACTATAAAGAAGAAGGAGATAAAGTAAGATAAGAAATCAACTCCGTCTTTCCAAC

ATAGGAGACCTTGGATAACATCTAATTTGATGATTTAAAAAATCTTAATTATACTTCAGC

ATTTATTGATGAATGTTTCAGGCACTATACAAGTTCTATGGGCGTTTTTCTCCGTAGAGT

                      H  Y  T  S  S  M  G  V  F  L  R  R  V

AGTTGAACCCTTTAAGTCTGGAAACTTTGAAATATAAAAAGGTACTGATTAATCTATCAT

 V  E  P  F  K  S  G  N  F  E  I  Q  K  G

TAATAAATATTAATTATTTTAATTTAAAACTAAACTAATTAACTAAAAATTTTTAAATTA

AATAAAGGAGATATTATTTAAAGTTTATGGCATCCTACCTAATTTAATCCTGACTGGTTT

   K  G  D  I  I  Q  S

GAAAATCCTGATAAATTTGATGTGAATAGATTCTTAAATGGAAAAGAAAAGTAAGGAAAC

 

>CYP5003A1 8254752 1171415 bp INTRONS 1, 3 AND 4 HAVE UNDEFINED BOUNDARIES 29% TO CYP694C3

MILLIIGLLIFSLFSYFAYLIFVKPYVRNKWYVKQYGKICKPIYYPILGNIKFVRESFKKYG

DSLQWIRELLKNDNDVQFLIGYNNIHIVDAQLAKDITLNNLQNIRKDMGMQFGDLLFKQG

IVFSEGEKWKKQRQILSHSFSFDVLKNRVSKINNIVKEMINKIQIEE (1)

GKPTKIVTECTKITAEVVLRSFFGERFSQQQHKGINLGIYLAQL

LLSVFDQTRRLRMLIPSFFFPKFMSKIFADEKYYEVQKNCIEFRQLIEKEVNLRIENYDQ

NKTQKDFLDILVDEYKLNYSKNDGKVEEYITKESIIHQFIVFFFA (1)

GMDTTGNLTGIMLYW (2)

VSKRKDIYEKLVQEIKSVFGQNKDPEINDEQLKKLNYCHMFIQECLRYHCPAMLL (2)

FTRRAERDFYIGDNILVQKGMQVNISLHGVLRRE

262363 KYFQNPDEFIPERFSEENKKNINHLAFIPFSSGPKNCIGQHMALIEAKIILVQFILNFDI 262184

262183 TNNENVPLKMDSNSLYCPVNDELLFLKRKQ* 262091

 

AATCTCTTTCAGCTCTACGTGTGAACCTTAAAAATTTACAAAAGTTAATTTTAATAGACT

 D1 R2 E  A1 R  R  T  F  R2 L  F  K  C  F  N  I  K  I  S  Q

                                                   L  L  S0

AAAAATATTATTTATTTACAATAACATTGCTGGGCAGTGGTATCTGAGACATTCTTAGAT

F  I  N  N  I  Q  L2 L  M2 A  P

AAACATGTGGCAATAATTTAATTTCTTTAGTTATTCGTCATTGATTTCAGGATCTTTATT

                                                           

TTACCCAAATACACTTTTTATTTCTTAAACTAATTTCTCATATATATCTTTTCTTTTAGA

                                                   R  K  S

AACCCTATTTGATAATAATTTACATTTAATTATTATTTTTATTATTTTTATAATATCTAC

V  R2                                                      W

CAATAAAGCATGATGCCAGTCAAATTTCCAGTTGTATCCATACCTAAATTTTAAATGAGT

 

 

CAAAAAAGGATCTTAAAACAACTTCTGCCGTTATCTT AGTGCATTCAGTTACAATTTTAG

                         A  T  I  K0

TTGGTTTTCCTATAAAAAAATAATTAAATTATTTACATACGATATTGTTTTAACTTTATA

 P  K  G1 I

TTCATCAAATCAGATTAATTATAAATAACTAAATTACCTTCTTCAATTTATATTTTATTA

                          V  L  N  G  E  E  I  Q  I  K  N  I

ATCATTTCTTTAACAATATTATTGATTTTAGATACTCTATTTTTCAACACATCAAAAGAG

 

>CYP5004A1 8254638 979488 bp 2 contigs 31% to CYP694C3 COMPLETE, NO INTRONS

747540 MIILIASLLTILTFCILLLIFSPKIKLLHLAKKYKDTEVLFSPVGGIVNLYQQSFEKY 747367

747366 GDALYFIRDLIQKKPNTKAILYNLGVDIVVNFQDAELVKEVHQSYSDYYEKLKGILTLQY 747187

747186 LFDKGMLMTDGDEWQKQRKLLANTFLFQNIKSRLPVTKQVVQEIYSSKQLDLSKPINALH 747007

747006 MSEKVTSDVIFQTFFGISFRGKQINGVEIQKELSNVLIEGFNYIQSSFYYKLKMVILK 746833

746832 EKAVGFKPFKGEEDLLKRMKNLRDTTEEVIQNRLKQIKSGNLDISQSELFLDILLRDYI 746656

746655 QKNSESEKDIQQMIDQFMTFFFGGTDTTADLVALTLYHLSHNQNFQTKLRQEIKSIINNF 746476

746475 EELNYDNLNQMNYLQCVIKESLRIHPPAVGVLPRVCIKDHKVGQIEMKKGMLMDTHFIGV 746296

746295 LNNPKYYDNPQDFNPDRWNDSKKMESAPFSPFGIGKRSCIGQHLGMMNAKVIICFLVMNY 746116

746115 LVKPNHQQKLRMSLNTVYTPFNDDLVYFEKIN* 746017

 

>CYP5005A1 8254815a 909308 bp 16 contigs 10 genes all in (+) 29% to CYP694C2

       MILVSYIALGA

241470 IVLLIAIFIYNLILYPQIRLNLMKQKHGDKIKTFFHIGEGIFKEYYTNLTQKKDSVAFIR 241649

241650 GIHQKNIKALAYNLGTRICLNFVDPELVKQVHQNPEAFRKNDASLAITFLFGNSILFA 241823

241824 KGKSWQRQRQFLGKSFHFEEIKNYLPLIKEICDKVFERVSEKLTQNDQLEINAVKICEQV 242003

242004 TSEVVFRAFFGSTTQNLVITRQDGSQIPIADELIQAIMNSFQLLKKDKIALFKYLIFGRN 242183

242184 STKFLRTQGEQSILNRLIAIKESCLQVVQQRKNQLQNDPTQAKKNFLDQYLYDMITNKQ 242360

242361 SEVNNEEIIDNFLGLFFAGTDTTGNMTGVALYYLSRYPDIQKQAREEVIQILSQNSNKRN 242540

242541 HSELFSQLTFENLQNMNLINSILKESLRLIPPAIEVFPRVAIHNMKIGEFQIKKGDLVST 242720

242721 YFIYNQSNPELYQDPEIFNPQRWMNVKD 242804 (2)

242867 SQNNFNFTPFSLGPRNCIGQHLAMIEGKCMLACILLNFDILPNYQQEVVKELRVIY 243034

243035 GFQKDNLIYFKQRKQNN* 243088

 

ATCCAGAAATATTTAATCCTCAAAGGTGGATGAATGTAAAAGAGTAAGTCTTTTATATTT

                          W  M  N  V  K  E2

ATTATATTCTATATTTATAATGTATCTTCTTAATATAAATCAAAGCTCATAGAATAATTT

                                           S2 S  Q0 N  N  F

CAACTTTACTCCGTTTTCACTAGGCCCTAGAAATTGCATCGGTTAGCATCTTGCTATGAT

 N  F  T  P  F  S

 

>CYP5005A2 8254815b 909308 bp 16 contigs 10 genes all in (+) 30% to CYP694C1

244655 MLLNSIIISFVIILIIYKFLLYPKLRLIQIKRKYGDKIKVIHHYGPGLLVEYKK 244816

244817 SYTQNKDSLSFLRQMAYQNGKSIQAYAFNIGTNVGYSFVDPELIKQVHQNHDAFQKIDV 244993

244994 SQAIAYLFSNSLLFATGKSWQKQRQFLGKSFHYDEIKNYFPSIKDICLKVSDRISEELI 245170

245171 QNPNKEIKVVKTCEQITSEVVFRAFFGATSENIFISTENGSTKIPISQELVSVIMDSFRI 245350

245351 LQQSKLIQLKSQLLKRKTFNIFPLKEEKKLLNRLITLKQACKNIVLKRK 245497

245498 EELVSDPSLYKKNFIDLYLKEIIQNSNNEITIEEIVDNFCALFFAGTDTTGNMTGAALYF 245677

245678 LSLNPKIQQEAREEVMQIIKQKANSSDLKDIYNYLSFEDLTKLDLINSILKESLRLLPPASGV 245866

245867 LPRISNRDIKIGQFEVKKGDLVNTHFIYHQSNPEVYENQDQFNPYRWMKGKE 246043 (2)

246053 QNNAFNFTPFSLGPRNCIGQHLAMIEGKCILINFLINFDIMPIPQKQV 246229

246230 QYEMKVIYGLQPDDLVYFRKRNN* 246301

 

TAGATGGATGAAAGGAAAAGAGTAAAACATTTTTATATAATTATAAAAAAATTAAATAAT

    W  M  K  G  K  E

TTTTAAATAAATATAAATTAAAAGGTAAAATAATGCATTCAATTTTACTCCTTTTTCGTT

                         Q  N  N  A  F  N  F  T  P  F  S  L

AGGACCTAGAAACTGTATAGGATAACATTTGGCGATGATAGAAGGTAAGTGTATTCTTAT

 

>CYP5005A3 8254815c 909308 bp 16 contigs 10 genes all in (+) 31% to CYP694C1

247451 MIFVAIYTILGILALIFAIALYNLVFYPYLRLKIMKQKYGDKIQVFFNIGDGILKKYKNDLE 247636

247637 DKGDSLAFIRGMHQKNLKAIGFNLGAKVGLSFVDPDLIKQVHQNHDSFHKIDAAMAIT 247810

247811 FLFGNSILYAKGKDWQRQRQFLGKSFHFEEIKNYLPLIKEVCQNTFQNVNKQLALKEQTE 247990

247991 IQAVKICEQITSEVVFRVFFGSTSQNLDIKKEDGTRIPIAHELVETIMNSFQLLQNDKIA 248170

248171 LIKWLLFKRNSTKFLRTQGEESILNRLITVKQTCLQVVIKRRDELFKDPSQAKKNF 248338

248339 LDQYLIDMIQNKNSKVTYDEIVDNFSGLFFAGTDTTGNMTGVALYYLSLYPEIQQQAREE 248518

248519 VIKVLSQKQKEKKTDQLF

248573 NQLTFDDLSNLDLINSILKESLRLVPPANEVFPRIADHDMKIGDFQIEKGDLVNTHFIYN 248752

248753 QSNPEYYPNPDIFDPYRWMNAKD 248836 (2)

248875 QQNTFHFTPFSLGPRNCIGQHLAMIEGKCMLASILLQFEILPNHSAKIVKEVKVIYGLKN

       DNIIFFKKLKAN* 249093

 

CCCTGAATATTATCCAAATCCAGATATTTTTGATCCTTATAGATGGATGAACGCAAAAGA

                                        R  W  M  N  A  K  E

GTAAATTTTATTTTTATATTCATTCTTAAAAAATAATATTTAACTAAAAATAGTTAATAG

                                                   S  Q  Q

AACACATTTCACTTTACTCCATTCTCTTTAGGTCCCAGAAATTGCATAGGTTAACATCTA

N  T  F  H  F  T  P  F  S  L

 

>CYP5005A4 8254815d 909308 bp 16 contigs 10 genes all in (+) 29% to CYP696C1

249796 MFVENINLIPVLFILGILFILYKMILYPKIRLMQLQKKYGESIKVVHHYGTGLLMEYKKGFQQLND 249993

249994 SMAFIRKMSKENQSNVEAFAFNIGPKIGISFVYPELIKQVHKNHDAFQKIDVSQAVVYLF 250173

250174 SNSLIYATGKQWQRQRQFLGKSFHFEEIKNYLPCIKDICIKTSNQISEDIRINPQKEIQV 250353

250354 VKICEKVTSEVVFRVFFGSTQENILIEREDGQKLSISEELVSIVMDSFRILQQDKLLLIK 250533

250534 SLILKRYSMKIFPLREEKSLHNRLIQLKKACETIVLQRKKQLQLDPALYKNNFLDLYL 250707

250708 KEMIQNSNTQITIDEIIANFCGLFFAGTDTTGNMTGVALYYLSLNPQIQKEAREEVIQI 250884

250885 ISKKNSNLDLKDQFNYITFEDLSSMNLINSILKESLRLIPPAIGVFPRYANRDIKI 251052

251053 GQFELKKGDLVNTHFIYNQSNPSIFQNPEQFDPKRWMNGND (2)

251242 LQFAFSFTPFSLGPRNCIGQHLAMIEGKCMLANFLLKYDILPNKSQNIGLEMKIIYGLSPDNLVYFKQR* 251451

 

TACAATTAGTCAAATCCTTCAATATTTTAAAATCCAGAACAGTTTGATCCTAAAAGATGG

                                                      R  W

ATGAATGGAAATGAGTAATTAATTAATTAATTAATCATTTAATTTTTTATTATTTATTTA

M  N  G  N  E

CAAAAATTTATTAAATATAGTTTGTAATTTGCGTTTAGCTTTACACCATTTTCATTAGGC

                     L  Q  F  A  F  S  F  T  P  F  S  L

 

>CYP5005A5P 8254815e 909308 bp 16 contigs 10 genes all in (+) (+) 29% to CYP694C1

one in frame stop and a frameshift, stop codon missing at end, may be a pseudogene

252767 MLKILITFLGLFLFLLVFVVYKVFIYPYLRLSTFSKKHGKKVKVYFNFGVGLL 252925

252926 KEYKQSLIQYKDSLAFFRNLHQKSLQAIAFCFGTKVGLIFVDSNLIKQVHQNHEAFM 253096

253097 KIDASMAIIYLFGDSLIYAKGKEWQRQRKFLGKSFHFEEIKNYLPSIKETSIKIFGEIKD 253276

253277 QLKQSDQMEIQVVKTCESVTSEVVFRVFFGQTSQNLKNTKEDGSQISVAQELVSTITNSF 253456

253457 QILQNDKISLIK*IFFERNTIKYFPTKNEQNLQKRLIEIKKQCMQVVEKRRIELIE 253624

253625 VIKSAKNNFLHQYLKEMIVNEKSKISNEEIIDNFLALFFAGTDTTGNMTRVALYYLSLYP 253804

253805 DIQNKAREEIIKLASSRINSTNPVDLFNSLTFEDIQNLNFLNSILKESLRLIPPAIEVFP 253984

253985 RLAIQDIQIGD 254017 (fs)

       YEVKKGDFINTYFIYNFSNPEIYPEPDNFDPSRWMKQID (2)

254199 QQNTFNFTPFSLGPRNCIGQHLAMIEGKSMLAYILLNFEKFPNKNQEVVKEMKIIYGFQKDNLVYFKNRSQ 254411

RKKSGNIQLTQQFYIQIKKYLFLLCLKFQYVFFTTKKQKPSILQKFIKEKQNKFKHQKQTRFYFQHSKQKEVEINMI

LIISIIILQIKKQNMRKNISYKQKQFKQKNFLSLIELLFLSFKLKFFIFQK*

 

ATTTTGATCCTTCAAGATGGATGAAACAAATAGAGTATTTATTTAATATTCATAAATAAT

                 W  M  K  Q  I  E2

TAGTTAACATTAATGTATTTTTTTAATGAATAAAAAGTTAGTAGAATACATTTAATTTCA

                                      Q  Q  N  T  F  N  F  T

CTCCCTTCTCATTAGGACCAAGAAATTGCATTGGATAGCATCTAGCCATGATTGAAGGAA

  P  F  S  L

 

>CYP5005A6 8254815f 909308 bp 16 contigs 10 genes all in (+) 30% to CYP696C1

285979 MLIIFQIIFAIIASILTLAIYSLVLYPFFWFILMKYKYGDKIKVFFNIGQGMLKEYKKGLIE 286164

286165 KNDGVAFIRNMHQKNLKAMAFNYGSKVGFSFLDPELIKQVHQNHEYFIKMDATMAVTF 286338

286339 LFGNSILYAKGKEWQKQRQFLGKSFHFDEIKSYFPQIKEICQNTFRNTYQKLTNEEQINI 286518

286519 QAVKICETVTSEVIFRTFFGKTSQNLNITKKDGSQILLAFELVEVVKSAFQMLT 286680

286681 IDKIALIKWIFLQRNSARYFRTEKEEDLFQRLTAIKECCLQIVQKRRDELIKEPTQAKK 286857

286858 IFLDQYLIDTMQNQSSQVTDDEIIDNFCGMFFAGTDTTANMTGVALYYLSIYPEIQRQAR 287037

287038 EEIIKILSSKSNDKNPDSLFSQFSIEDL 287121

       SNLDLINSILKESMRLIPPAIQVFPRIACQDIKIGEFEIKKGQLVTTNFVYN

       QSNPEIFPKPDNFDPFRWMNEDN 287346 (2)

       TQSNTFNFTPFSQGPRNCIGQHLAMIEGKCMLVYILLLFDILPNPSVLVAKEVKV 287569

287570 IYGFQNDNLIYFKKRQNII* 287629

 

region around intron in CYP5006A6

TCAAATCCTGAAATATTTCCTAAGCCAGATAATTTTGATCCATTTAGATGGATGAATGAA

                                                W  M  N  E

GATAAGTAAGTTTAATTAATATTTATATTTTTAGAAAATATTTATCATTAATTTAACAAA

D  K2 Q  V0

 

TAGCACTTAGAGTAATACATTTAACTTTACTCCTTTCTCTTAAGGCCCAAGAAATTGTAT

 S2 T  Q0 S2 N  T  F  N  F  T  P  F 

 

>CYP5005A7P 8254815g 909308 bp 16 contigs 10 genes all in (+) 30% to CYP694C1

one in frame stop codon same as CYP5005A5, might be a pseudogene

       MLKILFTFLGLFL

289100 LLLVFVVYKVFIYPYLRLSTFSKKHGKKVKVYFNFGVGLLKEYKQSLSQYKDSLAFFR 289273

289274 NLHQKSLQAIAFCFGTKVGLIFVDSNLIKQVHQNHEAFMKIDASMAIIYLFGDSLLYA 289447

289448 KGKEWQRQRQFLGKSFHFEEIKNYLPSIKETSMKLFGEIKEQLKQSDQIEIQVVKTCESV 289627

289628 TSEVSFRVFFGQTSQNLKITKEDGSQISVAQELVSTITNSFQILQNDKISLIK*IF 289795

289796 FERNTIKYFPTKNEQNLQKRLIEIKKQCMQVVEKRRIELIEVIKSAKNNFLHQYLK 289963

289964 EMIVNEKSKISNEEIIDNFLALFFAGTDTTGNMTRVALYYLSLYPDIQNKAREEIIKLAS 290143

290144 SRTKSTNPVDLFNSLTFE 290197

290198 DIQNLNFLNSVLKESLRLIPPAIEVFPRVAIQDIQIGDFEVKKGDFINTFFIYNFS 290365

290366 NPEIYPEPNNFDPSRWMKQID 290428 (2)

290492 QQNTFNFTPFSLGPRNCIGQHLAMIEGKSMLAYILLNFEIFPNKNQEVVKEMK

       IIYGFQKDNLVYFKNRSQ* 290707

 

region around intron in CYP5006A7

TCCTTCAAGATGGATGAAACAAATAGAGTATTTATTTAATATTCATAAATAATTAGTTAA

       R  W  M  K  Q  I  E

CATTAATGTATTTTTTTAATGAATAAAAAGTTAGTAGAATACATTTAATTTCACTCCCTT

                            S2 Q0 Q0 N  T  F  N  F  T  P  F

CTCATTAGGACCAAGAAATTGCATTGGATAGCATCTAGCCATGATTGAAGGAAAATCTAT

 

>CYP5005A8 8254815h 909308 bp 16 contigs 10 genes all in (+) 30% to CYP694C1

381326 MISLLHLIFGVLLIPIILIIYKVILYPLTRLLYLKYVHLDKVRIFYTYGQGFFPQFKKNLLEKK 381517

381518 DSLAFVREMYSKKLQTALFSIGTKVGFIFLDPELIKQVHQNADAFKKNDGSLAITYLF 381691

381692 SNSLLFVKGKEWQRQRQFLAKSFNFQDIKNYLPIFKDVSSKIFGLIRNNLEISGTQEIEI 381871

381872 VKTCEKVTSEAVFRVFFGQTSQNLMVTSKDGSSTLLAHELVAVVVDSFHMLLHDKLLLI 382048

382049 KFTMLGKNSINILPTQSELKLLNRLKEVKRVCLEIVEKRRNELLKDQSQFKNNFLDQYL 382225