Rhesus monkey P450s (Macaca mulatta)

 

(in progress, 49 seqs so far)

Jan. 27, 2006

 

Each rhesus sequence is shown first, followed by a BLAST alignment with the Macaca sequence on top (Query) and the human sequence below (Sbjct). A FASTA-format collection of the rhesus P450s is given at the bottom of this file. One surprising finding is that two genes that are pseudogenes in humans are not pseudogenes in rhesus monkey: CYP2G2P and CYP1A8P are both full-length, intact genes.
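
For readers who want to pull the rhesus sequences out of this page programmatically, the layout described above can be handled with a short script. The following is a minimal sketch, not a tool used to build this file. It assumes that each entry starts with a ">" header, that residue lines contain only uppercase letters (plus X and *), and that the trailing "(0)", "(1)", "(2)" tags seen on some lines are exon-boundary markers. The meaning of those tags is not stated in this file, so the sketch only records where they occur. The names parse_records and to_fasta are hypothetical.

```python
import re

# A residue line: uppercase letters, X or *, possibly with stray internal
# spaces, optionally followed by a tag such as "(0)", "(1)" or "(2)".
SEQ_LINE = re.compile(r"^([A-Z*][A-Z* ]*?)\s*(?:\(([012])\))?\s*$")
# Lines that signal the start of BLAST output inside an entry.
ALIGNMENT_START = ("Score", "Length", "Identities", "Query", "Sbjct")


def parse_records(text):
    """Return a list of (header, sequence, markers) for entries with residues.

    markers is a list of (residue_count_so_far, tag) pairs. What the tag
    means biologically is an assumption and is not interpreted here.
    """
    records = []
    header, chunks, markers, in_alignment = None, [], [], False
    # The extra ">" acts as a sentinel so the last entry is flushed.
    for raw in text.splitlines() + [">"]:
        line = raw.strip()
        if line.startswith(">"):
            if header is not None and chunks:
                records.append((header, "".join(chunks), markers))
            header, chunks, markers = line[1:].strip(), [], []
            in_alignment = False
            continue
        if header is None or not line or in_alignment:
            continue
        if line.startswith(ALIGNMENT_START):
            in_alignment = True  # ignore everything up to the next header
            continue
        match = SEQ_LINE.match(line)
        if match:
            chunks.append(match.group(1).replace(" ", ""))
            if match.group(2) is not None:
                markers.append((sum(len(c) for c in chunks), int(match.group(2))))
    return records


def to_fasta(records, width=60):
    """Format parsed records as FASTA text with fixed-width sequence lines."""
    out = []
    for header, seq, _markers in records:
        out.append(">" + header)
        out.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(out) + "\n"
```

Entries that contain only BLAST output (for example the human reference headers) yield no residues and are skipped, so the result should contain the rhesus sequences only. Free-text notes between a header and its sequence are ignored because they contain lowercase letters or digits.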

 

>CYP1A1 AY635458.1

MLFRISMSATEFLLASLIFCLVFWVIRASRPRVPKGLKNPPGPW

GWPLIGHILTLGKNPHLALSRMSQRYGDVLQIRIGSTPVLVLSGLDTIRQALVQQGDD

FKGRPNLYSFTLISNGQSMSFGPDSGPVWAARRRLAQNGLKSFSIASDPASSSSCYLE

EHVSKEAEVLISKLQEQMAGPGHFNPYRYVVISVANVICAICFGQRYDHNHQELLSLV

NLSNNFGEVVGSGNPADFIPILRYLPNRSLNGFKDLNEKFHSFMQKMIKEHYKTFEKG

HIRDITDSLIEHCQEKQLDENANIQLSDEKIVNVVLDLFGAGFDTVTTAISWSLMYLV

TNPRVQRKIQEELDTVIGRSRRPRLSDRSHLPYMEAFILETFRHSSFVPFTIPHSTTR

DTSLKGFYIPKGRCVFVNQWQINHDQKLWVNPSEFLPERFITPDGAIDKVLSEKVILF

GLGKRKCIGETIARWEVFLFLAILLQRVEFSVPPGVKVDMTPIYGLTMKHACCEHFQM

QLRS

 

>CYP1A2 (Aggarwal)

MALSQSVPFSATELLLASAIFCLVFWVLRGSRPRVPKGLKSPPEPWGWPLLGHVLTLGKN
PHLALARMSQLYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGNDFKGRPDLYSFTFITDG
QSMSFSPDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAEALISRLQELM
AGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNSHEFVESASSANPVDFFP
ILRYLPNPALQRFKAFNQRFRRFLQKTVQEHYQDFDK (0)
NSVQDITGALFKHSKKGPRASGNLIPQEKTVNLVNDIFGA (1)
GFDTIATAISWSLMYLVTKPEIQRKIQKEL (1)
DAVIGRGRRPRLSDRPQLPYLEAFILETFRHSSFVPFTIPH (2)
STTRDTTLNGFYIPRECCVFINQWQVNHDP  (2)
QLWGDPSEFRPERFLTAEGTTINKPLSEKIMLFGLGKRRCIGEVLGKWEVFLFLAILLQQLEFSVPPGVKVDLTPIYGLTMKHARCEHFQAR
 
>CYP1A2 NM_000761
          Length = 515
 
 Score =  968 bits (2503), Expect = 0.0
 Identities = 472/510 (92%), Positives = 493/510 (96%)
 
Query: 1   MALSQSVPFSATELLLASAIFCLVFWVLRGSRPRVPKGLKSPPEPWGWPLLGHVLTLGKN 60
           MALSQSVPFSATELLLASAIFCLVFWVL+G RPRVPKGLKSPPEPWGWPLLGHVLTLGKN
Sbjct: 1   MALSQSVPFSATELLLASAIFCLVFWVLKGLRPRVPKGLKSPPEPWGWPLLGHVLTLGKN 60
 
Query: 61  PHLALARMSQLYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGNDFKGRPDLYSFTFITDG 120
           PHLAL+RMSQ YGDVLQIRIGSTPVLVLS LDTIRQALVRQG+DFKGRPDLY+ T ITDG
Sbjct: 61  PHLALSRMSQRYGDVLQIRIGSTPVLVLSRLDTIRQALVRQGDDFKGRPDLYTSTLITDG 120
 
Query: 121 QSMSFSPDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAEALISRLQELM 180
           QS++FS DSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEA+ALISRLQELM
Sbjct: 121 QSLTFSTDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAKALISRLQELM 180
 
Query: 181 AGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNSHEFVESASSANPVDFFP 240
           AGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKN+HEFVE+ASS NP+DFFP
Sbjct: 181 AGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNTHEFVETASSGNPLDFFP 240
 
Query: 241 ILRYLPNPALQRFKAFNQRFRRFLQKTVQEHYQDFDKNSVQDITGALFKHSKKGPRASGN 300
           ILRYLPNPALQRFKAFNQRF  FLQKTVQEHYQDFDKNSV+DITGALFKHSKKGPRASGN
Sbjct: 241 ILRYLPNPALQRFKAFNQRFLWFLQKTVQEHYQDFDKNSVRDITGALFKHSKKGPRASGN 300
 
Query: 301 LIPQEKTVNLVNDIFGAGFDTIATAISWSLMYLVTKPEIQRKIQKELDAVIGRGRRPRLS 360
           LIPQEK VNLVNDIFGAGFDT+ TAISWSLMYLVTKPEIQRKIQKELD VIGR RRPRLS
Sbjct: 301 LIPQEKIVNLVNDIFGAGFDTVTTAISWSLMYLVTKPEIQRKIQKELDTVIGRERRPRLS 360
 
Query: 361 DRPQLPYLEAFILETFRHSSFVPFTIPHSTTRDTTLNGFYIPRECCVFINQWQVNHDPQL 420
           DRPQLPYLEAFILETFRHSSF+PFTIPHSTTRDTTLNGFYIP++CCVF+NQWQVNHDP+L
Sbjct: 361 DRPQLPYLEAFILETFRHSSFLPFTIPHSTTRDTTLNGFYIPKKCCVFVNQWQVNHDPEL 420
 
Query: 421 WGDPSEFRPERFLTAEGTTINKPLSEKIMLFGLGKRRCIGEVLGKWEVFLFLAILLQQLE 480
           W DPSEFRPERFLTA+GT INKPLSEK+MLFG+GKRRCIGEVL KWE+FLFLAILLQQLE
Sbjct: 421 WEDPSEFRPERFLTADGTAINKPLSEKMMLFGMGKRRCIGEVLAKWEIFLFLAILLQQLE 480
 
Query: 481 FSVPPGVKVDLTPIYGLTMKHARCEHFQAR 510
           FSVPPGVKVDLTPIYGLTMKHARCEH QAR
Sbjct: 481 FSVPPGVKVDLTPIYGLTMKHARCEHVQAR 510

 

>CYP1A8P ortholog  possibly a functional gene in rhesus

MILNLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQLSPPGPWSFPIIGNLLQLGEHPYL

TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLLKDGEHFAGRPNMHTFSFLAEGKSL

SFSVNYGESWKLHKKIASKALRTLSNAEAKSSTCSCLLEEHVTEEVSELVTVFVELSSKN

GGFDPRNAITCAVANVVCALCFGKRYDHSDEEFLKIVKTNDDLLKASSAANPADFIPCLR

YLPLQIINAPREFYRALNGFIALHVQDHLATYDK (0)

DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVNDLFGA (1)

GFETVSTCLYWSFLYLIHYPEIQAKIQEEI (1)

DGNIGLKPPRFEDRKILPYTEAFISEVFRHASFLPFTIPHC (2)

TTADTTLNGYFIPRKTCTFINMYQVNHDE (2)

TIWDNPSLFRPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITAVLQQL

KLKKCPRAKLDLTPTYGLVMRPKPYQLEAERRSSGSSSA

 

>CYP1A8P NT_008580.9|Hs9_8737 chromosome 9 Pseudogene 43% to 1A2

          Length = 508

 

 Score =  940 bits (2430), Expect = 0.0

 Identities = 472/510 (92%), Positives = 486/510 (95%), Gaps = 2/510 (0%)

 

Query: 1   MILNLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQLSPPGPWSFPIIGNLLQLGEHPYL 60

           MIL+LAVTPGEVTTSLIILVMVFVFVRALRSKGRKQ+SPPGP SFPII NLLQLG+HPYL

Sbjct: 1   MILDLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQVSPPGP*SFPIIENLLQLGDHPYL 60

 

Query: 61  TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLLKDGEHFAGRPNMHTFSFLAEGKSL 120

           TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVL KDGEHFAGRPNMHTFSFLAEGKSL

Sbjct: 61  TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLHKDGEHFAGRPNMHTFSFLAEGKSL 120

 

Query: 121 SFSVNYGESWKLHKKIASKALRTLSNAEAKSSTCSCLLEEHVTEEVSELVTVFVELSSKN 180

           SFSVNYGESWKLHKKIASKAL T SNAEAKSSTCSC LEEHVTEE+SELVTVFVEL+SKN

Sbjct: 121 SFSVNYGESWKLHKKIASKAL*TFSNAEAKSSTCSCSLEEHVTEEISELVTVFVELTSKN 180

 

Query: 181 GGFDPRNAITCAVANVVCALCFGKRYDHSDEEFLKIVKTNDDLLKASSAANPADFIPCLR 240

           G FDPRNAITC VAN+VCALCFGKR DHSDEEFL+IVKTNDDLLKASSAANPADFIPCL

Sbjct: 181 GSFDPRNAITCVVANIVCALCFGKR*DHSDEEFLRIVKTNDDLLKASSAANPADFIPCLH 240

 

Query: 241 YLPLQIINAPREFYRALNGFIALHVQDHLATYDKDHIRDITDALINVCHNKYAATKTDTL 300

           YLPL+IINAP EFY+ALNGFIALHVQDHLATY KDHIRDITDALINVCHNKYAATKTDTL

Sbjct: 241 YLPLKIINAPLEFYQALNGFIALHVQDHLATYGKDHIRDITDALINVCHNKYAATKTDTL 300

 

Query: 301 NDSEIISTVNDLFGAGFETVSTCLYWSFLYLIHYPEIQAKIQEEIDGNIGLKPPRFEDRKILPY 360

           NDSEIISTV+DLFGAGFETVSTCL WSFLYLIHYPEIQA+IQEEI      +PPRFEDRKILPY

Sbjct: 301 NDSEIISTVSDLFGAGFETVSTCLCWSFLYLIHYPEIQARIQEEI------RPPRFEDRKILPY 358

 

Query: 361 TEAFISEVFRHASFLPFTIPHCTTADTTLNGYFIPRKTCTFINMYQVNHDETIWDNPSLF 420

           TEAF+SEVFRHASFLPFTIPH TTADTTLNGYFIPRKTCTFINMYQVNHDETIWDN SLF

Sbjct: 359 TEAFVSEVFRHASFLPFTIPHSTTADTTLNGYFIPRKTCTFINMYQVNHDETIWDNHSLF 418

 

Query: 421 RPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITAVLQQLKLKKCPRAK 480

           RPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFIT VLQQ KLKK PRAK

Sbjct: 419 RPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITTVLQQFKLKK*PRAK 478

 

Query: 481 LDLTPTYGLVMRPKPYQLEAERRSSGSSSA 510

           LDLTPTYGLVMRPK YQL+AE   SGSSSA

Sbjct: 479 LDLTPTYGLVMRPKLYQLQAELHPSGSSSA 508
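
The human (Sbjct) rows in the alignment above contain "*" characters where the rhesus (Query) rows have ordinary residues. Assuming, as is conventional for translated sequences, that "*" marks an in-frame stop codon, the following sketch lists where they fall. The function name stop_positions is hypothetical, and the parser only handles the Query/Sbjct row layout used in this file.

```python
import re

# Matches rows such as "Sbjct: 1   MILDLA... 60" (the colon is optional).
ROW = re.compile(r"^\s*(Query|Sbjct):?\s+(\d+)\s+(\S+)\s+\d+\s*$")


def stop_positions(text, row="Sbjct"):
    """Return residue coordinates of '*' characters in the chosen row.

    Coordinates follow the numbering printed on each alignment row.
    Gap columns ('-') do not consume a coordinate.
    """
    hits = []
    for line in text.splitlines():
        match = ROW.match(line)
        if not match or match.group(1) != row:
            continue
        pos = int(match.group(2))
        for ch in match.group(3):
            if ch == "-":
                continue
            if ch == "*":
                hits.append(pos)
            pos += 1
    return hits
```

Calling stop_positions on the alignment text with row="Query" gives the same check for the rhesus sequence. An empty list there would be consistent with an intact reading frame, though it does not by itself show that the gene is functional.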

 

>CYP1B1

MGTGLSPKDPWPLNLLSTQQTTLLLLLSVLVAVHVGQWLLRQRRRQLGSTPPGPFAWPLI

GNAAAVGQASHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPSF

ASFRVISGGRSMAFGHYSEHWKVQRRAAHSTMRNFSTRQLRSRQVLEGHVLSEARELVAL

LVRGSADGAFLDPRQLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL

VDVMPWLQYFPNPMRTAFREFEQLNRNFSNFVLDKFLRHCESLRPGAAPRDMMDAFILSA

EKKAARDSDDGGARLDLENVPATVTDIFGASQDTLSTALQWLLLLFIR (2)

YPDVQARVQAELDQVVGRDRLPCMDDQPNLPYVLAFLYEAMRFSSFVPVTIPHATNANTS

VLGYHIPKDTVIFVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKR

RCIGEELSKMQLFLFISILAHQCNFRANPNGPEMNFSYGLTIKPKSFKVNVTLRESMELL

DSAVQKLQAEETCQ

 

>CYP1B1 NM_000104

          Length = 543

 

 Score = 1021 bits (2639), Expect = 0.0

 Identities = 509/543 (93%), Positives = 519/543 (95%), Gaps = 1/543 (0%)

 

Query: 1   MGTGLSPKDPWPLNLLSTQQTTLLLLLSVLVAVHVGQWLLRQRRRQLGSTPPGPFAWPLI 60

           MGT LSP DPWPLN LS QQTTLLLLLSVL  VHVGQ LLRQRRRQL S PPGPFAWPLI

Sbjct: 1   MGTSLSPNDPWPLNPLSIQQTTLLLLLSVLATVHVGQRLLRQRRRQLRSAPPGPFAWPLI 60

 

Query: 61  GNAAAVGQASHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPSF 120

           GNAAAVGQA+HLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRP+F

Sbjct: 61  GNAAAVGQAAHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPAF 120

 

Query: 121 ASFRVISGGRSMAFGHYSEHWKVQRRAAHSTMRNFSTRQLRSRQVLEGHVLSEARELVAL 180

           ASFRV+SGGRSMAFGHYSEHWKVQRRAAHS MRNF TRQ RSRQVLEGHVLSEARELVAL

Sbjct: 121 ASFRVVSGGRSMAFGHYSEHWKVQRRAAHSMMRNFFTRQPRSRQVLEGHVLSEARELVAL 180

 

Query: 181 LVRGSADGAFLDPRQLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL 240

           LVRGSADGAFLDPR LTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL

Sbjct: 181 LVRGSADGAFLDPRPLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL 240

 

Query: 241 VDVMPWLQYFPNPMRTAFREFEQLNRNFSNFVLDKFLRHCESLRPGAAPRDMMDAFILSA 300

           VDVMPWLQYFPNP+RT FREFEQLNRNFSNF+LDKFLRHCESLRPGAAPRDMMDAFILSA

Sbjct: 241 VDVMPWLQYFPNPVRTVFREFEQLNRNFSNFILDKFLRHCESLRPGAAPRDMMDAFILSA 300

 

Query: 301 EKKAARDSDDGGARLDLENVPATVTDIFGASQDTLSTALQWLLLLFIRYPDVQARVQAEL 360

           EKKAA DS  GGARLDLENVPAT+TDIFGASQDTLSTALQWLLLLF RYPDVQ RVQAEL

Sbjct: 301 EKKAAGDSHGGGARLDLENVPATITDIFGASQDTLSTALQWLLLLFTRYPDVQTRVQAEL 360

 

Query: 361 DQVVGRDRLPCMDDQPNLPYVLAFLYEAMRFSSFVPVTIPHATNANTSVLGYHIPKDTVI 420

           DQVVGRDRLPCM DQPNLPYVLAFLYEAMRFSSFVPVTIPHAT ANTSVLGYHIPKDTV+

Sbjct: 361 DQVVGRDRLPCMGDQPNLPYVLAFLYEAMRFSSFVPVTIPHATTANTSVLGYHIPKDTVV 420

 

Query: 421 FVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQL 480

           FVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQL

Sbjct: 421 FVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQL 480

 

Query: 481 FLFISILAHQCNFRANPNGP-EMNFSYGLTIKPKSFKVNVTLRESMELLDSAVQKLQAEE 539

           FLFISILAHQC+FRANPN P +MNFSYGLTIKPKSFKVNVTLRESMELLDSAVQ LQA+E

Sbjct: 481 FLFISILAHQCDFRANPNEPAKMNFSYGLTIKPKSFKVNVTLRESMELLDSAVQNLQAKE 540

 

Query: 540 TCQ 542

           TCQ

Sbjct: 541 TCQ 543

 

>CYP2A23 AY635459.1

MLASGLLLVALLACLTVMVLMSVWQQRNSKGKLPPGPTPLPFIG

NYLQLNTEQMYNSIMKISERYGPVFTIHLGPRRIVVLCGYDAVKEALVDQAEEFSGRG

EQATFDWLFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIEAL

RDTQGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSAGQ

LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNRRTLDPNSPRDFIDSFLIRMQ

EEEKNPNTEFHLKNLVLTSLNLFFGGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRV

IGKNRQPKFEDQARMPYMEAVIHEIQRFGDMLPLGVAHRVIKDTKFRDFFLPKGTEVF

PMLGSVLKDPKFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSIGKRNCFGEGLARMELF

LFFTTIMQNFRFKSPQSPKDIDVSPKHVGFATIPPNYTMSFLPR

 

>CYP2A24 AY635460 (Y. Peng)

CYP2A6 best match = CYP2A24 (Y.Peng) partial 

MLASGLLLVALLACLTVMVLMSVWQQRNSKGKLPPGPTPLPFIG

NYLQLNTEQMCNSLMKISERYGPVFTIHLGPRRVVVLCGYDAVKEALVDQAEEFSGRG

EQATFDWVFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIEAL

RDTHGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLGMMLGSFQFTSTSTGQ

LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNQHTLDPNSPRDFIDSFLIRMQE

EEKNPNTEFYLKNLMMTTLNLFIAGTETVSTTLRYGFLLLMKYPEVE (1)
AKVHEEIDRVIGKNRQPKFEDRVKMPYMEAVIHEIQRFGDVIPMSLARRVNKDTKFRDFFLPK (0)
GTEVFPMLGSVLRDPRFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSI (1)
GKRNCFGEGLARMELFLFFTTIMQNFRFKSPQLPKDIDVSPKHVGFATIPPNYTMSFLPR
 
Score = 1078 (379.5 bits), Expect = 1.2e-114, P = 1.2e-114
 Identities = 204/217 (94%), Positives = 209/217 (96%)
 
Query:     2 EEKNPNTEFYLKNLVMTTLNLFIGGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRVIGK 61
             EEKNPNTEFYLKNL+MTTLNLFI GTETVSTTLRYGFLLLMK+PEVEAKVHEEIDRVIGK
Sbjct:     1 EEKNPNTEFYLKNLMMTTLNLFIAGTETVSTTLRYGFLLLMKYPEVEAKVHEEIDRVIGK 60
 
Query:    62 NRQPKFEDRAKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPKGTEVYPMLGS 121
             NRQPKFEDR KMPYMEAVIHEIQRFGDVIPMSLARRV KDTKFRDFFLPKGTEV+PMLGS
Sbjct:    61 NRQPKFEDRVKMPYMEAVIHEIQRFGDVIPMSLARRVNKDTKFRDFFLPKGTEVFPMLGS 120
 
Query:   122 VLRDPSFFSNPQDFNPQHFLNEKGQFKKSDAFVPFSIGKRNCFGEGLARMELFLFFTTVM 181
             VLRDP FFSNPQDFNPQHFL+EKGQFKKSDAFVPFSIGKRNCFGEGLARMELFLFFTT+M
Sbjct:   121 VLRDPRFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSIGKRNCFGEGLARMELFLFFTTIM 180
 
Query:   182 QNFRLKSSQSPKDIDVSPKHVGFATIPRNYTMSFLPR 218
             QNFR KS Q PKDIDVSPKHVGFATIP NYTMSFLPR
Sbjct:   181 QNFRFKSPQLPKDIDVSPKHVGFATIPPNYTMSFLPR 217
 

>CYP2C43 AB212264.1

MDSLVVLVLCLSCLLLLSLWRQRSGRGKLPPGPTPLPVIGNILK

IGIKDVSKSLTNLSKVYGPVFTLYFGLERMVVLHGYEAVKEALIDLGEEFSGRGHFPL

FERANRRFGLVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK

ASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKFNENAKILSSPWIQIYNN

FSPIIDYFPGTHNKLLKNIAFVKSYILEKVKEHQESMDMNNPRDFIDCFLIKMEKEKH

NQQSEFNIENLENTAVDLFAAGTETTSTTLRYALLLLLKHPEVAAKVQEEIEHVIGRN

RSPCMQDRSRMPYTDAVVHEIQRYIDLLPTSVPHAVTCDVKFRNYLIPKGTTILISLT

SVLRDNKEFPNPEMFDPRHFLDEGGNFKNSNYFMPFSAGKRICVGEALARMELFLFLT

SILQNFNLKSLVDLKDLDTTPVFNGFVSVPPIYQLCFIPV

 

>CYP2C74 variant (S.Sarva) missing exon 1, 4 aa diffs to 2C74
FSKVYGPVFTVYFGMNPVVVLHGYETVKEALIDNAEEFSGRGILPISERITNGL (1)
GIISSNGKRWKETRRFSLTTLRNFGMGKRSIEDRVQEEARCLVEELRKTK (1)
ASPCDPTFILGCAPCNVICSVVFQKRFDYKDENFLTLMKRFTVNFRILTSPWIQ (0)
VCNNFPLLIDCFPGTHNKLLKNVALTKSYIRKKVKEHQATLDVNNPRDFIDCFLIKMEQ (0)
EKDNQQSEFTIENLVGTVADLFVAGTETTSTTLRYGLLLLLKHPEVT (1)
AKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVIHEIQRYIDLVPTGVPHAVTTDIKFRNYLIPK (0)
GTIIITLLTSVLQDDKEFPNPKIFDPGHFLDENGNFKKSDYFMPFSA (1)
GKRICAGEGLARMELFLFLTTILQNFNLKSVADLKNLNTTSATRGIISLPPSYQICFIPV

 

>CYP2C8 M17397
          Length = 490
 
 Score =  823 bits (2126), Expect = 0.0
 Identities = 398/434 (91%), Positives = 413/434 (95%)
 Frame = +1
 
Query: 1    FSKVYGPVFTVYFGMNPVVVLHGYETVKEALIDNAEEFSGRGILPISERITNGLGIISSN 180
            FSKVYGPVFTVYFGMNP+VV HGYE VKEALIDN EEFSGRG  PIS+RIT GLGIISSN
Sbjct: 57   FSKVYGPVFTVYFGMNPIVVFHGYEAVKEALIDNGEEFSGRGNSPISQRITKGLGIISSN 116
 
Query: 181  GKRWKETRRFSLTTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCN 360
            GKRWKE RRFSLT LRNFGMGKRSIEDRVQEEA CLVEELRKTKASPCDPTFILGCAPCN
Sbjct: 117  GKRWKEIRRFSLTNLRNFGMGKRSIEDRVQEEAHCLVEELRKTKASPCDPTFILGCAPCN 176
 
Query: 361  VICSVVFQKRFDYKDENFLTLMKRFTVNFRILTSPWIQVCNNFPLLIDCFPGTHNKLLKN 540
            VICSVVFQKRFDYKD+NFLTLMKRF  NFRIL SPWIQVCNNFPLLIDCFPGTHNK+LKN
Sbjct: 177  VICSVVFQKRFDYKDQNFLTLMKRFNENFRILNSPWIQVCNNFPLLIDCFPGTHNKVLKN 236
 
Query: 541  VALTKSYIRKKVKEHQATLDVNNPRDFIDCFLIKMEQEKDNQQSEFTIENLVGTVADLFV 720
            VALT+SYIR+KVKEHQA+LDVNNPRDF+DCFLIKMEQEKDNQ+SEF IENLVGTVADLFV
Sbjct: 237  VALTRSYIREKVKEHQASLDVNNPRDFMDCFLIKMEQEKDNQKSEFNIENLVGTVADLFV 296
 
Query: 721  AGTETTSTTLRYGLLLLLKHPEVTAKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVIHEIQ 900
            AGTETTSTTLRYGLLLLLKHPEVTAKVQEEIDHVIGRHRSPCMQDRSHMPYTDAV+HEIQ
Sbjct: 297  AGTETTSTTLRYGLLLLLKHPEVTAKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVVHEIQ 356
 
Query: 901  RYIDLVPTGVPHAVTTDIKFRNYLIPKGTIIITLLTSVLQDDKEFPNPKIFDPGHFLDEN 1080
            RY DLVPTGVPHAVTTD KFRNYLIPKGT I+ LLTSVL DDKEFPNP IFDPGHFLD+N
Sbjct: 357  RYSDLVPTGVPHAVTTDTKFRNYLIPKGTTIMALLTSVLHDDKEFPNPNIFDPGHFLDKN 416
 
Query: 1081 GNFKKSDYFMPFSAGKRICAGEGLARMELFLFLTTILQNFNLKSVADLKNLNTTSATRGI 1260
            GNFKKSDYFMPFSAGKRICAGEGLARMELFLFLTTILQNFNLKSV DLKNLNTT+ T+GI
Sbjct: 417  GNFKKSDYFMPFSAGKRICAGEGLARMELFLFLTTILQNFNLKSVDDLKNLNTTAVTKGI 476
 
Query: 1261 ISLPPSYQICFIPV 1302
            +SLPPSYQICFIPV
Sbjct: 477  VSLPPSYQICFIPV 490
 

>CYP2C75 AY635463.1

MDSLVVLVLCLSCLLLLSLWRQRSGRGKFPPGPTPLPVIGNILQ

IDIKDVSKSLTNLSKVYGPVFTLYFGLERMVVLHGYEAVKEALIDLGEEFSGRGHFPL

ADRANRGFGIVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK

GSPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLKVMEKLNENVKILSSPWIQICNN

FPPFIDYFPGAHNKLLKNIAFLKSYILEKVKEHQESMDMNNPRDFIDCFLMKMEKEKH

NQQSEFTIENLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRN

RSPCMQDRSHMPYTDAVVHEIQRYIDLLPTNLPHAVTCDVKFRNYLIPKGTTILISLT

SVLHDNKEFPNPEMFDPRHFLDEGGNFKKSNYFMPFSAGKRICVGEALARMELFLFLT

SVLQNFNLKSLVDPKDLDTTPVVNGFASVPPFYQLCFIPV

 

>CYP2C19 best hit (N. Abdeltawab) same as Liao’s hit below

MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNVSKV
YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW
KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKTASPCDPTFILGCAPCNVICSV
IFHNRFDYKDQRFLNLMEKFNENLRILSSPWIQ (1)

CNNFPALIDYLPGSHNKVVKNFAYVKS
YVLERIKEHQESLDMDNPRDFIDCFLIKMEEKHNLQSEFTIESLIATVTDMFGAGTETTS
TTLRFGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYIDLIP
TNLPHAVTCDVKFRNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDRSGNFKKSD
YFMPFSAGKRMCVGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVPPLY
QLCFIPV

 

>CYP2C18 M61856

          Length = 490

 

 Score =  951 bits (2459), Expect = 0.0

 Identities = 468/490 (95%), Positives = 481/490 (98%), Gaps = 3/490 (0%)

 

Query: 1   MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNVSKV 60

           MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTN SKV

Sbjct: 1   MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNFSKV 60

 

Query: 61  YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW 120

           YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGE+FSGRGSFPVAEKVNKGLGILFSNGKRW

Sbjct: 61  YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEEFSGRGSFPVAEKVNKGLGILFSNGKRW 120

 

Query: 121 KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKT-ASPCDPTFILGCAPCNVICS 179

           KEIRRF LMTLRNFGMGKRSIEDRVQEEA CLVEELRKT ASPCDPTFILGCAPCNVICS

Sbjct: 121 KEIRRFCLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTNASPCDPTFILGCAPCNVICS 180

 

Query: 180 VIFHNRFDYKDQRFLNLMEKFNENLRILSSPWIQ-CNNFPALIDYLPGSHNKVVKNFAYV 238

           VIFH+RFDYKDQRFLNLMEKFNENLRILSSPWIQ CNNFPALIDYLPGSHNK+ +NFAY+

Sbjct: 181 VIFHDRFDYKDQRFLNLMEKFNENLRILSSPWIQVCNNFPALIDYLPGSHNKIAENFAYI 240

 

Query: 239 KSYVLERIKEHQESLDMDNPRDFIDCFLIKME-EKHNLQSEFTIESLIATVTDMFGAGTE 297

           KSYVLERIKEHQESLDM++ RDFIDCFLIKME EKHN QSEFT+ESLIATVTDMFGAGTE

Sbjct: 241 KSYVLERIKEHQESLDMNSARDFIDCFLIKMEQEKHNQQSEFTVESLIATVTDMFGAGTE 300

 

Query: 298 TTSTTLRFGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYID 357

           TTSTTLR+GLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYID

Sbjct: 301 TTSTTLRYGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYID 360

 

Query: 358 LIPTNLPHAVTCDVKFRNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDRSGNFK 417

           L+PTNLPHAVTCDVKF+NYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLD+SGNFK

Sbjct: 361 LLPTNLPHAVTCDVKFKNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDKSGNFK 420

 

Query: 418 KSDYFMPFSAGKRMCVGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVP 477

           KSDYFMPFSAGKRMC+GEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVP

Sbjct: 421 KSDYFMPFSAGKRMCMGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVP 480

 

Query: 478 PLYQLCFIPV 487

           PLYQLCFIPV

Sbjct: 481 PLYQLCFIPV 490

 

>searched with 2C29 differs from 2C43, 2C74, 2C75 (Liao)

MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNFSKV
YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW
KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKTNASPCDPTFILGCAPCNVICS
VIFHNRFDYKDQRFLNLMEKFNENLRILSSPWIQ

(gap in seq, missing one exon)

EKHNLQSEFTIESLIATVTDMFGAG
TETTSTTLRFGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRY
IDLIPTNLPHAVTCDVKFRNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDRSGN
FKKSDYFMPFSAGKRMCVGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGR
VPPLYQLCFIPV

 

>CYP2C9 M61857

          Length = 490

 

 Score =  679 bits (1753), Expect = 0.0

 Identities = 347/490 (70%), Positives = 386/490 (78%), Gaps = 58/490 (11%)

 

Query: 1   MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNFSKV 60

           MD  V LVLCLSCL LLSLWRQSSGRG+LP GPTPLP+IGNILQ+ +KD+SKSLTN SKV

Sbjct: 1   MDSLVVLVLCLSCLLLLSLWRQSSGRGKLPPGPTPLPVIGNILQIGIKDISKSLTNLSKV 60

 

Query: 61  YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW 120

           YGPVFT+YFGLKPIVVLHGYEAVKEALID GE+FSGRG FP+AE+ N+G GI+FSNGK+W

Sbjct: 61  YGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAERANRGFGIVFSNGKKW 120

 

Query: 121 KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKTNASPCDPTFILGCAPCNVICS 180

           KEIRRFSLMTLRNFGMGKRSIEDRVQEEA CLVEELRKT ASPCDPTFILGCAPCNVICS

Sbjct: 121 KEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICS 180

 

Query: 181 VIFH--------------------------------NRF----DY---------KDQRFL 195

           +IFH                                N F    DY         K+  F+

Sbjct: 181 IIFHKRFDYKDQQFLNLMEKLNENIKILSSPWIQICNNFSPIIDYFPGTHNKLLKNVAFM 240

 

Query: 196 N--LMEKFNENLRIL--SSPW---------IQVEKHNLQSEFTIESLIATVTDMFGAGTE 242

              ++EK  E+   +  ++P          ++ EKHN  SEFTIESL  T  D+FGAGTE

Sbjct: 241 KSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHNQPSEFTIESLENTAVDLFGAGTE 300

 

Query: 243 TTSTTLRFGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYID 302

           TTSTTLR+ LLLLLK+PEVTAKVQEEIE V+GRNRSPCMQDRSHMPYTDAVVHE+QRYID

Sbjct: 301 TTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYID 360

 

Query: 303 LIPTNLPHAVTCDVKFRNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDRSGNFK 362

           L+PT+LPHAVTCD+KFRNYLIPKGTTI+ SLTSVLH++KEFPNPEMFDP HFLD  GNFK

Sbjct: 361 LLPTSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFK 420

 

Query: 363 KSDYFMPFSAGKRMCVGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVP 422

           KS YFMPFSAGKR+CVGE LA MELFLFLT+ILQNFNLKS VDPK++D TP+ N F  VP

Sbjct: 421 KSKYFMPFSAGKRICVGEALAGMELFLFLTSILQNFNLKSLVDPKNLDTTPVVNGFASVP 480

 

Query: 423 PLYQLCFIPV 432

           P YQLCFIPV

Sbjct: 481 PFYQLCFIPV 490

 

>CYP2B30 AY635461.1

MELSVLLFLALLTGLLLLLVQRHPNAHGRLPPGPCPLPLLGNLL

QMDRRGLLRSFLRFREKYGDVFTVYLGPRPVVMLCGVEAIREALVDNAEAFSGRGKIA

ITDPVFQGYGVVFANGNRWKVLRRFSLTTMRDFGMGKRSVEERIQEEAQCLIEELRKS

KGALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFE

LLSGFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEK

SNPHSEFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAERIYKEIEQVIGP

HRPPALDDRAKMPYTEAVIHEIQRFADLLPMGVPHIVTQQTSFRGYIIPKDTEVFPLL

STALHDPHYFEKPDTFNPDHFLDANGALKKNEAFIPFSLGRRMCLGEGIARNELFLFF

TTILQNFSVASPVAPEDIDLTPQESGVGKIPPTYQIRFLPR

 

>CYP2B6 search (M.Puljic) partial

GALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFELLS

GFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEKSNPHS

EFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAER

 

Identities = 162/162 (100%), Positives = 162/162 (100%), Gaps = 0/162 (0%)

 

Query  1    GALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFELLS  60

            GALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFELLS

Sbjct  162  GALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFELLS  221

 

Query  61   GFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEKSNPHS  120

            GFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEKSNPHS

Sbjct  222  GFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEKSNPHS  281

 

Query  121  EFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAER  162

            EFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAER

Sbjct  282  EFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAER  323

>CYP2F6 AY952296.1

MDSISTAILLLLLALVCLLLTLSSRDKXKLPPGPRPLPLLGNLLLLRSQNMLTSLTQ

LSKEYGSVYTVHLGPRRVVVLSGYQAVKEALVDQGEEFSGRGDYPVFFNFTKGN

GIAFSNGDRWKVLRRFSIQILRNFGMGKRSIEERILEEGSFLLAELRKTE

GEPFDPTFVLSRSVSNIICSVLFGSRFDYDDERLLTVIRLINDNFQIMSSPWGE (0)

LYNIFPSLLDWVPGPHQRIFQNFKRLRDLIAHRVHDQQASLDPRSPRDFIDCFLTKMAE

EKEDPLSHFHMDTLLMTTHNLLFGGTETVGTTLRHAFLALMKYPKVQ

ARVQEEIDLVVGRTRLPTLEDRAAMPYTDAVIHEVQRFADIIPMNLPHRVIRDTAFRXFLIPK

GTDIITLLNTVHYDPSQFLXPQEFNPEHFLDANQSFKKSPAFMPFSA

GRRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRSFQLCLCPR

 

>CYP2D42 (Vasser) also AY635464.1
MELDALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ (0)
LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRPPVPINQVLGVGPRSQ (1)
GVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFTDQA (1)
GRPFRPNSLLDKAVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESGFLRE (0)
VLNAVPLLLRIPGLAGKVLRSQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEK (0)
AKGNPESSFNEENLRIVVADLFSAGMVTTSTTLAWGLLLMILHPDVQR (1)
RVQQEIDNVIGQVRRPEMGDQARMPYTTAVIHEVQRFGDIVPLGVTHMTSRDIELQGFLIPK (0)
GTTLFTNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVKPEAFLPFSA (1)
GRRACLGEPLARMELFLFFTCLLQRFSFSVPAGQPRPSHHGVFAFLVTPSPYELCAVPR*
 
>CYP2D6 NM_000106
Length = 497
Score =  940 bits (2430), Expect = 0.0
Identities = 465/497 (93%), Positives = 476/497 (95%)
 
Query: 1   MELDALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ 60
           M L+ALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDF+NTPYCFDQ
Sbjct: 1   MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFQNTPYCFDQ 60
 
Query: 61  LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRPPVPINQVLGVGPRSQGVF 120
           LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVT GEDTADRPPVPI Q+LG GPRSQGVF
Sbjct: 61  LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTHGEDTADRPPVPITQILGFGPRSQGVF 120
 
Query: 121 LARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFTDQAGRPFRPNSLLDK 180
           LARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAF + +GRPFRPN LLDK
Sbjct: 121 LARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFANHSGRPFRPNGLLDK 180
 
Query: 181 AVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESGFLREVLNAVPLLLRIPGLAGKV 240
           AVSNVIASLT GRRFEYDDPRFLRL DL  E LKEESGFLREVLNAVP+LL IP LAGKV
Sbjct: 181 AVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKEESGFLREVLNAVPVLLHIPALAGKV 240
 
Query: 241 LRSQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFNEENLRIVVA 300
           LR QKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFN+ENLRIVVA
Sbjct: 241 LRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFNDENLRIVVA 300
 
Query: 301 DLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEIDNVIGQVRRPEMGDQARMPYTTAVI 360
           DLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEID+VIGQVRRPEMGDQA MPYTTAVI
Sbjct: 301 DLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEIDDVIGQVRRPEMGDQAHMPYTTAVI 360
 
Query: 361 HEVQRFGDIVPLGVTHMTSRDIELQGFLIPKGTTLFTNLSSVLKDEAVWEKPFRFHPEHF 420
           HEVQRFGDIVPLG+THMTSRDIE+QGF IPKGTTL TNLSSVLKDEAVWEKPFRFHPEHF
Sbjct: 361 HEVQRFGDIVPLGMTHMTSRDIEVQGFRIPKGTTLITNLSSVLKDEAVWEKPFRFHPEHF 420
 
Query: 421 LDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFTCLLQRFSFSVPAGQPRPSHHGV 480
           LDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFT LLQ FSFSVP GQPRPSHHGV
Sbjct: 421 LDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFTSLLQHFSFSVPTGQPRPSHHGV 480
 
Query: 481 FAFLVTPSPYELCAVPR 497
           FAFLV+PSPYELCAVPR
Sbjct: 481 FAFLVSPSPYELCAVPR 497

 

>CYP2E1 AY635465.1

MSALGVSVALLVWVAVLLLVSIWRQVHSSWNLPPGPFPLPIIGN

LFQLELKNIPKSFTRLAQRFGPVFTLYVGSRRVVVVHGYKAVREVLLDHKDEFSGRGD

IPAFHAHRDRGIIFNNGPTWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRK

TQGQPFDPTFLIGCAPCNVIADILFRKHFDYNDEKFLRLMYLFNENFQLLSTPWLQLY

NNFPSLLHYLPGSHRKVMKNVAEIKEYVSERVKEHLQSLDPNCPRDLTDCLLVEMEKE

KHSAERLYTMDGITVTVADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG

PSRIPAIKDRQEMPYMDAVVHEIQRFITLVPSNLPHEATRDTIFRGYIIPKGTVIVPT

LDSVLYDNQEFPDPEKFKPEHFLDESGKFKYSDYFKPFSAGKRVCAGEGLARMELFLL

LSAILQHFNLKPLVDPKDIDISPVNIGFGCIPPRFKLCVIPRS

 

>CYP2E1 3 aa diffs, partial (F.Zhang)

MSALGVSVALLVWVAVLLLVSIWRQVHSSWNLPPGPFPLPIIGNLFQLEWKNIPKSFTRL
AQRFGPVFTLYVGSRRVVVVHGYKAVREVLLDHKDEFSGRGDIPAFHAHRDRGIIFNNRP
TWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRKTQGQPFDPTFLIGCSPCNVI
ADILFRKHFDYNDEKFLYNNFPSLLHYLPGSHRKVMKNVAEIKEYVSERVKEHLQSLDPN
CPRDLTDCLLVEMEKEKHSAERLYTMDGITVTVADLFFAGTETTSITLRYGLLILMKYPE
IE

 

>CYP2E1 J02843
          Length = 493
 
 Score =  576 bits (1484), Expect = e-167
 Identities = 283/322 (87%), Positives = 294/322 (91%), Gaps = 20/322 (6%)
 Frame = -1
 
Query: 906 MSALGVSVALLVWVAVLLLVSIWRQVHSSWNLPPGPFPLPIIGNLFQLEWKNIPKSFTRL 727
           MSALGV+VALLVW A LLLVS+WRQVHSSWNLPPGPFPLPIIGNLFQLE KNIPKSFTRL
Sbjct: 1   MSALGVTVALLVWAAFLLLVSMWRQVHSSWNLPPGPFPLPIIGNLFQLELKNIPKSFTRL 60
 
Query: 726 AQRFGPVFTLYVGSRRVVVVHGYKAVREVLLDHKDEFSGRGDIPAFHAHRDRGIIFNNRP 547
           AQRFGPVFTLYVGS+R+VV+HGYKAV+E LLD+KDEFSGRGD+PAFHAHRDRGIIFNN P
Sbjct: 61  AQRFGPVFTLYVGSQRMVVMHGYKAVKEALLDYKDEFSGRGDLPAFHAHRDRGIIFNNGP 120
 
Query: 546 TWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRKTQGQPFDPTFLIGCSPCNVI 367
           TWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRKTQGQPFDPTFLIGC+PCNVI
Sbjct: 121 TWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRKTQGQPFDPTFLIGCAPCNVI 180
 
Query: 366 ADILFRKHFDYNDEKF--------------------LYNNFPSLLHYLPGSHRKVMKNVA 247
           ADILFRKHFDYNDEKF                    LYNNFPS LHYLPGSHRKV+KNVA
Sbjct: 181 ADILFRKHFDYNDEKFLRLMYLFNENFHLLSTPWLQLYNNFPSFLHYLPGSHRKVIKNVA 240
 
Query: 246 EIKEYVSERVKEHLQSLDPNCPRDLTDCLLVEMEKEKHSAERLYTMDGITVTVADLFFAG 67
           E+KEYVSERVKEH QSLDPNCPRDLTDCLLVEMEKEKHSAERLYTMDGITVTVADLFFAG
Sbjct: 241 EVKEYVSERVKEHHQSLDPNCPRDLTDCLLVEMEKEKHSAERLYTMDGITVTVADLFFAG 300
 
Query: 66  TETTSITLRYGLLILMKYPEIE 1
           TETTS TLRYGLLILMKYPEIE
Sbjct: 301 TETTSTTLRYGLLILMKYPEIE 322

 

>CYP2G2P best hit (Li Chen). Note: this does not look like a pseudogene;
see red regions below.
exon 2 = trace archive file 456149111
MELGGAVTIFLALCLSCLLVLIAWKRMNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK (0)
LKEKYGPLFTVYMGLWPVVVLCGHEAVKEALIDQADEFSGRGKLASIEQNFQGH (1)
GVALANGERWRILRRFSLTILRDFGMGKRSIEERILEEASYLLEEFRKTK (1)
GAPIDPTFLLSRTVSNVISSVVFGSRFDYEDKQFLNLLRLINESFIEMSTPWAQ (0)
LYDMYSGIMQYLPGRHNRVYYLIEQLKDFIASRVKINEASFDSQNPRDFIDCFLIKMHQ (0)
DKNNPRTEFNLKNLVLTALNLFFAGTETVSSTLRYGFLLLMKHPEVE (1)
ARIHEEINQVIGPHRLPSVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNVIRDTQFRGYLLPK (0)
GTDVFPLLGSVLKDPKYFRYPEAFYPQHFLDEQGRFKKNEAFVPFSS (1)
GKRICLGEAMARMELFLYFTSILQNFSPRSLVPPADIDITPKLSGFGNIPPTYELCLVAR

 

Score =  933 bits (2411), Expect = 0.0

 Identities = 463/496 (93%), Positives = 476/496 (95%), Gaps = 2/496 (0%)

 

Query: 1   MELGGAVTIFLALCLSCLLVLIAWKRMNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK 60

           ME+GGAVTIFLALCLSCLL+LIAWK MNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK

Sbjct: 1   MEMGGAVTIFLALCLSCLLILIAWK*MNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK 60

 

Query: 61  LKEKYGPLFTVYMGLWPVVVLCGHEAVKEALIDQADEFSGRGKLASIEQNFQGHGVALAN 120

           L+EKY P+FTVYMG  PVVVLCGHEAVKEAL+DQADEFSGRG+LASI+QNFQGHGVALAN

Sbjct: 61  LREKYSPVFTVYMGPRPVVVLCGHEAVKEALVDQADEFSGRGELASIKQNFQGHGVALAN 120

 

Query: 121 GERWRILRRFSLTILRDFGMGKRSIEERILEEASYLLEEFRKTKGAPIDPTFLLSRTVSN 180

           GERWRIL RF LTILRDFGMGKRSIEERI EEASYLLEEFRKTKGAPIDP FLLSRTVSN

Sbjct: 121 GERWRIL*RFPLTILRDFGMGKRSIEERIQEEASYLLEEFRKTKGAPIDPIFLLSRTVSN 180

 

Query: 181 VISSVVFGSRFDYEDKQFLNLLRLINESFIEMSTPWAQLYDMYSGIMQYLPGRHNRVYYL 240

           VISSVVF SRFDYEDKQFLNLLRLINESFIEMSTPWAQLYDMYSGIMQYLPGRHN +YYL

Sbjct: 181 VISSVVFRSRFDYEDKQFLNLLRLINESFIEMSTPWAQLYDMYSGIMQYLPGRHNLIYYL 240

 

Query: 241 IEQLKDFIASRVKINEASFDSQNPRDFIDCFLIKMHQDKNNPRTEFNLKNLVLTALNLFF 300

           +E+LKDFIASRVKINEASFD QNPRDFIDCFLIKMHQDKNNPRTEFNLKNLVLT LNLFF

Sbjct: 241 VEELKDFIASRVKINEASFDPQNPRDFIDCFLIKMHQDKNNPRTEFNLKNLVLTTLNLFF 300

 

Query: 301 AGTETVSSTLRYGFLLLMKHPEVEARIHEEINQVIGPHRLPSVDDRVKMPYTDAVIHEIQ 360

           AGTETVSSTLRYGFLLLMKHPEVEA+IHEEINQVIGPHRLP VDDRVKMPYTDAVIHEIQ

Sbjct: 301 AGTETVSSTLRYGFLLLMKHPEVEAKIHEEINQVIGPHRLPRVDDRVKMPYTDAVIHEIQ 360

 

Query: 361 RLVDIVPMGVPHNVIRDTQFRGYLLPKGTDVFPLLGSVLKDPKYFRYPEAFYPQHFLDEQ 420

           RLVDIVPMGVPHN+IRDTQFRGYLLPKGTDVFPLLGSVLKDPKYFRYP+AFYPQHFLDEQ

Sbjct: 361 RLVDIVPMGVPHNLIRDTQFRGYLLPKGTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQ 420

 

Query: 421 GRFKKNEAFVPFSS--GKRICLGEAMARMELFLYFTSILQNFSPRSLVPPADIDITPKLS 478

           GRFKKNEAFVPFSS  GKRICLGEAM RMELFLYFTS LQNFS  SLVPP DIDITPKLS

Sbjct: 421 GRFKKNEAFVPFSSGRGKRICLGEAMDRMELFLYFTSTLQNFSLHSLVPPVDIDITPKLS 480

 

Query: 479 GFGNIPPTYELCLVAR 494

           GFGNIPPTYELCLVAR

Sbjct: 481 GFGNIPPTYELCLVAR 496

 

>CYP2J2 best hit (Z. Zhang) partial

GLIMSSGQIWKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKI
NNAVSNIICSITFGERFDYQDSQFQELLKLLDEVTYLEASKTCQLYNIFPWLMKFLPGPH
QTLFSNWEKLKLFVSHMIEKHRKDWNPAETRDFIDAYLKEMSKHTGNSTSSFHEENLICS
TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQAEIDRVIGQGQQPSTAARESMPYTNA
VIHEVQRMGNIVPLNVPREVTVDTTLAGYHLPKRACLGEQLARTELFIFFTSLVQKFTFR
PPNNEKLSLKFRMGITISPVSHHLC

 

>CYP2J2 NM_000775 chr 1
          Length = 497
 
 Score =  618 bits (1594), Expect = e-180
 Identities = 311/373 (83%), Positives = 319/373 (85%), Gaps = 48/373 (12%)
 
Query: 1   GLIMSSGQIWKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKI 60
           GLIMSSGQ WKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKI
Sbjct: 125 GLIMSSGQAWKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKI 184
 
Query: 61  NNAVSNIICSITFGERFDYQDSQFQELLKLLDEVTYLEASKTCQLYNIFPWLMKFLPGPH 120
           NNAVSNIICSITFGERF+YQDS FQ+LLKLLDEVTYLEASKTCQLYN+FPW+MKFLPGPH
Sbjct: 185 NNAVSNIICSITFGERFEYQDSWFQQLLKLLDEVTYLEASKTCQLYNVFPWIMKFLPGPH 244
 
Query: 121 QTLFSNWEKLKLFVSHMIEKHRKDWNPAETRDFIDAYLKEMSKHTGNSTSSFHEENLICS 180
           QTLFSNW+KLKLFVSHMI+KHRKDWNPAETRDFIDAYLKEMSKHTGN TSSFHEENLICS
Sbjct: 245 QTLFSNWKKLKLFVSHMIDKHRKDWNPAETRDFIDAYLKEMSKHTGNPTSSFHEENLICS 304
 
Query: 181 TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQAEIDRVIGQGQQPSTAARESMPYTNA 240
           TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQ EIDRVIGQGQQPSTAARESMPYTNA
Sbjct: 305 TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQVEIDRVIGQGQQPSTAARESMPYTNA 364
 
Query: 241 VIHEVQRMGNIVPLNVPREVTVDTTLAGYHLP---------------------------- 272
           VIHEVQRMGNI+P NVPREVTVDTTLAGYHLP                            
Sbjct: 365 VIHEVQRMGNIIPQNVPREVTVDTTLAGYHLPKGTMILTNLTALHRDPTEWATPDTFNPD 424
 
Query: 273 --------------------KRACLGEQLARTELFIFFTSLVQKFTFRPPNNEKLSLKFR 312
                               KRACLGEQLARTELFIFFTSL+QKFTFRPPNNEKLSLKFR
Sbjct: 425 HFLENGQFKKREAFMPFSIGKRACLGEQLARTELFIFFTSLMQKFTFRPPNNEKLSLKFR 484
 
Query: 313 MGITISPVSHHLC 325
           MGITISPVSH LC
Sbjct: 485 MGITISPVSHRLC 497

 

>CYP2R1 (G.Zhu) partial

IFSLDLGGISTVVLNGYDVVKECLVHQSGIFADRPCLPLFMKMTKMGGLLNSRYGQGWVE
HRRLAVNSFRYFGYGQKSFESKILEETKFFTDAIETYKGRPFDFKQLITSAVSNITNLII
FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNASVVYD
FLSRLIEKASVNRKPQLPQHFVDAYFDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT
TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKFKMPYTEAVLHEVLRFCNIV
PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK
EALVPFSLGRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL
ICAERR

 

Alignment of the two sequences, 97% identical

Query: 1   IFSLDLGGISTVVLNGYDVVKECLVHQSGIFADRPCLPLFMKMTKMGGLLNSRYGQGWVE 60
           IFSLDLGGISTVVLNGYDVVKECLVHQS IFADRPCLPLFMKMTKMGGLLNSRYG+GWV+
Sbjct: 76  IFSLDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLLNSRYGRGWVD 135
 
Query: 61  HRRLAVNSFRYFGYGQKSFESKILEETKFFTDAIETYKGRPFDFKQLITSAVSNITNLII 120
           HRRLAVNSFRYFGYGQKSFESKILEETKFF DAIETYKGRPFDFKQLIT+AVSNITNLII
Sbjct: 136 HRRLAVNSFRYFGYGQKSFESKILEETKFFNDAIETYKGRPFDFKQLITNAVSNITNLII 195
 
Query: 121 FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNASVVYD 180
           FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNA+VVYD
Sbjct: 196 FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNAAVVYD 255
 
Query: 181 FLSRLIEKASVNRKPQLPQHFVDAYFDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT 240
           FLSRLIEKASVNRKPQLPQHFVDAY DEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT
Sbjct: 256 FLSRLIEKASVNRKPQLPQHFVDAYLDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT 315
 
Query: 241 TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKFKMPYTEAVLHEVLRFCNIV 300
           TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDK KMPYTEAVLHEVLRFCNIV
Sbjct: 316 TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKCKMPYTEAVLHEVLRFCNIV 375
 
Query: 301 PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK 360
           PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK
Sbjct: 376 PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK 435
 
Query: 361 EALVPFSLGRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL 420
           EALVPFSLGRRHCLGE LARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL
Sbjct: 436 EALVPFSLGRRHCLGEHLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL 495
 
Query: 421 ICAERR 426
           ICAERR
Sbjct: 496 ICAERR 501

 

>CYP2S1 exons 2,3 from CO649282.1, gene fragmented on multiple scaffolds

MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMR (0)

LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH (1)

GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE (1)

GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAMVRAAGGTLLGVSSRGGQ (0)

TYEMCSWFLWPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ (0)

EEQNPDTEFTNKNMLMTVIYLLFAGTMTVSATVGYTLLLLMKYPHVQ (1)

KRVREELTQELGSGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ (0)

GTEVFPLLGSILHDPSIFKHPEEFNPDHFLDADGRFRKHEAFLPFSL (1)

GKRVCLGEGLAKAELFLFFTTILQAFSLESPCPLDSLSLKPTISGLFNIPPAFQLQVRPTDLHSTTQTT*
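
Entries written one exon per line can be joined into a single protein string before alignment. The sketch below assumes that the number in parentheses at the end of an exon line is the phase of the following intron; that reading is an assumption, and the helper name is hypothetical. Markers placed at the start of a line, as in some pseudogene entries, are not handled.

```python
import re

# Trailing "(0)", "(1)", "(2)", "(1?)" or "(?)" marker at the end of an exon line.
PHASE = re.compile(r"\s*\(([012?])\??\)\s*$")

def join_exons(lines):
    """Concatenate exon lines into one protein string and collect the phase markers."""
    parts, phases = [], []
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith(">"):
            continue  # skip blank lines and the FASTA-style header
        m = PHASE.search(line)
        if m:
            phases.append(m.group(1))
            line = line[:m.start()]
        parts.append(line.replace(" ", ""))
    return "".join(parts), phases

# First two exons of the CYP2S1 entry, copied from this file.
entry = [
    ">CYP2S1 example (first two exons only)",
    "MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMR (0)",
    "",
    "LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH (1)",
]
protein, phases = join_exons(entry)
print(len(protein), phases)
```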

 

>CYP2S1 AC011510 one exon per line 78% to mouse 2s1 49% to 2B6 47% to 2A13

          Length = 504

 

 Score =  970 bits (2508), Expect = 0.0

 Identities = 485/503 (96%), Positives = 494/503 (98%)

 

Query: 1   MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMRL 60

           MEATGTWALLLALALLLLLTLALSGTRARG LPPGPTPLPLLGNLLQLRPGALYSGLMRL

Sbjct: 1   MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMRL 60

 

Query: 61  SKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFSN 120

           SKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFSN

Sbjct: 61  SKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFSN 120

 

Query: 121 GERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATSN 180

           GERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATSN

Sbjct: 121 GERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATSN 180

 

Query: 181 VVCSLLFGLRFSYEDKEFQAMVRAAGGTLLGVSSRGGQTYEMCSWFLWPLPGPHKQLLHH 240

           VVCSLLFGLRFSYEDKEFQA+VRAAGGTLLGVSS+GGQTYEM SWFL PLPGPHKQLLHH

Sbjct: 181 VVCSLLFGLRFSYEDKEFQAVVRAAGGTLLGVSSQGGQTYEMFSWFLRPLPGPHKQLLHH 240

 

Query: 241 VSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQEEQNPDTEFTNKNMLMTVIYLL 300

           VSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQEEQNP TEFTNKNMLMTVIYLL

Sbjct: 241 VSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQEEQNPGTEFTNKNMLMTVIYLL 300

 

Query: 301 FAGTMTVSATVGYTLLLLMKYPHVQKRVREELTQELGSGQAPSLGDRTRLPYTDAVLHEA 360

           FAGTMTVS TVGYTLLLLMKYPHVQK VREEL +ELG+GQAPSLGDRTRLPYTDAVLHEA

Sbjct: 301 FAGTMTVSTTVGYTLLLLMKYPHVQKWVREELNRELGAGQAPSLGDRTRLPYTDAVLHEA 360

 

Query: 361 QRLLALVPMGIPRTLMRTTRFRGYTLPQGTEVFPLLGSILHDPSIFKHPEEFNPDHFLDA 420

           QRLLALVPMGIPRTLMRTTRFRGYTLPQGTEVFPLLGSILH+P+IFKHPEEFNPD FLDA

Sbjct: 361 QRLLALVPMGIPRTLMRTTRFRGYTLPQGTEVFPLLGSILHEPNIFKHPEEFNPDRFLDA 420

 

Query: 421 DGRFRKHEAFLPFSLGKRVCLGEGLAKAELFLFFTTILQAFSLESPCPLDSLSLKPTISG 480

           DGRFRKHEAFLPFSLGKRVCLGEGLAKAE+FLFFTTILQAFSLESPCP D+LSLKPT+SG

Sbjct: 421 DGRFRKHEAFLPFSLGKRVCLGEGLAKAEVFLFFTTILQAFSLESPCPPDTLSLKPTVSG 480

 

Query: 481 LFNIPPAFQLQVRPTDLHSTTQT 503

           LFNIPPAFQLQVRPTDLHSTTQT

Sbjct: 481 LFNIPPAFQLQVRPTDLHSTTQT 503

 

>CYP2T2P ortholog, SCAFFOLD100362 (+) 38209-41795

frameshift in exon 4 after VIC, numerous other defects

MIAGIAALLLWLLVLALARWG*GGCRARMRGSLPPRPRPLPLLGNLQLQSGGTDHALHS (?)

LSGRWGPVFTAQLGPRPAVVLCGYAALRDALVLQADAFSGRGSMAVFERFTRGH (1)

GIFLSNGPRWWTLRNFAVGALKELGLGTRTIQAHVLEEAACLLDEMQATI (1)

GAPFDPMRLLDNAVSNVICX

LVFGNRYGYGDPEFLRLLNLFSDNFRIMSSRWGE (0)

(?) SLMDWLPGRHRRIFRNF

SELWVFISEQIQQHWQMRQPAEPRDFINCLTRWVRRGSQ

QDPESHFQEETSVMMTHLFFGGTETSTTLCYGLLVLLKYPEVA (1)

AKVQELDPVVGWRCAPSPDDHQRLPYTNAVLLQIQRFISVVPLGLPRX

TLNTHLHSHCLPK (1)

GTFVIPLLVTAHXDRTQFKDPDCFNATNFLDKGKSQGNDPFMPFAS (1)

(?) GKQMCLGAGLAHLEIFLFLTATLPRFRLLPVVNPGTINLT

QFTGLGSVPPAFQLQLVAC
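
The defects noted for this pseudogene appear in the translation as `*` (in-frame stop) and `X` (frameshift or unresolved residue). A small sketch, assuming that reading of the two symbols, that lists them with 1-based positions:

```python
def find_defects(protein):
    """Return (position, symbol) for internal '*' and any 'X' in a translation.

    A '*' as the final character is treated as the normal stop codon and skipped.
    """
    defects = []
    last = len(protein) - 1
    for i, residue in enumerate(protein):
        if residue == "X" or (residue == "*" and i != last):
            defects.append((i + 1, residue))
    return defects

# Exon 1 and the frameshifted exon 4 fragment of the CYP2T2P entry above.
exon1 = "MIAGIAALLLWLLVLALARWG*GGCRARMRGSLPPRPRPLPLLGNLQLQSGGTDHALHS"
exon4_start = "GAPFDPMRLLDNAVSNVICX"
print(find_defects(exon1), find_defects(exon4_start))
```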

 

>CYP2T2P AC008537

          Length = 457

 

Score = 2103 (740.3 bits), Expect = 7.1e-222, P = 7.1e-222

 Identities = 413/482 (85%), Positives = 427/482 (88%)

 

Query:     1 MIAGIAALLLWLLVLALARWG*GGCRARMRGSLPPRPRPLPLLGNLQLQSGGTDHALHSL 60

             M AGIAALLLWLLVLA A WG*GGCRA+MRGSLPPRPRPLPLLGNLQLQSGG D ALHSL

Sbjct:     1 MXAGIAALLLWLLVLAPAWWG*GGCRAQMRGSLPPRPRPLPLLGNLQLQSGGLDRALHSL 60

 

Query:    61 SGRWGPVFTAQLGPRPAVVLCGYAALRDALVLQADAFSGRGSMAVFERFTRGHGIFLSNG 120

             SGRWG VFT +LGPRPAV LCGYAALRDALVLQADA SGRGSMAVFERFTRG+GI  SN

Sbjct:    61 SGRWGRVFTVRLGPRPAVGLCGYAALRDALVLQADAVSGRGSMAVFERFTRGNGILFSNR 120

 

Query:   121 PRWWTLRNFAVGALKELGLGTRTIQAHVLEEAACLLDEMQATIGAPFDPMRLLDNAVSNV 180

             P WWTLRNFA+GALK+ GLGTRT++A VLEEAACLLDE QATIGAPFDP+RLLDNAVSNV

Sbjct:   121 PCWWTLRNFALGALKKFGLGTRTVEARVLEEAACLLDEFQATIGAPFDPVRLLDNAVSNV 180

 

Query:   181 ICXLVFGNRYGYGDPEFLRLLNLFSDNFRIMSSRWGESLMDWLPGRHRRIFRNFSELWVF 240

             IC LVFGNRY YGDPEFLRLLNLFSDNF I+SSRWGESLMDWLPG H RIFRNFSEL V

Sbjct:   181 ICSLVFGNRYRYGDPEFLRLLNLFSDNFCIISSRWGESLMDWLPGPHHRIFRNFSELRV- 239

 

Query:   241 ISEQIQQHWQMRQPAEPRDFINCLTRWVRRGSQQDPESHFQEETSVMMTHLFFGGTET-S 299

             ISEQIQ+HWQMRQPAEPRDFI+CLTRWVR G QQDPESHFQE TSVM TH FFG TET S

Sbjct:   240 ISEQIQRHWQMRQPAEPRDFIDCLTRWVRHG-QQDPESHFQE*TSVMTTHFFFGVTETTS 298

 

Query:   300 TTLCYGLLVLLKYPEVAAKVQELDPVVGWRCAPSPDDHQRLPYTNAVLLQIQRFISVVPL 359

             TTLCYGLL+LLKY EVAAKVQELDPVVGWR APS D    LPY NAVLL+IQ FISVVPL

Sbjct:   299 TTLCYGLLILLKYLEVAAKVQELDPVVGWRPAPSLDYRVCLPYANAVLLEIQCFISVVPL 358

 

Query:   360 GLPRX-TLNTHLHSHCLPKGTFVIPLLVTAHXDRTQFKDPDCFNATNFLDKGKSQGNDPF 418

             GLPR  TL+THLHSHCLPKGTFVIPLLVTAH D TQFKDPDCFN TNFLDKGK QGND F

Sbjct:   359 GLPRTLTLDTHLHSHCLPKGTFVIPLLVTAHRDPTQFKDPDCFNPTNFLDKGKFQGNDAF 418

 

Query:   419 MPFASGKQMCLGAGLAHLEIFLFLTATLPRFRLLPVVNPGTINLTQFTGLGSVPPAFQLQ 478

             MPFA  KQMCLG GLAH  IFLFLTATL RF LLPVV PGTINLTQ TGLGSVPP FQLQ

Sbjct:   419 MPFAPAKQMCLGTGLAHSGIFLFLTATLQRFCLLPVVRPGTINLTQCTGLGSVPPDFQLQ 478

 

Query:   479 LVAC 482

              VAC

Sbjct:   479 PVAC 482

 

>CYP2U1 (li Chen) note gc boundary between exons 7,8
MSSPGPPQPPAEDPPWPARLLRAPLGLLRMDPSGDALLLCGLVAVLGWSWLRRRRARGIP
PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVVGPQVLLAHLARVYGSIFSF
FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEK (1)
GVVFAHYGPIWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSI
ISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLMVNICPWLYYLPFGP
FKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLF
YIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ (1)
EKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSGNT (1)
VLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGI (1)

GKRVCMGEQLAKMELFLMFVSLMQSFAFALPEKSKKPLLTGRFGLTLAPHPFNITISRR*

 

Score = 1098 bits (2841), Expect = 0.0

 Identities = 535/544 (98%), Positives = 540/544 (99%)

 

Query: 1   MSSPGPPQPPAEDPPWPARLLRAPLGLLRMDPSGDALLLCGLVAVLGWSWLRRRRARGIP 60

           MSSPGP QPPAEDPPWPARLLRAPLGLLR+DPSG ALLLCGLVA+LGWSWLRRRRARGIP

Sbjct: 1   MSSPGPSQPPAEDPPWPARLLRAPLGLLRLDPSGGALLLCGLVALLGWSWLRRRRARGIP 60

 

Query: 61  PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVVGPQVLLAHLARVYGSIFSF 120

           PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSV+GPQVLLAHLARVYGSIFSF

Sbjct: 61  PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVIGPQVLLAHLARVYGSIFSF 120

 

Query: 121 FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEKGVVFAHYGPIWRQQRKF 180

           FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEKGVVFAHYGP+WRQQRKF

Sbjct: 121 FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEKGVVFAHYGPVWRQQRKF 180

 

Query: 181 SHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSIISNAVSNIICSLCFGQR 240

           SHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSIISNAVSNIICSLCFGQR

Sbjct: 181 SHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSIISNAVSNIICSLCFGQR 240

 

Query: 241 FDYTNSEFKKMLGFMSRGLEICLNSQVLMVNICPWLYYLPFGPFKELRQIEKDITSFLKK 300

           FDYTNSEFKKMLGFMSRGLEICLNSQVL+VNICPWLYYLPFGPFKELRQIEKDITSFLKK

Sbjct: 241 FDYTNSEFKKMLGFMSRGLEICLNSQVLLVNICPWLYYLPFGPFKELRQIEKDITSFLKK 300

 

Query: 301 IIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLFYIIGDLFIAGTDTTTNS 360

           IIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLFYIIGDLFIAGTDTTTNS

Sbjct: 301 IIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLFYIIGDLFIAGTDTTTNS 360

 

Query: 361 LLWCLLYMSLNPDVQEKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLA 420

           LLWCLLYMSLNPDVQEKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLA

Sbjct: 361 LLWCLLYMSLNPDVQEKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLA 420

 

Query: 421 IPHMTSGNTVLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETF 480

           IPHMTS NTVLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETF

Sbjct: 421 IPHMTSENTVLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETF 480

 

Query: 481 IPFGIGKRVCMGEQLAKMELFLMFVSLMQSFAFALPEKSKKPLLTGRFGLTLAPHPFNIT 540

           IPFGIGKRVCMGEQLAKMELFLMFVSLMQSFAFALPE SKKPLLTGRFGLTLAPHPFNIT

Sbjct: 481 IPFGIGKRVCMGEQLAKMELFLMFVSLMQSFAFALPEDSKKPLLTGRFGLTLAPHPFNIT 540

 

Query: 541 ISRR 544

           ISRR

Sbjct: 541 ISRR 544

 

>CYP2W1 Macaca mulatta rhesus monkey (Mahrous)

LSERYGPVFTVHLGCQKTVVLTGFEVVKEALAGPGQELADRPPIAIFQLIQRGG (1)

GIFFSSGARWRAARQFTVRALHSLGVGRKPVADKILQELKCLLGQLDGYR (1)

GQPFPLALLGWAPSNITFTLLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQ (0)

LFNVYPWLGALLQLHRPVLRKIEEVRAILRTLLEARRPHMRPGDPVCSYVDALIQQGQ (0)

GDDPEGLFAEDNAVACTLDMVMAGTETTSATLQWAALLMGRHPDVQ (1)

GRVQEELDRVLRRGRPPQPEDQQVLPYTSAVLHEVQRFITLLPHVPRCTATDMQLGGFLLPK (0)

GTPVIPLLTSVLLDETQWQTPDQFNPGHFLDADGHFVKQEAFLPFSA (1)

GRRVCVGERLARTELFLLFAGLLQKYYLLPPPGVSPASLDTTPAQAFTMRPRAQALCAVPRP

 

Missing part =

ggggatgaccccgagggctxgtttgctgaggacaacgcagtggcx

G  D  D  P  E  G  L  F  A  E  D  N  A  V  A
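
The fragment above contains two unread bases, written `x`. A minimal sketch, assuming the standard genetic code, that translates such a fragment and only calls a residue when every possible base at `x` gives the same amino acid (the function name and the `unknown` parameter are illustrative):

```python
from itertools import product

BASES = "tcag"
# Standard genetic code, codons ordered t, c, a, g at each of the three positions.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna, unknown="x"):
    """Translate frame 1; a codon with an unknown base gives 'X' unless all expansions agree."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3].lower()
        # Expand each unknown base to all four possibilities.
        options = [BASES if b == unknown else b for b in codon]
        residues = {CODE["".join(p)] for p in product(*options)}
        protein.append(residues.pop() if len(residues) == 1 else "X")
    return "".join(protein)

fragment = "ggggatgaccccgagggctxgtttgctgaggacaacgcagtggcx"
peptide = translate(fragment)
print(peptide)
```

The final codon `gcx` is alanine whatever the missing base is. The codon `txg` could encode L, S, W or a stop, so the sketch reports it as `X`; the L given in the translation above matches the human residue at that position.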

   

>CYP2W1 AC073957.7 chromosome 7 clone RP11-449P15 40% to 2F1

          Length = 490

 

 Score =  785 bits (2027), Expect = 0.0

 Identities = 395/432 (91%), Positives = 403/432 (93%), Gaps = 14/432 (3%)

 

Query: 1   LSERYGPVFTVHLGCQKTVVLTGFEVVKEALAGPGQELADRPPIAIFQLIQRGGGIFFSS 60

           LSERYGPVFTVHLG QKTVVLTGFE VKEALAGPGQELADRPPIAIFQLIQRGGGIFFSS

Sbjct: 59  LSERYGPVFTVHLGRQKTVVLTGFEAVKEALAGPGQELADRPPIAIFQLIQRGGGIFFSS 118

 

Query: 61  GARWRAARQFTVRALHSLGVGRKPVADKILQELKCLLGQLDGYRGQPFPLALLGWAPSNI 120

           GARWRAARQFTVRALHSLGVGR+PVADKILQELKCL GQLDGYRG+PFPLALLGWAPSNI

Sbjct: 119 GARWRAARQFTVRALHSLGVGREPVADKILQELKCLSGQLDGYRGRPFPLALLGWAPSNI 178

 

Query: 121 TFTLLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQLFNVYPWLGALLQLHRPVLRKIE 180

           TF LLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQLFNVYPWLGALLQLHRPVLRKIE

Sbjct: 179 TFALLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQLFNVYPWLGALLQLHRPVLRKIE 238

 

Query: 181 EVRAILRTLLEARRPHMRPGDPVCSYVDALIQQGQ--------------VCTLDMVMAGT 226

           EVRAILRTLLEARRPH+ PGDPVCSYVDALIQQGQ               CTLDMVMAGT

Sbjct: 239 EVRAILRTLLEARRPHVCPGDPVCSYVDALIQQGQGDDPEGLFAEANAVACTLDMVMAGT 298

 

Query: 227 ETTSATLQWAALLMGRHPDVQGRVQEELDRVLRRGRPPQPEDQQVLPYTSAVLHEVQRFI 286

           ETTSATLQWAALLMGRHPDVQGRVQEELDRVL  GR P+ EDQQ LPYTSAVLHEVQRFI

Sbjct: 299 ETTSATLQWAALLMGRHPDVQGRVQEELDRVLGPGRTPRLEDQQALPYTSAVLHEVQRFI 358

 

Query: 287 TLLPHVPRCTATDMQLGGFLLPKGTPVIPLLTSVLLDETQWQTPDQFNPGHFLDADGHFV 346

           TLLPHVPRCTA D QLGGFLLPKGTPVIPLLTSVLLDETQWQTP QFNPGHFLDA+GHFV

Sbjct: 359 TLLPHVPRCTAADTQLGGFLLPKGTPVIPLLTSVLLDETQWQTPGQFNPGHFLDANGHFV 418

 

Query: 347 KQEAFLPFSAGRRVCVGERLARTELFLLFAGLLQKYYLLPPPGVSPASLDTTPAQAFTMR 406

           K+EAFLPFSAGRRVCVGERLARTELFLLFAGLLQ+Y LLPPPGVSPASLDTTPA+AFTMR

Sbjct: 419 KREAFLPFSAGRRVCVGERLARTELFLLFAGLLQRYRLLPPPGVSPASLDTTPARAFTMR 478

 

Query: 407 PRAQALCAVPRP 418

           PRAQALCAVPRP

Sbjct: 479 PRAQALCAVPRP 490

 

>CYP2AB1P SCAFFOLD46808:34-204, SCAFFOLD101629:758-8003 (no ESTs found)

missing exons 2, 3, 7; exon 2 = Trace archive 650631246, 555635842

frameshift after THG and gc boundary in exon 2 (gc also in human)

exon 3 = 497888434

MLSLLSGLALLAISFLLLKLGTFCWDRNRLPPGPFPFPILGNLWQLRFQLHPETLLQ (0)

LAQTHG

VCLFTVWVSPIPIVVLSGFRAVKEALVSNSEQFSGRPLTSLFQDLFGEQ (1)

GIVCSRRHMWWQQRRFCLVTLQGLGLGKLALEVQLQKQAAELVEAFRQEL (1)

SRSFDPQVSIVRSTVRVIGALVFGHHFLSEDPIFQELTQAIDFGLALVRTVWHW (0)

LHDVFPRALCHLPGSHREIFRYQGVVRSFTRREITGRKLKALEALKDFINCSLAQISK (0)

AMDEPVSTFHEENLVQVVIDLFLGGTNTTATTQRWALVYMIQHGAVQ (1?)

 

GTIILPLCRGSVLYDPECWETPPQFNPGHFLDKDGNFVANEAFLPFSA (1)

GHCVCPGDQLARMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTLQPQPQEICAVPR

 

>CYP2AB1 comparing only the known sequence, leaving out missing exon 7.

Length = 423, human has lost the KYG motif (THG in rhesus), lost heme Cys

 

 Score = 1794 (631.5 bits), Expect = 3.6e-189, P = 3.6e-189

 Identities = 353/428 (82%), Positives = 376/428 (87%)

 

Query:     1 MLSLLSGLALLAISFLLLKLGTFCWDRNRLPPGPFPFPILGNLWQLRFQLHPETLLQLAQ 60

             MLSLLSGLALLAISFLLLKLGTFCWDR+ LPPGP PFPILGNLWQL FQLHPETLLQLAQ

Sbjct:     1 MLSLLSGLALLAISFLLLKLGTFCWDRSCLPPGPLPFPILGNLWQLCFQLHPETLLQLAQ 60

 

Query:    61 THGVCLFTVWVSPIPIVVLSGFRAVKEALVSNSEQFSGRPLTSLFQDLFGEQGIVCSRRH 120

             +    +FTVWV PIP+ VLSGF+ VKEALVSNSEQFSGR LT LFQDLFGE+GI+CS  H

Sbjct:    61 S----VFTVWVGPIPVAVLSGFQVVKEALVSNSEQFSGRSLTPLFQDLFGERGIICSSGH 116

 

Query:   121 MWWQQRRFCLVTLQGLGLGKLALEVQLQKQAAELVEAFRQELSRSFDPQVSIVRSTVRVI 180

              W Q+RRFCLV + GLGLGKLALEVQLQK+AAEL EAFRQE  R FDPQVSIVRSTVRVI

Sbjct:   117 TWRQKRRFCLVMI*GLGLGKLALEVQLQKEAAELAEAFRQEQGRPFDPQVSIVRSTVRVI 176

 

Query:   181 GALVFGHHFLSEDPIFQELTQAIDFGLALVRTVWHWLHDVFPRALCHLPGSHREIFRYQG 240

             GALVFGHHFL EDPIFQELTQAIDFGLA V TVW  L+DVFP ALCHLPG H+EIFRYQ

Sbjct:   177 GALVFGHHFLLEDPIFQELTQAIDFGLAFVSTVWRRLYDVFPWALCHLPGPHQEIFRYQE 236

 

Query:   241 VVRSFTRREITGRKLKALEALKDFINCSLAQISKAMDEPVSTFHEENLVQVVIDLFLGGT 300

             VV S   +EIT  KL+A EA +DFI+C LAQISKAMD+PVSTF++ENLV VVIDLFLGGT

Sbjct:   237 VVLSLIHQEITRHKLRAPEAPRDFISCYLAQISKAMDDPVSTFNQENLV*VVIDLFLGGT 296

 

Query:   301 NTTATTQRWALVYMIQHGAVQGTIILPLCRGSVLYDPECWETPPQFNPGHFLDKDGNFVA 360

             +TTATT  WAL++MIQHGAVQGTIILP    SVLYDPECWETP QFNPGHF DKDGNFVA

Sbjct:   297 DTTATTLCWALIHMIQHGAVQGTIILPNL-ASVLYDPECWETPRQFNPGHFSDKDGNFVA 355

 

Query:   361 NEAFLPFSAGHCVCPGDQLARMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTLQPQP 420

             NEAFLPFSAGH V P DQLA+MELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGT QPQP

Sbjct:   356 NEAFLPFSAGHRVYPADQLAQMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTWQPQP 415

 

Query:   421 QEICAVPR 428

             QEICAVPR

Sbjct:   416 QEICAVPR 423

 

>CYP2AC1P SCAFFOLD 55146 (+) 91919    103589

Note: the upstream neighbor is the rhag gene, which also flanks the 31-gene cluster in Xenopus. That large gene cluster may have originated from the CYP2AC1 gene. This gene is on human chromosome 6. In humans, rhag is about 22 kb upstream of CYP2AC1.

Exon 1 = SCAFFOLD 70822:43519-43602

Exon 3 = SCAFFOLD 103481:817-966

1 MSEFDASAILPIRVLILIFILSIKKFMTEASKQLSPPGPRPLLVIGNLYFLNLKRPYQTMLE (0)

 

3 GIAFFHGETWKTMRWFSLTTLQNFGMDEWIIEDTIIEECQNLIQNSEFHR

4 GKSFEMKTIMNASVVNIIVLVLPGKWFDYQDSQFLRLLALIGENVKLIGGLRIAVN (1)

5 SFQYVSFWGVLLKSHKTVFRNRDELFSFIRMIFLDHCHKLDKNDPRSFTDAFLVTQQE (0)

6 ENDTFADHFSDENLMALVNNLFTTGTETTASTLPWGILLVICLRSRV (1)

7 KKVHNEVTKVARSAQP*LAHQTQMPHTDAVSHEVQRFANILPTSLPHATPTNIFKNYYIPK (0)

8 ATEVIILLASVRRDQAQWEKPDTFNPEHFLTSKGKFIKREAFLPFTV (1)

9 GRRMCAGESSAR MELFLFFTSLLQ KFTFQPPLGVSHLDLDLSLDIGFTT*

 

>CYP3A64 AY582531.1

MDLIPDLAVETWLLLAVTLVLLYLYGTHSHGLFKKLGIPGPTPL

PLLGNILSYRKGFWTFDMECYKKYGKVWGFYDGRQPVLAITDPNMIKTVLVKECYSVF

TNRRPFGPVGFMKNAISIAEDEEWKRIRSLLSPTFTSGKLKEMVPIIAKYGDVLVRNL

RREAETGKPVTLKDVFGAYSMDVITSTSFGVNIDSLNNPQDPFVENTKKLLRFDFLDP

FFLSITIFPFIIPILEVLNISIFPREVTSFLRKSVKRIKESRLKDTQKHRVDFLQLMI

DSQNSKETESHKALSDQELVAQSIIFIFAGYETTSSVLSFIIYELATHPDVQQKLQEE

IDTVLPNKAPPTYDTVLQMEYLDMVVNETLRIFPIAMRLERVCKKDVEINGIFIPKGV

VVMIPSYALHHDPKYWPEPEKFLPERFSKKNNDNIDPYIYTPFGSGPRNCIGMRFALM

NMKLAIIRVLQNFSFKPCKETQIPLKLRLGGLLQTEKPIVLKIESRDGTVSGA

 

>CYP3A43 ortholog? partial assembly (Aggarwal) 72% to 3A64
MDLIPNFAMETWVLVATSLVLL (2)
YIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLR (0)
GLWKFDRECNEKYGEMWG (2)
LYEGQQPMLVIMDPDMIKTVLVKECYSVFTNRM (0)
PLGPMGFMKSALSFAEDEEWKRIRTLLSPAFTSVKFKE  (0)
MVPIISQCGDMLVRSLRREAENSKPTNLKE
 
>CYP3A43 AC011904 one exon per line
          Length = 504
 
 Score =  348 bits (893), Expect = 5e-99
 Identities = 168/174 (96%), Positives = 171/174 (98%)
 
Query: 1   MDLIPNFAMETWVLVATSLVLLYIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLRGLWKF 60
           MDLIPNFAMETWVLVATSLVLLYIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLRGLW F
Sbjct: 1   MDLIPNFAMETWVLVATSLVLLYIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLRGLWNF 60
 
Query: 61  DRECNEKYGEMWGLYEGQQPMLVIMDPDMIKTVLVKECYSVFTNRMPLGPMGFMKSALSF 120
           DRECNEKYGEMWGLYEGQQPMLVIMDPDMIKTVLVKECYSVFTN+MPLGPMGF+KSALSF
Sbjct: 61  DRECNEKYGEMWGLYEGQQPMLVIMDPDMIKTVLVKECYSVFTNQMPLGPMGFLKSALSF 120
 
Query: 121 AEDEEWKRIRTLLSPAFTSVKFKEMVPIISQCGDMLVRSLRREAENSKPTNLKE 174
           AEDEEWKRIRTLLSPAFTSVKFKEMVPIISQCGDMLVRSLR+EAENSK  NLKE
Sbjct: 121 AEDEEWKRIRTLLSPAFTSVKFKEMVPIISQCGDMLVRSLRQEAENSKSINLKE 174
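
The differences in an alignment block can be listed directly from its Query and Sbjct rows. A sketch, assuming the two rows are equal-length aligned strings with `-` for gaps (the helper name is hypothetical):

```python
def substitutions(query, sbjct, q_start=1):
    """List mismatches as 'R162Q': query residue, query position, subject residue."""
    diffs = []
    pos = q_start
    for q, s in zip(query, sbjct):
        if q != "-" and s != "-" and q != s:
            diffs.append(f"{q}{pos}{s}")
        if q != "-":
            pos += 1  # the position only advances on real query residues
    return diffs

# Last block of the CYP3A43 alignment above (Query starts at residue 121).
query = "AEDEEWKRIRTLLSPAFTSVKFKEMVPIISQCGDMLVRSLRREAENSKPTNLKE"
sbjct = "AEDEEWKRIRTLLSPAFTSVKFKEMVPIISQCGDMLVRSLRQEAENSKSINLKE"
diffs = substitutions(query, sbjct, q_start=121)
print(diffs)
```

Positions here follow the rhesus (Query) numbering; pass the Sbjct start instead and swap the arguments to number by the human sequence.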
 
>CYP4A11 match (ramy.Naguib) partial seq. Missing exon 1 (added later)
MSVSVLSPSRLLGGVSGILQVASLLILLLLLIKAAQLYLHRQWLLKAFQQFPCSPSHWLFGHKQE (0)
FQQDQELQRIRKWVEMFPSACPLWLWGGKARVQLHDPDYMKVILGRS (1)
DPKSQDPYRFLAPWI (1)
GYGLLLLNGQTWFQHRRMLTPAFHYDILKAYVALMADSVRVML (0)
DKWEKLLGQDSPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDR (2)
DSQSYIQAISDLNNLVFSRVRNVFHQNDTIYSLTSTGRWTHRACQLAHQHT (1)
DQVIQLRKAQLQKEGELEKVKRKKHLDFLDILLLAK (0)
MENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHGLLGDGASITW (2)
NHLDQMPYTTMCIKEALRLYPPVPGISRELSTPVTFPDGRSLPK (1)
GITVMLSIYGLHHNPKVWPNPE (0)
VFDPSRFAPGSAQHSHAFLPFSGGSR (2)
NCIGKQFAMNELKVATALTLLRFELLPDPTRIPIPMARLVLKSKNGIHLRLRRLPNPCEDKDQL
 
The alignment of the two proteins:
 

Score = 1010 bits (2612), Expect = 0.0

 Identities = 491/519 (94%), Positives = 499/519 (96%)

 

Query: 1   MSVSVLSPSRLLGGVSGILQVASLLILLLLLIKAAQLYLHRQWLLKAFQQFPCSPSHWLF 60

           MSVSVLSPSRLLG VSGILQ ASLLILLLLLIKA QLYLHRQWLLKA QQFPC PSHWLF

Sbjct: 1   MSVSVLSPSRLLGDVSGILQAASLLILLLLLIKAVQLYLHRQWLLKALQQFPCPPSHWLF 60

 

Query: 61  GHKQEFQQDQELQRIRKWVEMFPSACPLWLWGGKARVQLHDPDYMKVILGRSDPKSQDPY 120

           GH QE QQDQELQRI+KWVE FPSACP WLWGGK RVQL+DPDYMKVILGRSDPKS   Y

Sbjct: 61  GHIQELQQDQELQRIQKWVETFPSACPHWLWGGKVRVQLYDPDYMKVILGRSDPKSHGSY 120

 

Query: 121 RFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDILKAYVALMADSVRVMLDKWEKLLGQD 180

           RFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDILK YV LMADSVRVMLDKWE+LLGQD

Sbjct: 121 RFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDILKPYVGLMADSVRVMLDKWEELLGQD 180

 

Query: 181 SPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDRDSQSYIQAISDLNNLVFSRVRNVFHQND 240

           SPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDR+SQSYIQAISDLNNLVFSRVRN FHQND

Sbjct: 181 SPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDRNSQSYIQAISDLNNLVFSRVRNAFHQND 240

 

Query: 241 TIYSLTSTGRWTHRACQLAHQHTDQVIQLRKAQLQKEGELEKVKRKKHLDFLDILLLAKM 300

           TIYSLTS GRWTHRACQLAHQHTDQVIQLRKAQLQKEGELEK+KRK+HLDFLDILLLAKM

Sbjct: 241 TIYSLTSAGRWTHRACQLAHQHTDQVIQLRKAQLQKEGELEKIKRKRHLDFLDILLLAKM 300

 

Query: 301 ENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHGLLGDGAS 360

           ENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIH LLGDGAS

Sbjct: 301 ENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHSLLGDGAS 360

 

Query: 361 ITWNHLDQMPYTTMCIKEALRLYPPVPGISRELSTPVTFPDGRSLPKGITVMLSIYGLHH 420

           ITWNHLDQMPYTTMCIKEALRLYPPVPGI RELSTPVTFPDGRSLPKGI V+LSIYGLHH

Sbjct: 361 ITWNHLDQMPYTTMCIKEALRLYPPVPGIGRELSTPVTFPDGRSLPKGIMVLLSIYGLHH 420

 

Query: 421 NPKVWPNPEVFDPSRFAPGSAQHSHAFLPFSGGSRNCIGKQFAMNELKVATALTLLRFEL 480

           NPKVWPNPEVFDPSRFAPGSAQHSHAFLPFSGGSRNCIGKQFAMNELKVATALTLLRFEL

Sbjct: 421 NPKVWPNPEVFDPSRFAPGSAQHSHAFLPFSGGSRNCIGKQFAMNELKVATALTLLRFEL 480

 

Query: 481 LPDPTRIPIPMARLVLKSKNGIHLRLRRLPNPCEDKDQL 519

           LPDPTRIPIP+ARLVLKSKNGIHLRLRRLPNPCEDKDQL

Sbjct: 481 LPDPTRIPIPIARLVLKSKNGIHLRLRRLPNPCEDKDQL 519

 

>CYP4A22 search (Puljic) same seq as the 4A11 hit above

MSVSVLSPSRLLGGVSGILQVASLLILLLLLIKAAQLYLHRQWLLKAFQQFPCSPSHWLF

GHKQEFQQDQELQRIRKWVEMFPSACPLWLWGGKARVQLHDPDYMKVILGRSGYGLLLLN

GQTWFQHRRMLTPAFHYDILKAYVALMADSVRVMLDKWEKLLGQDSPLEVFQHVSLMTLD

TIMKCAFSHQGSIQVDRDSQSYIQAISDLNNLVFSRVRNVFHQNDTIYSLTSTGRWTHRA

CQLAHQHTETK*SN*GRLNYRRRGSWRRSRGRSTWISWTSSSWPKWRMGASCQTRTSVPK

WTHSCSRATTPQPVGSPGFCMLWPHTPSIRRGAGKRSMASWVMEPPSPGENHLDQMPYTT

MCIKEALRLYPPVPGISRELSTPVTFPDGRSLPKGITVMLSIYGLHHNPKVWPNPEVFDP

SRFAPGSAQHSHAFLPFSGGSRNCIGKQFAMNELKVATALTLLRFELLPDPTRIPIPMAR

LVLKSKNGIHLRLRRLPNPCEDKDQL

 

>CYP4B1

MVPSFLSLRLSCLGLWASGLILVLGFLKLIRLLLRRQRLAKAMGNFPGPPTHWLFGHALE (0)

IQQTGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRG (1)

DPKAPDVYDFFLQWI (1)

GRGLLVLEGPKWFQHRKLLTPGFHYDVLKPYVALFAESTRVML (0)

DKWEEKAQEGKSFDIFCDVGHMALNTLMKCTFGRGDTGLGHR (2)

DSSYYLAVSDLTLLMQQRLVSFHYHNDFIYWLTPHGRRFLRACQVAHDHT (1)

DQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGAR (0)

DEDDSKLSDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDSFQW (2)

DDLGKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPA (1)

GSLISMHIYALHRNSAVWPDPE (0)

VFDPLRFSTENASKRHPFAFMPFSAGPR (2)

NCIGQQFAMSEMKVVTAMCLLHFEFSLDPSRLPIKMLQLVLRSKNGIHLHLKPLGPGSGK

 

>CYP4B1 NM_000779

          Length = 511

 

 Score = 1020 bits (2638), Expect = 0.0

 Identities = 490/511 (95%), Positives = 495/511 (96%)

 

Query: 1   MVPSFLSLRLSCLGLWASGLILVLGFLKLIRLLLRRQRLAKAMGNFPGPPTHWLFGHALE 60

           MVPSFLSL  S LGLWASGLILVLGFLKLI LLLRR+ LAKAM  FPGPPTHWLFGHALE

Sbjct: 1   MVPSFLSLSFSSLGLWASGLILVLGFLKLIHLLLRRRTLAKAMDKFPGPPTHWLFGHALE 60

 

Query: 61  IQQTGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRGDPKAPDVYDFFLQ 120

           IQ+TGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRGDPKAPDVYDFFLQ

Sbjct: 61  IQETGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRGDPKAPDVYDFFLQ 120

 

Query: 121 WIGRGLLVLEGPKWFQHRKLLTPGFHYDVLKPYVALFAESTRVMLDKWEEKAQEGKSFDI 180

           WIGRGLLVLEGPKW QHRKLLTPGFHYDVLKPYVA+F ESTR+MLDKWEEKA+EGKSFDI

Sbjct: 121 WIGRGLLVLEGPKWLQHRKLLTPGFHYDVLKPYVAVFTESTRIMLDKWEEKAREGKSFDI 180

 

Query: 181 FCDVGHMALNTLMKCTFGRGDTGLGHRDSSYYLAVSDLTLLMQQRLVSFHYHNDFIYWLT 240

           FCDVGHMALNTLMKCTFGRGDTGLGHRDSSYYLAVSDLTLLMQQRLVSF YHNDFIYWLT

Sbjct: 181 FCDVGHMALNTLMKCTFGRGDTGLGHRDSSYYLAVSDLTLLMQQRLVSFQYHNDFIYWLT 240

 

Query: 241 PHGRRFLRACQVAHDHTDQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGARDEDDSKL 300

           PHGRRFLRACQVAHDHTDQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGARDEDD KL

Sbjct: 241 PHGRRFLRACQVAHDHTDQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGARDEDDIKL 300

 

Query: 301 SDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDSFQWDDL 360

           SDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQD FQWDDL

Sbjct: 301 SDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDFFQWDDL 360

 

Query: 361 GKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPAGSLISMHIYALHRNSAVWP 420

           GKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPAGSLISMHIYALHRNSAVWP

Sbjct: 361 GKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPAGSLISMHIYALHRNSAVWP 420

 

Query: 421 DPEVFDPLRFSTENASKRHPFAFMPFSAGPRNCIGQQFAMSEMKVVTAMCLLHFEFSLDP 480

           DPEVFD LRFSTENASKRHPFAFMPFSAGPRNCIGQQFAMSEMKVVTAMCLL FEFSLDP

Sbjct: 421 DPEVFDSLRFSTENASKRHPFAFMPFSAGPRNCIGQQFAMSEMKVVTAMCLLRFEFSLDP 480

 

Query: 481 SRLPIKMLQLVLRSKNGIHLHLKPLGPGSGK 511

           SRLPIKM QLVLRSKNG HLHLKPLGPGSGK

Sbjct: 481 SRLPIKMPQLVLRSKNGFHLHLKPLGPGSGK 511

 

>CYP4V2 (S.Sarva)

MAGIWLGLVWQKLLLWGAASAVSLAGASLVLSLLQRVASYVRKWQQMRPIPTVARAYPLV 
GHALLMKRDGR (1)
EFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVE (0)
VILTSSKQIDKSSMYKFLEPWLGLGLLTS (2)
TGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKHVNQEAFNCFVYITLCALDIIC (1)
ETAMGKNIGAQSNDDSEYVRAVYR (2)
MSEMIFRRIKMPWLWLDLWYLMFKEGWEHKKSLKILHAFTNN (0)
VIAERANEMNVDEDCRGDGRDSAPSKNKRRAFLDLLLSVTDDEGNRLSHEDIREEVDTFMFE (0)
GHDTTAAAMNWSLYLLGSNPEVQKKVDHELDDVF (1)
GRTDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEV (1)
AGYRVLKGTEAV
IIPYALHRDPRYFPNPEEFRPERFFPENAQGRHPYAYVPFSAGPRNCI (1)
GQKFAVMEEKTILSCILRHFWIESNQKREELGLEGQLILRPTNGIWIKLKRRNADE 1551

 

Score = 1040 bits (2688),  Expect = 0.0

 Identities = 508/524 (96%), Positives = 517/524 (98%), Gaps = 1/524 (0%)

 Frame = +2

 

Query  1     MAGIWLGLVWQKLLLWGAASAVSLAGASLVLSLLQRVASYVRKWQQMRPIPTVARAYPLV  60

             MAG+WLGLVWQKLLLWGAASA+SLAGASLVLSLLQRVASY RKWQQMRPIPTVARAYPLV

Sbjct  305   MAGLWLGLVWQKLLLWGAASALSLAGASLVLSLLQRVASYARKWQQMRPIPTVARAYPLV  484

 

Query  61    GHALLMKRDGREFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVEVILTSSKQIDK  120

             GHALLMK DGREFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVEVILTSSKQIDK

Sbjct  485   GHALLMKPDGREFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVEVILTSSKQIDK  664

 

Query  121   SSMYKFLEPWLGLGLLTSTGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKH  180

             SSMYKFLEPWLGLGLLTSTGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKH

Sbjct  665   SSMYKFLEPWLGLGLLTSTGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKH  844

 

Query  181   VNQEAFNCFVYITLCALDIICETAMGKNIGAQSNDDSEYVRAVYRMSEMIFRRIKMPWLW  240

             +NQEAFNCF YITLCALDIICETAMGKNIGAQSNDDSEYVRAVYRMSEMIFRRIKMPWLW

Sbjct  845   INQEAFNCFFYITLCALDIICETAMGKNIGAQSNDDSEYVRAVYRMSEMIFRRIKMPWLW  1024

 

Query  241   LDLWYLMFKEGWEHKKSLKILHAFTNNVIAERANEMNVDEDCRGDGRDSAPSKNKRRAFL  300

             LDLWYLMFKEGWEHKKSLKILH FTN+VIAERANEMN +EDCRGDGR SAPSKNKRRAFL

Sbjct  1025  LDLWYLMFKEGWEHKKSLKILHTFTNSVIAERANEMNANEDCRGDGRGSAPSKNKRRAFL  1204

 

Query  301   DLLLSVTDDEGNRLSHEDIREEVDTFMFEGHDTTAAAMNWSLYLLGSNPEVQKKVDHELD  360

             DLLLSVTDDEGNRLSHEDIREEVDTFMFEGHDTTAAA+NWSLYLLGSNPEVQKKVDHELD

Sbjct  1205  DLLLSVTDDEGNRLSHEDIREEVDTFMFEGHDTTAAAINWSLYLLGSNPEVQKKVDHELD  1384

 

Query  361   DVFGRTDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEVAGYRVLKGTEAV  419

             DVFG++DRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEVAGYRVLKGTEAV

Sbjct  1385  DVFGKSDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEVAGYRVLKGTEAV  1564

 

Query  420   IIPYALHRDPRYFPNPEEFRPERFFPENAQGRHPYAYVPFSAGPRNCIGQKFAVMEEKTI  479

             IIPYALHRDPRYFPNPEEF+PERFFPENAQGRHPYAYVPFSAGPRNCIGQKFAVMEEKTI

Sbjct  1565  IIPYALHRDPRYFPNPEEFQPERFFPENAQGRHPYAYVPFSAGPRNCIGQKFAVMEEKTI  1744

 

Query  480   LSCILRHFWIESNQKREELGLEGQLILRPTNGIWIKLKRRNADE  523

             LSCILRHFWIESNQKREELGLEGQLILRP+NGIWIKLKRRNADE

Sbjct  1745  LSCILRHFWIESNQKREELGLEGQLILRPSNGIWIKLKRRNADE  1876

 

>CYP4X1 (Vasser) missing exon 1

FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFFYIYDPDYAKTFLSRT (1)

DPKSQYLQKFLPPLI (1)

GKGLLALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKTML (0)

DKWEKICSTQNTSVEVYEHINLMSLDIIMKCAFSKETNCQTN (2)

STHDPYVKAIFELGKIIFHRLYSFLYHSDIIFKLSPQGYRFQKLSRVLNQYT (1)

DAIIQERKKSLQAGEKQDNTQKRKYQDFLDIVLSAK (0)

DENGSSFSDTDVHSEVSMFLLGGHDSLAASISWILYCLALNPEHQERCREEVRGILGDGCSITW (2)

DQLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA*

 

>CYP4X1 R56515, R53456, AA652746, AC026935

          Length = 506
 

Score =  664 bits (1714), Expect = 0.0

 Identities = 323/343 (94%), Positives = 327/343 (95%)

 

Query: 1   FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFFYIYDPDYAKTFLSRTDPKSQYLQKFLPP 60

           FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFF IYDPDYAKT LSRTDPKSQYLQKF PP

Sbjct: 60  FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFFCIYDPDYAKTLLSRTDPKSQYLQKFSPP 119

 

Query: 61  LIGKGLLALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKTMLDKWEKICSTQNTSVE 120

           L+GKGL ALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVK MLDKWEKICSTQ+TSVE

Sbjct: 120 LLGKGLAALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKMMLDKWEKICSTQDTSVE 179

 

Query: 121 VYEHINLMSLDIIMKCAFSKETNCQTNSTHDPYVKAIFELGKIIFHRLYSFLYHSDIIFK 180

           VYEHIN MSLDIIMKCAFSKETNCQTNSTHDPY KAIFEL KIIFHRLYS LYHSDIIFK

Sbjct: 180 VYEHINSMSLDIIMKCAFSKETNCQTNSTHDPYAKAIFELSKIIFHRLYSLLYHSDIIFK 239

 

Query: 181 LSPQGYRFQKLSRVLNQYTDAIIQERKKSLQAGEKQDNTQKRKYQDFLDIVLSAKDENGS 240

           LSPQGYRFQKLSRVLNQYTD IIQERKKSLQAG KQDNT KRKYQDFLDIVLSAKDE+GS

Sbjct: 240 LSPQGYRFQKLSRVLNQYTDTIIQERKKSLQAGVKQDNTPKRKYQDFLDIVLSAKDESGS 299

 

Query: 241 SFSDTDVHSEVSMFLLGGHDSLAASISWILYCLALNPEHQERCREEVRGILGDGCSITWD 300

           SFSD DVHSEVS FLL GHD+LAASISWILYCLALNPEHQERCREEVRGILGDG SITWD

Sbjct: 300 SFSDIDVHSEVSTFLLAGHDTLAASISWILYCLALNPEHQERCREEVRGILGDGSSITWD 359

 

Query: 301 QLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA 343

           QLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA

Sbjct: 360 QLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA 402

 

>CYP5A1 (Z. Zhang, N.Adeltawab) partial

ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMTPLISQACDLLLAHLKRYAE
SGDAFDIQRCYCNYTTDVVASVAFGTPVDSQQAPEDPFVKHCKRFFEFCIPRPILVLLLS
FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP
VGVQDFDMVGDVFSSTRCKPNPSRQHQAGPMARPLTVDEIVGQAFIFLIAGYEIVTNTLS
FATYLLATNPDCQEKLLREVDLFKEKHMVPEFCSLEEGLPYLDMVIAETLRMYPPAF

 

>CYP5A1 NM_001061 this gene is 197000 bases long
          Length = 534
 
 Score =  576 bits (1484), Expect = e-167
 Identities = 285/297 (95%), Positives = 289/297 (97%)
 
Query: 1   ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMTPLISQACDLLLAHLKRYAE 60
           ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEM PLISQACDLLLAHLKRYAE
Sbjct: 113 ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMVPLISQACDLLLAHLKRYAE 172
 
Query: 61  SGDAFDIQRCYCNYTTDVVASVAFGTPVDSQQAPEDPFVKHCKRFFEFCIPRPILVLLLS 120
           SGDAFDIQRCYCNYTTDVVASV FGTPVDS QAPEDPFVKHCKRFFEFCIPRPILVLLLS
Sbjct: 173 SGDAFDIQRCYCNYTTDVVASVPFGTPVDSWQAPEDPFVKHCKRFFEFCIPRPILVLLLS 232
 
Query: 121 FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP 180
           FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP
Sbjct: 233 FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP 292
 
Query: 181 VGVQDFDMVGDVFSSTRCKPNPSRQHQAGPMARPLTVDEIVGQAFIFLIAGYEIVTNTLS 240
           +GVQDFD+V DVFSST CKPNPSRQHQ  PMARPLTVDEIVGQAFIFLIAGYEI+TNTLS
Sbjct: 293 MGVQDFDIVRDVFSSTGCKPNPSRQHQPSPMARPLTVDEIVGQAFIFLIAGYEIITNTLS 352
 
Query: 241 FATYLLATNPDCQEKLLREVDLFKEKHMVPEFCSLEEGLPYLDMVIAETLRMYPPAF 297
           FATYLLATNPDCQEKLLREVD+FKEKHM PEFCSLEEGLPYLDMVIAETLRMYPPAF
Sbjct: 353 FATYLLATNPDCQEKLLREVDVFKEKHMAPEFCSLEEGLPYLDMVIAETLRMYPPAF 409

 

>CYP7A1 (A.Bolen)

MMTTSLIWGIAIAACCCLWLILGIRRR (2)
QTGEPPLENGLIPYLGCALQFGANPLEFLRANQRKHGHVFTCKLMGKYVHFITNPLSY
HKVLCHGKYFDWKKFHFATSAK (0)
AFGHRSIDPMDGNTTENINDTFIKTLQGHALNSLTESMMENLQRIMRPPVSSN
SKTAAWVTEGMYSFCYRVMFEAGYLTIFGRDLTRRDTQKAHILNNLDNFKQFDKVFPA
LVAGLPIHMFRTAHNAREKLAESLRHENLQKRESISELISLRMFLNDTLSTFDDLEKA
KTHLVVLWASQANTIPATFWSLFQMIR (2)
NPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQAELNDLPVL (1)
DSIIKESLRLSSASLNIRTAKEDFTLHLEDGSYNIRKDSIIALYPQLMHLDPEIYPDPL (0)
TFKYDRYLDENGKTKTTFYCNGLKLKYYYMPFGSGATICPGRLFA

IHEIKQFLILMLSYFELELIEGQAKCPPLDQSRAGLGILPPLNDIEFKYKFKHL*

 

>CYP7A1 NM_000780
          Length = 504
 
 Score =  998 bits (2581), Expect = 0.0
 Identities = 485/504 (96%), Positives = 495/504 (98%)
 
Query: 1   MMTISLIWGIAIAACCCLWLILGIRRRQTGEPPLENGLIPYLGCALQFGANPLEFLRANQ 60
           MMT SLIWGIAIAACCCLWLILGIRRRQTGEPPLENGLIPYLGCALQFGANPLEFLRANQ
Sbjct: 1   MMTTSLIWGIAIAACCCLWLILGIRRRQTGEPPLENGLIPYLGCALQFGANPLEFLRANQ 60
 
Query: 61  RKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHFATSAKAFGHRSIDPKDGN 120
           RKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHFATSAKAFGHRSIDP DGN
Sbjct: 61  RKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHFATSAKAFGHRSIDPMDGN 120
 
Query: 121 TTENINNTFIKTLQGNALNSLTESMMENLQRIMRPPVFSNSKTAAWVTEGMYSFCYRVMF 180
           TTENIN+TFIKTLQG+ALNSLTESMMENLQRIMRPPV SNSKTAAWVTEGMYSFCYRVMF
Sbjct: 121 TTENINDTFIKTLQGHALNSLTESMMENLQRIMRPPVSSNSKTAAWVTEGMYSFCYRVMF 180
 
Query: 181 EAGYLTIFGRDLTRQDTQKAHILNNLDNFKQFDKVFPALVAGLPIHMFRTAHSAREKLAE 240
           EAGYLTIFGRDLTR+DTQKAHILNNLDNFKQFDKVFPALVAGLPIHMFRTAH+AREKLAE
Sbjct: 181 EAGYLTIFGRDLTRRDTQKAHILNNLDNFKQFDKVFPALVAGLPIHMFRTAHNAREKLAE 240
 
Query: 241 SLRHENLQKRESVSELIRLRMFLNDTLSTFDDLEKAKTHLVVLWASQANTIPATFWSLFQ 300
           SLRHENLQKRES+SELI LRMFLNDTLSTFDDLEKAKTHLVVLWASQANTIPATFWSLFQ
Sbjct: 241 SLRHENLQKRESISELISLRMFLNDTLSTFDDLEKAKTHLVVLWASQANTIPATFWSLFQ 300
 
Query: 301 MIRNPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQTQLNDLPVLESIIKESLRLSSAS 360
           MIRNPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQ +LNDLPVL SIIKESLRLSSAS
Sbjct: 301 MIRNPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQAELNDLPVLNSIIKESLRLSSAS 360
 
Query: 361 LNIRTAKEDFTLHLEDGSYNIRKDDIIALYPQLMHLDPEIYPDPLSFKYDRYLDENGKTK 420
           LNIRTAKEDFTLHLEDGSYNIRKD IIALYPQLMHLDPEIYPDPL+FKYDRYLDENGKTK
Sbjct: 361 LNIRTAKEDFTLHLEDGSYNIRKDSIIALYPQLMHLDPEIYPDPLTFKYDRYLDENGKTK 420
 
Query: 421 TTFYCNGLKLKYYYMPFGSGATICPGRVFAIHEIKQFLVLMLSYFELELVEGQDKCPPLD 480
           TTFYCNGLKLKYYYMPFGSGATICPGR+FAIHEIKQFL+LMLSYFELEL+EGQ KCPPLD
Sbjct: 421 TTFYCNGLKLKYYYMPFGSGATICPGRLFAIHEIKQFLILMLSYFELELIEGQAKCPPLD 480
 
Query: 481 QSRAGLGILPPLYDIEFKYKFKHL 504
           QSRAGLGILPPL DIEFKYKFKHL

Sbjct: 481 QSRAGLGILPPLNDIEFKYKFKHL 504

 

>CYP7B1 (G.Zhu) partial

RRPGEPPLIKGWLPYLGVVLKLRKDPLSFMKTLQKQHGDTFTVLLG

GKYITFILDPFQYQLVIKNHKQLSFRLFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN

LKQVFESQLLKTTSWDTAQLYPFCSSIIFEITFTTIYGKVLVCDNKFISELRDDFLKFDD
KFAYLVSNIPIELLGNVKSIRKKIIKCLSSENLAKMQGWSEVFQSRQDVLEKYYVHEDLE
IGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIHL
TREQLDSLICLESTIFEALRLSSYSTTIRFVEEDLTLSAQTGDYCVRKGDLGAIFPPILH
GDPEIFEAPDSKEFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALMEI
KQLLVILLTYFDLEIIDDKPIGLNYNRLLFGIQYPDSDVLFRYKVKS

 

Alignment of the two sequences, 95% identical

Query: 1   RRPGEPPLIKGWLPYLGVVLKLRKDPLSFMKTLQKQHGDTFTVLLGGKYITFILDPFQYQ 60
           RRPGEPPLIKGWLPYLGVVL LRKDPL FMKTLQKQHGDTFTVLLGGKYITFILDPFQYQ
Sbjct: 41  RRPGEPPLIKGWLPYLGVVLNLRKDPLRFMKTLQKQHGDTFTVLLGGKYITFILDPFQYQ 100
 
Query: 61  LVIKNHKQLSFRLFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN 120
           LVIKNHKQLSFR+FSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN
Sbjct: 101 LVIKNHKQLSFRVFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN 160
 
Query: 121 LKQVFESQLLKTTSWDTAQLYPFCSSIIFEITFTTIYGKVLVCDN-KFISELRDDFLKFD 179
           LKQVFE QLLKTTSWDTA+LYPFCSSIIFEITFTTIYGKV+VCDN KFISELRDDFLKFD
Sbjct: 161 LKQVFEPQLLKTTSWDTAELYPFCSSIIFEITFTTIYGKVIVCDNNKFISELRDDFLKFD 220
 
Query: 180 DKFAYLVSNIPIELLGNVKSIRKKIIKCLSSENLAKMQGWSEVFQSRQDVLEKYYVHEDL 239
           DKFAYLVSNIPIELLGNVKSIR+KIIKC SSE LAKMQGWSEVFQSRQDVLEKYYVHEDL
Sbjct: 221 DKFAYLVSNIPIELLGNVKSIREKIIKCFSSEKLAKMQGWSEVFQSRQDVLEKYYVHEDL 280
 
Query: 240 EIGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIH 299
           EIGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIH
Sbjct: 281 EIGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIH 340
 
Query: 300 LTREQLDSLICLESTIFEALRLSSYSTTIRFVEEDLTLSAQTGDYCVRKGDLGAIFPPIL 359
           LTREQLDSLICLES+IFEALRLSSYSTTIRFVEEDLTLS++TGDYCVRKGDL AIFPP+L
Sbjct: 341 LTREQLDSLICLESSIFEALRLSSYSTTIRFVEEDLTLSSETGDYCVRKGDLVAIFPPVL 400
 
Query: 360 HGDPEIFEAPDSKEFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALME 419
           HGDPEIFEAP+  EFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALME
Sbjct: 401 HGDPEIFEAPE--EFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALME 458
 
Query: 420 IKQLLVILLTYFDLEIIDDKPIGLNYNRLLFGIQYPDSDVLFRYKVKS 467
           IKQLLVILLTYFDLEIIDDKPIGLNY+RLLFGIQYPDSDVLFRYKVKS
Sbjct: 459 IKQLLVILLTYFDLEIIDDKPIGLNYSRLLFGIQYPDSDVLFRYKVKS 506
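
An identity figure like the one quoted for this alignment can be rechecked directly from the Query and Sbjct rows. The sketch below assumes the pairwise text layout used in this file (label, start position, aligned residues, end position) and counts identical columns over the full alignment length, gap columns included. The function name alignment_identity is hypothetical.

```python
import re

# Matches rows such as "Query: 420 IKQ... 467" or "Sbjct: 459 IKQ... 506".
# The unlabeled middle row of each block is ignored.
ROW = re.compile(r"^(Query|Sbjct):\s*\d+\s+([A-Za-z\-]+)\s+\d+\s*$")

def alignment_identity(blast_text: str) -> tuple:
    """Return (identical columns, alignment length) for one pairwise alignment."""
    seqs = {"Query": [], "Sbjct": []}
    for line in blast_text.splitlines():
        m = ROW.match(line.strip())
        if m:
            seqs[m.group(1)].append(m.group(2))
    query, sbjct = "".join(seqs["Query"]), "".join(seqs["Sbjct"])
    if len(query) != len(sbjct):
        raise ValueError("Query and Sbjct rows differ in length")
    # A column counts as identical only if both rows carry the same residue.
    same = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return same, len(query)

block = """
Query: 420 IKQLLVILLTYFDLEIIDDKPIGLNYNRLLFGIQYPDSDVLFRYKVKS 467
           IKQLLVILLTYFDLEIIDDKPIGLNY+RLLFGIQYPDSDVLFRYKVKS
Sbjct: 459 IKQLLVILLTYFDLEIIDDKPIGLNYSRLLFGIQYPDSDVLFRYKVKS 506
"""
same, length = alignment_identity(block)
print(f"{same}/{length} identical ({100 * same / length:.0f}%)")
```

Passing all blocks of one alignment at once gives the overall figure, since the rows are concatenated before counting.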

 

>CYP8A1 partial (Lin Zhu)

GDKDHMCSVKSRLWKLLSPARLATRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWATQ (0)

GNMGPAAFWLLLFLLKNPEALAAVRGELESILWEAEQPVSQMTTLPQKVLDGTPVL (1)

DSVLSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPE (0)

VFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGKSYAVNSIKQ (2)

FVFLVLVHLDLELINADVEIPEFDLSRYGFGLMQPEHDVPIRYRIRP

 

>CYP8A1 D83402

          Length = 500

 

Score =  553 bits (1424), Expect = e-160

 Identities = 270/276 (97%), Positives = 273/276 (98%)

 

Query: 1   GDKDHMCSVKSRLWKLLSPARLATRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWAT 60

           GDKDHMCSVKSRLWKLLSPARLA RAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWAT

Sbjct: 225 GDKDHMCSVKSRLWKLLSPARLARRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWAT 284

 

Query: 61  QGNMGPAAFWLLLFLLKNPEALAAVRGELESILWEAEQPVSQMTTLPQKVLDGTPVLDSV 120

           QGNMGPAAFWLLLFLLKNPEALAAVRGELESILW+AEQPVSQ TTLPQKVLD TPVLDSV

Sbjct: 285 QGNMGPAAFWLLLFLLKNPEALAAVRGELESILWQAEQPVSQTTTLPQKVLDSTPVLDSV 344

 

Query: 121 LSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPEVF 180

           LSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPEVF

Sbjct: 345 LSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPEVF 404

 

Query: 181 KYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGKSYAVNSIKQFVFLVLVHLDL 240

           KYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLG+SYAVNSIKQFVFLVLVHLDL

Sbjct: 405 KYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGRSYAVNSIKQFVFLVLVHLDL 464

 

Query: 241 ELINADVEIPEFDLSRYGFGLMQPEHDVPIRYRIRP 276

           ELINADVEIPEFDLSRYGFGLMQPEHDVP+RYRIRP

Sbjct: 465 ELINADVEIPEFDLSRYGFGLMQPEHDVPVRYRIRP 500

 

>CYP8B1 SCAFFOLD114862:8-613, SCAFFOLD39206:3-626 no introns

BB882888.1 Macaca fascicularis = lower case

MVLWGPVLGALLVVIAGYLCLPGMLRQRRPREPPLDKGTVPWLGYAMAFRKNMFEFLKRM

RSKHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH

EMIHSASTKHLRGDGLKDLNETMLDSLSFVMLKSKGWSLDASCWHEDSLFHFCYYILFTA

GYLSLFGYTKDKEQDLLQAGEL

40 aa gap

shsqxkegisnwlcnmlqflreqgvpsamqdkfnfmmlwasqgntgpts

FWALLFLLKHPEAIRAVRQETTQVLGEARLETKQSFAFKLSALQHTPVLDSVVEETLRLR

AAPTLLRLVHEDYTLKMASGQEYLFRRGDILALFPYLSVHVDPDIHPEPTIFKYDRFLNP

NGSRKVDFFKAGKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTP

LPHVDPQRWGFGTMQPSHDVRFRYRLRP

 

Query: 1   MVLWGPVLGALLVVIAGYLCLPGMLRQRRPREPPLDKGTVPWLGYAMAFRKNMFEFLKRM 60

           MVLWGPVLGALLVVIAGYLCLPGMLRQRRP EPPLDKGTVPWLG+AMAFRKNMFEFLKRM

Sbjct: 1   MVLWGPVLGALLVVIAGYLCLPGMLRQRRPWEPPLDKGTVPWLGHAMAFRKNMFEFLKRM 60

 

Query: 61  RSKHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH 120

           R+KHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH

Sbjct: 61  RTKHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH 120

 

Query: 121 EMIHSASTKHLRGDGLKDLNETMLDSLSFVMLKSKGWSLDASCWHEDSLFHFCYYILFTA 180

           EMIHSASTKHLRGDGLKDLNETMLDSLSFVML SKGWSLDASCWHEDSLF FCYYILFTA

Sbjct: 121 EMIHSASTKHLRGDGLKDLNETMLDSLSFVMLTSKGWSLDASCWHEDSLFRFCYYILFTA 180

 

Query: 181 GYLSLFGYTKDKEQDLLQAGEL-------------------------------------- 202

           GYLSLFGYTKDKEQDLLQAGEL                                      

Sbjct: 181 GYLSLFGYTKDKEQDLLQAGELFMEFRKFDLLFPRFVYSLLWPREWLEVGRLQHLFHKML 240

 

Query: 203 --SHSQXKEGISNWLCNMLQFLREQGVPSAMQDKFNFMMLWASQGNTGPTSFWALLFLLK 260

             SHSQ KEGISNWL NMLQFLREQGVPSAMQDKFNFMMLWASQGNTGPTSFWALL+LLK

Sbjct: 241 SVSHSQEKEGISNWLGNMLQFLREQGVPSAMQDKFNFMMLWASQGNTGPTSFWALLYLLK 300

 

Query: 261 HPEAIRAVRQETTQVLGEARLETKQSFAFKLSALQHTPVLDSVVEETLRLRAAPTLLRLV 320

           HPEAIRAVR+E TQVLGEARLETKQSFAFKL ALQHTPVLDSVVEETLRLRAAPTLLRLV

Sbjct: 301 HPEAIRAVREEATQVLGEARLETKQSFAFKLGALQHTPVLDSVVEETLRLRAAPTLLRLV 360

 

Query: 321 HEDYTLKMASGQEYLFRRGDILALFPYLSVHVDPDIHPEPTIFKYDRFLNPNGSRKVDFF 380

           HEDYTLKM+SGQEYLFR GDILALFPYLSVH+DPDIHPEPT+FKYDRFLNPNGSRKVDFF

Sbjct: 361 HEDYTLKMSSGQEYLFRHGDILALFPYLSVHMDPDIHPEPTVFKYDRFLNPNGSRKVDFF 420

 

Query: 381 KAGKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTPLPHVDPQRW 440

           K GKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTPLPHVDPQRW

Sbjct: 421 KTGKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTPLPHVDPQRW 480

 

Query: 441 GFGTMQPSHDVRFRYRLRP 459

           GFGTMQPSHDVRFRYRL P

Sbjct: 481 GFGTMQPSHDVRFRYRLHP 499

 

>CYP11A1 N-term = DQ228169.1 Macaca fascicularis = lower case (Mahrous)

mlakglpprsvlvkgcqtflsapkerlghlrvptsegagistrs

prpfneipspgdngwlnlyhfwretgthkvhlhhvqnfqkydpiy

REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLK (2)

KSAAWKKDRVALNQEVMAPETTKNFLPLLDAVSRDFVSVLHRRIKKAGSGNFSGDISDDLFRFAFE (1)

SITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAAWDVIFSK (1)

ADMYTENFHWELRQKGNVHHDYRGILYRLLGDSKMSFEDIKANVTEMLAGGVDT (0)

TSMTLQWHLYEMARNLKVQDMLRAEVLAARRQAQGDMATMLQLVPLLKASIKETLR (2)

LHPISVTLQRYLVNDLVLRGYMIPAK (0)

TLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITYFRNLGFGWGVRQCLGRRIAELEMTIFLIN (0)

MLENFRVEIQHLSDVGTTFNLILMPEKPISFTFWPFNQEATQ

 

>CYP11A1 NM_000781

          Length = 521

 

 Score =  857 bits (2214), Expect = 0.0

 Identities = 422/431 (97%), Positives = 428/431 (99%)

 

Query: 1   REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKK 60

           REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKK

Sbjct: 90  REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKK 149

 

Query: 61  DRVALNQEVMAPETTKNFLPLLDAVSRDFVSVLHRRIKKAGSGNFSGDISDDLFRFAFES 120

           DRVALNQEVMAPE TKNFLPLLDAVSRDFVSVLHRRIKKAGSGN+SGDISDDLFRFAFES

Sbjct: 150 DRVALNQEVMAPEATKNFLPLLDAVSRDFVSVLHRRIKKAGSGNYSGDISDDLFRFAFES 209

 

Query: 121 ITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAA 180

           ITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAA

Sbjct: 210 ITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAA 269

 

Query: 181 WDVIFSKADMYTENFHWELRQKGNVHHDYRGILYRLLGDSKMSFEDIKANVTEMLAGGVD 240

           WDVIFSKAD+YT+NF+WELRQKG+VHHDYRG+LYRLLGDSKMSFEDIKANVTEMLAGGVD

Sbjct: 270 WDVIFSKADIYTQNFYWELRQKGSVHHDYRGMLYRLLGDSKMSFEDIKANVTEMLAGGVD 329

 

Query: 241 TTSMTLQWHLYEMARNLKVQDMLRAEVLAARRQAQGDMATMLQLVPLLKASIKETLRLHP 300

           TTSMTLQWHLYEMARNLKVQDMLRAEVLAAR QAQGDMATMLQLVPLLKASIKETLRLHP

Sbjct: 330 TTSMTLQWHLYEMARNLKVQDMLRAEVLAARHQAQGDMATMLQLVPLLKASIKETLRLHP 389

 

Query: 301 ISVTLQRYLVNDLVLRGYMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITY 360

           ISVTLQRYLVNDLVLR YMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITY

Sbjct: 390 ISVTLQRYLVNDLVLRDYMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITY 449

 

Query: 361 FRNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPISF 420

           FRNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPISF

Sbjct: 450 FRNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPISF 509

 

Query: 421 TFWPFNQEATQ 431

           TFWPFNQEATQ

Sbjct: 510 TFWPFNQEATQ 520

 

>CYP11B1 (Lin Zhu, S.Hill)

MALRAKAEVCMAAPWLSLQRARALGTRATRVPRTVLPFEAMPRRPGNRWLRLLQIWREQG
YEHLHLEVHQTFQELGPIFR (2)
YDLGGAGMVCVMLPEDVEKLQQVDSLNPRRMSLEPWVAYRQHRGHKCGVFLL (2)
NGPEWRFNRLRLNPDVLSPRAVQRFLPMVDAVARDFSQALRKKVLQNARGSLTLDVQPSIFHYTIE (1)
ASNLALFGERLGLVGHSPSSASLSFLHALEVMFKSTVQLMFMPRSLSRWTSPKVWKEHFEAWDCIFQY (1)
GDNCIQKIYQELALSRPQQYTSIVAELLLNAELSPDAIKANSMELTAGSVDT (0) 
TVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATTELPLLRAALKETLR (2)
LYPVGLFLERVVSSDLVLQNYHIPAG (0)
TLVRVFLYSLGRNPALFPRPERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHH (0)
VLKHLQVETLTQEDIKMVYSFILRPSTFPLLTFRAIN

 

Score =  984 bits (2543), Expect = 0.0
 Identities = 489/503 (97%), Positives = 494/503 (98%)
 
Query: 1   MALRAKAEVCMAAPWLSLQRARALGTRATRVPRTVLPFEAMPRRPGNRWLRLLQIWREQG 60
           MALRAKAEVCMA PWLSLQRA+ALGTRA RVPRTVLPFEAMPRRPGNRWLRLLQIWREQG
Sbjct: 1   MALRAKAEVCMAVPWLSLQRAQALGTRAARVPRTVLPFEAMPRRPGNRWLRLLQIWREQG 60
 
Query: 61  YEHLHLEVHQTFQELGPIFRYDLGGAGMVCVMLPEDVEKLQQVDSLNPRRMSLEPWVAYR 120
           YE LHLEVHQTFQELGPIFRYDLGGAGMVCVMLPEDVEKLQQVDSL+P RMSLEPWVAYR
Sbjct: 61  YEDLHLEVHQTFQELGPIFRYDLGGAGMVCVMLPEDVEKLQQVDSLHPHRMSLEPWVAYR 120
 
Query: 121 QHRGHKCGVFLLNGPEWRFNRLRLNPDVLSPRAVQRFLPMVDAVARDFSQALRKKVLQNA 180
           QHRGHKCGVFLLNGPEWRFNRLRLNP+VLSP AVQRFLPMVDAVARDFSQAL+KKVLQNA
Sbjct: 121 QHRGHKCGVFLLNGPEWRFNRLRLNPEVLSPNAVQRFLPMVDAVARDFSQALKKKVLQNA 180
 
Query: 181 RGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSSASLSFLHALEVMFKSTVQLMFM 240
           RGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSSASL+FLHALEVMFKSTVQLMFM
Sbjct: 181 RGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSSASLNFLHALEVMFKSTVQLMFM 240
 
Query: 241 PRSLSRWTSPKVWKEHFEAWDCIFQYGDNCIQKIYQELALSRPQQYTSIVAELLLNAELS 300
           PRSLSRWTSPKVWKEHFEAWDCIFQYGDNCIQKIYQELA SRPQQYTSIVAELLLNAELS
Sbjct: 241 PRSLSRWTSPKVWKEHFEAWDCIFQYGDNCIQKIYQELAFSRPQQYTSIVAELLLNAELS 300
 
Query: 301 PDAIKANSMELTAGSVDTTVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATT 360
           PDAIKANSMELTAGSVDTTVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATT
Sbjct: 301 PDAIKANSMELTAGSVDTTVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATT 360
 
Query: 361 ELPLLRAALKETLRLYPVGLFLERVVSSDLVLQNYHIPAGTLVRVFLYSLGRNPALFPRP 420
           ELPLLRAALKETLRLYPVGLFLERV SSDLVLQNYHIPAGTLVRVFLYSLGRNPALFPRP
Sbjct: 361 ELPLLRAALKETLRLYPVGLFLERVASSDLVLQNYHIPAGTLVRVFLYSLGRNPALFPRP 420
 
Query: 421 ERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHHVLKHLQVETLTQED 480
           ERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHHVLKHLQVETLTQED
Sbjct: 421 ERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHHVLKHLQVETLTQED 480
 
Query: 481 IKMVYSFILRPSTFPLLTFRAIN 503
           IKMVYSFILRPS  PLLTFRAIN
Sbjct: 481 IKMVYSFILRPSMCPLLTFRAIN 503
 

 

>CYP17 AY746983.1 and AF458332.1

MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLP

RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQVTTL

DILSNNRKGIAFADYGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH

NGQTIDISFPVFVAITNVISLICFNISYKNGDPELKIVHNYNEGIIDSLGKESLVDLF

PWLKVFPNKTLEKLKRHVKTRNDLLTKIFENYKEKFHSDSITNMLDVLMQAKMNSDNG

NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWIVAFLLHNPQVKKKLYEEIDQ

NVGFSRTPTISDRNRLLLLEATIREVLRIRPVAPMLIPHKANVDSSIGEFAVDKGTHV

IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSLSYLPFGAGPRSCIGEILARQ

ELFLIMAWLLQRFDLEVPDDGQLPSLEGNPKVVFLIDSFKVKIKVRQAWREAQAEGST

 

>CYP17 (S. Hill) partial

MWELVALLLFTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLPRHGHMHNNFFKLQKKY
GPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQXTTLDILSNNRKGIAFADYGAHW
QLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATHNGQTIDISFPVFVAITNVISL
ICFNISYKNGDPELKIVHNYNEGIIDSLGKESLVDLFPWLK

 

>CYP19 (Iyer)
MVLEMLNPMHYNITSMVPEAMPAATMPILLLTGLFLLVWNYEGTSSIP (1)
GPGYCMGIGPLISHGRFLWMGIGSACNYYNQVYGEFMRVWISGEETLIISK (2)
SSSMFHIMKHNHYSSRFGSKLGLQCIGMHEKGIIFNNNPDLWKTTRPFFMK (1)
ALSGPGLVRMVTVCAESLKTHLDRLEEVTNESGYVDVLTLLRRVMLDTSNMLFLRIPLD (1)
ESAIVVKIQGYFDAWQALLIKPDIFFKISWLYKKYEKSV (2)
KDLKDAIEVLIAEKRRRISTEEKLEECMDFATELILAE (0)
KRGDLTRENVNQCILEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIMKEIQTVV (1)
GERDVKIDDMQKLKVMENFIYESMRYQPVVDLVMRKALEDDVIDGYPVKKG
TNIILNIGRMHRLEFFPKPNEFTLENFAKN (0)
VPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVERIQ
KIHDLSSHPDETKNMLEMIFTPRNSDRCLEH
 
>CYP19 NM_000103
          Length = 503
 
 Score =  997 bits (2578), Expect = 0.0
 Identities = 490/503 (97%), Positives = 499/503 (99%)
 
Query: 1   MVLEMLNPMHYNITSMVPEAMPAATMPILLLTGLFLLVWNYEGTSSIPGPGYCMGIGPLI 60
           MVLEMLNP+HYNITS+VPEAMPAATMP+LLLTGLFLLVWNYEGTSSIPGPGYCMGIGPLI
Sbjct: 1   MVLEMLNPIHYNITSIVPEAMPAATMPVLLLTGLFLLVWNYEGTSSIPGPGYCMGIGPLI 60
 
Query: 61  SHGRFLWMGIGSACNYYNQVYGEFMRVWISGEETLIISKSSSMFHIMKHNHYSSRFGSKL 120
           SHGRFLWMGIGSACNYYN+VYGEFMRVWISGEETLIISKSSSMFHIMKHNHYSSRFGSKL
Sbjct: 61  SHGRFLWMGIGSACNYYNRVYGEFMRVWISGEETLIISKSSSMFHIMKHNHYSSRFGSKL 120
 
Query: 121 GLQCIGMHEKGIIFNNNPDLWKTTRPFFMKALSGPGLVRMVTVCAESLKTHLDRLEEVTN 180
           GLQCIGMHEKGIIFNNNP+LWKTTRPFFMKALSGPGLVRMVTVCAESLKTHLDRLEEVTN
Sbjct: 121 GLQCIGMHEKGIIFNNNPELWKTTRPFFMKALSGPGLVRMVTVCAESLKTHLDRLEEVTN 180
 
Query: 181 ESGYVDVLTLLRRVMLDTSNMLFLRIPLDESAIVVKIQGYFDAWQALLIKPDIFFKISWL 240
           ESGYVDVLTLLRRVMLDTSN LFLRIPLDESAIVVKIQGYFDAWQALLIKPDIFFKISWL
Sbjct: 181 ESGYVDVLTLLRRVMLDTSNTLFLRIPLDESAIVVKIQGYFDAWQALLIKPDIFFKISWL 240
 
Query: 241 YKKYEKSVKDLKDAIEVLIAEKRRRISTEEKLEECMDFATELILAEKRGDLTRENVNQCI 300
           YKKYEKSVKDLKDAIEVLIAEKR RISTEEKLEECMDFATELILAEKRGDLTRENVNQCI
Sbjct: 241 YKKYEKSVKDLKDAIEVLIAEKRCRISTEEKLEECMDFATELILAEKRGDLTRENVNQCI 300
 
Query: 301 LEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIMKEIQTVVGERDVKIDDMQKLKVMENFI 360
           LEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAI+KEIQTV+GERD+KIDD+QKLKVMENFI
Sbjct: 301 LEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIIKEIQTVIGERDIKIDDIQKLKVMENFI 360
 
Query: 361 YESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENFAK 420
           YESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENFAK
Sbjct: 361 YESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENFAK 420
 
Query: 421 NVPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVERIQKIHDLSSH 480
           NVPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVE IQKIHDLS H
Sbjct: 421 NVPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVESIQKIHDLSLH 480
 
Query: 481 PDETKNMLEMIFTPRNSDRCLEH 503
           PDETKNMLEMIFTPRNSDRCLEH
Sbjct: 481 PDETKNMLEMIFTPRNSDRCLEH 503
 

>CYP21 (Blackwell) partial

LVSKNYPDLSLGDYSLLWKAHKKLTRSALLLGMRDSMEPVVEQLTQEFCERMRAQAGTPV
AIEEEFSLLTCSIICHLTFGDKIKDNLVPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFFP
NPGLRRLKQAIEKRDHIVEKQLRQHKESLVAGQWRDMMDYMLQVVAQPSMEEGSGQLLEG
HVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPSASSSRVPYKDR
ARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDEMVWE
RPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPPGDALP
SLQPLPHCSVILKMQPFQVWLQPRGLGVHSLGQSQ

 

Query: 1   LVSKNYPDLSLGDYSLLWKAHKKLTRSALLLGMRDSMEPVVEQLTQEFCERMRAQAGTPV 60

           LVS+NYPDLSLGDYSLLWKAHKKLTRSALLLG+RDSMEPVVEQLTQEFCERMRAQ GTPV

Sbjct: 100 LVSRNYPDLSLGDYSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCERMRAQPGTPV 159

 

Query: 61  AIEEEFSLLTCSIICHLTFGDKIKD-NLVPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFF 119

           AIEEEFSLLTCSIIC+LTFGDKIKD NL+PAYYKCIQEVLKTWSHWSIQIVDVIPFLRFF

Sbjct: 160 AIEEEFSLLTCSIICYLTFGDKIKDDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFF 219

 

Query: 120 PNPGLRRLKQAIEKRDHIVEKQLRQHKESLVAGQWRDMMDYMLQVVAQPSMEEGSGQLLE 179

           PNPGLRRLKQAIEKRDHIVE QLRQHKESLVAGQWRDMMDYMLQ VAQPSMEEGSGQLLE

Sbjct: 220 PNPGLRRLKQAIEKRDHIVEMQLRQHKESLVAGQWRDMMDYMLQGVAQPSMEEGSGQLLE 279

 

Query: 180 GHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPSASSSRVPYKD 239

           GHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGP ASSSRVPYKD

Sbjct: 280 GHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPGASSSRVPYKD 339

 

Query: 240 RARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDEMVW 299

           RARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDE VW

Sbjct: 340 RARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDETVW 399

 

Query: 300 ERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPPGDAL 359

           ERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLP GDAL

Sbjct: 400 ERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPSGDAL 459

 

Query: 360 PSLQPLPHCSVILKMQPFQVWLQPRGLGVHSLGQSQ 395

           PSLQPLPHCSVILKMQPFQV LQPRG+G HS GQ+Q

Sbjct: 460 PSLQPLPHCSVILKMQPFQVRLQPRGMGAHSPGQNQ 495

 

 

>CYP24 (S.Jain) partial

MSSPISKSRSLAAFLQQLRSPRQPPRPVTSTAYTSPQPREVPVCPLTAGGETQNAAALPG

PTSWPLLGSLLQILWKGGLKKQHDTL(0)
VEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR

TESAYPQRLEIKPWKAYRDYRKEGYGLLIL

 

>CYP24 NM_000782

          Length = 513

 

 Score =  295 bits (754), Expect = 6e-83

 Identities = 146/150 (97%), Positives = 146/150 (97%), Gaps = 1/150 (0%)

 

Query: 1   MSSPISKSRSLAAFLQQLRSPRQPPRPVTSTAYTSPQPREVPVCPLTAGGETQNAAALPG 60

           MSSPISKSRSLAAFLQQLRSPRQPPR VTSTAYTSPQPREVPVCPLTAGGETQNAAALPG

Sbjct: 1   MSSPISKSRSLAAFLQQLRSPRQPPRLVTSTAYTSPQPREVPVCPLTAGGETQNAAALPG 60

 

Query: 61  PTSWPLLGSLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR 120

           PTSWPLL SLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR

Sbjct: 61  PTSWPLLASLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR 120

 

Query: 121 TESAYPQRLEIKPWKAYRDYRKEGYGLLIL 150

           TES  PQRLEIKPWKAYRDYRKEGYGLLIL

Sbjct: 121 TESV-PQRLEIKPWKAYRDYRKEGYGLLIL 149

 

>CYP26A1 (Liao, Iyer)

MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM
VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGEHRLVSVHWPASVRTIL
GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE
VKRLMFRIAMRILLGCEPQLAGDGDAEQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR
NLIHARIEQNIRAKICGLRASEAGRGCKDALQLLIEHSWERGERLDMQALKQSSTELLFG
GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI
KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM
LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV
YPVDNLPARFTHFHGEI

 

>CYP26A1 NM_000783

          Length = 497

 

 Score = 1003 bits (2594), Expect = 0.0

 Identities = 493/497 (99%), Positives = 496/497 (99%)

 

Query: 1   MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM 60

           MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM

Sbjct: 1   MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM 60

 

Query: 61  VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGEHRLVSVHWPASVRTIL 120

           VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLG+ RLVSVHWPASVRTIL

Sbjct: 61  VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGDDRLVSVHWPASVRTIL 120

 

Query: 121 GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE 180

           GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE

Sbjct: 121 GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE 180

 

Query: 181 VKRLMFRIAMRILLGCEPQLAGDGDAEQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR 240

           VKRLMFRIAMRILLGCEPQLAGDGD+EQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR

Sbjct: 181 VKRLMFRIAMRILLGCEPQLAGDGDSEQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR 240

 

Query: 241 NLIHARIEQNIRAKICGLRASEAGRGCKDALQLLIEHSWERGERLDMQALKQSSTELLFG 300

           NLIHARIEQNIRAKICGLRASEAG+GCKDALQLLIEHSWERGERLDMQALKQSSTELLFG

Sbjct: 241 NLIHARIEQNIRAKICGLRASEAGQGCKDALQLLIEHSWERGERLDMQALKQSSTELLFG 300

 

Query: 301 GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI 360

           GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI

Sbjct: 301 GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI 360

 

Query: 361 KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM 420

           KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM

Sbjct: 361 KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM 420

 

Query: 421 LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV 480

           LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV

Sbjct: 421 LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV 480

 

Query: 481 YPVDNLPARFTHFHGEI 497

           YPVDNLPARFTHFHGEI

Sbjct: 481 YPVDNLPARFTHFHGEI 497

 

>CYP26B1 (S.Jain, Penmatsa) partial
VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIP

EEDLGHLFEVYQQFVENVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDY

SDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLR

EELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDG

FQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLG

KHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNKILPE

TEAMLSATV

 

>CYP26B1 AC007002

          Length = 512

 

 Score =  727 bits (1876), Expect = 0.0

 Identities = 365/369 (98%), Positives = 368/369 (99%)

 

Query: 1   VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIP 60

           VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIP

Sbjct: 144 VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIP 203

 

Query: 61  EEDLGHLFEVYQQFVENVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDY 120

           EEDLGHLFEVYQQFV+NVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDY

Sbjct: 204 EEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDY 263

 

Query: 121 SDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLR 180

            DALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLR

Sbjct: 264 LDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLR 323

 

Query: 181 EELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDG 240

           +ELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDG

Sbjct: 324 DELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDG 383

 

Query: 241 FQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLG 300

           FQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLG

Sbjct: 384 FQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLG 443

 

Query: 301 KHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNKILPE 360

           KHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQN+ILPE

Sbjct: 444 KHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPE 503

 

Query: 361 TEAMLSATV 369

           TEAMLSATV

Sbjct: 504 TEAMLSATV 512

 

>CYP26C1 (C.Blackwell) missing an exon

MFPWGLSCLSVLGAAGTAVLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFGETLHWLV(0)

QGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQSAHILLGSHTLLGAV

GEPHRRRRK (0)

VLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVSVYDAAKALTFRMAARILLGLRLDEAQCATL

ARTFEQLVENLFSLPLDVPFSGLRK (0)

SAVELLFAAFFTTASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGCVGPPPDCG
CEPDLSLAALGSLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD(0)

GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGAAREDSRGASSRFHYIPFGGGARSCLGQEL

AQTVLQLLAVELVRTARWELATPAFPALQTVPIVHPVDGLRLFFHPLAPLVAGDGLCL

 

Query: 1   MFPWGLSCLSVLGAAGTAVLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFG 60

           MFPWGLSCLSVLGAAGTA+LCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFG

Sbjct: 1   MFPWGLSCLSVLGAAGTALLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFG 60

 

Query: 61  ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQS 120

           ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQS

Sbjct: 61  ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQS 120

 

Query: 121 AHILLGSHTLLGAVGEPHRRRRKVLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVS 180

           AHILLGSHTLLGAVGEPHRRRRKVLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVS

Sbjct: 121 AHILLGSHTLLGAVGEPHRRRRKVLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVS 180

 

Query: 181 VYDAAKALTFRMAARILLGLRLDEAQCATLARTFEQLVENLFSLPLDVPFSGLRK----- 235

           VYDA+KALTFRMAARILLGLRLDEAQCATLARTFEQLVENLFSLPLDVPFSGLRK    

Sbjct: 181 VYDASKALTFRMAARILLGLRLDEAQCATLARTFEQLVENLFSLPLDVPFSGLRKGIRAR 240

 

Query: 236 ------------------------------------------------SAVELLFAAFFT 247

                                                           SAVELLFAAFFT

Sbjct: 241 DQLHRHLEGAISEKLHEDKAAEPGDALDLIIHSARELGHEPSMQELKESAVELLFAAFFT 300

 

Query: 248 TASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGCVGPPPDCGCEPDLSL 307

           TASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGG  GPPPDCGCEPDLSL

Sbjct: 301 TASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGSEGPPPDCGCEPDLSL 360

 

Query: 308 AALGSLRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYSIRDTHETAAV 367

           AALG LRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYSIRDTHETAAV

Sbjct: 361 AALGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYSIRDTHETAAV 420

 

Query: 368 YRSPPEGFDPERFGAAREDSRGASSRFHYIPFGGGARSCLGQELAQTVLQLLAVELVRTA 427

           YRSPPEGFDPERFGAAREDSRGASSR HYIPFGGGARSCLGQELAQ VLQLLAVELVRTA

Sbjct: 421 YRSPPEGFDPERFGAAREDSRGASSRLHYIPFGGGARSCLGQELAQAVLQLLAVELVRTA 480

 

Query: 428 RWELATPAFPALQTVPIVHPVDGLRLFFHPLAPLVAGDGLCL 469

           RWELATPAFPA+QTVPIVHPVDGLRLFFHPL P VAG+GLCL

Sbjct: 481 RWELATPAFPAMQTVPIVHPVDGLRLFFHPLTPSVAGNGLCL 522

 

>CYP27A1 (Q.Tran)

QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDLHDLTYGPFTT(2)

EGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMIRLDQLRAESASGNQVSDTAQLFYYFALE (1)

AICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWNAIFSF (1)

GKKLIDEKLEDMEAQLQAEGPDGVQVSGYLHFLLASGQLSPREAMGSLPELLMAGVDT (0)

TSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHLPLLKAVLKETLR (2)

LYPVVPTNSRIIEKEIEVDGFLFPKN (0)

TQFVFCHYVVSRDPTTFSEPESFQPHRWLRSSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLAR (0)

LIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQC

 

>CYP27A1 NM_000784
          Length = 531
 
 Score =  897 bits (2319), Expect = 0.0
 Identities = 439/447 (98%), Positives = 442/447 (98%)
 
Query: 1   QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDLHDLTY 60
           QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRD HDLTY
Sbjct: 85  QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDQHDLTY 144
 
Query: 61  GPFTTEGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMIRLDQLRAESASGNQVSD 120
           GPFTTEGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFM RLDQLRAESASGNQVSD
Sbjct: 145 GPFTTEGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMTRLDQLRAESASGNQVSD 204
 
Query: 121 TAQLFYYFALEAICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPV 180
            AQLFYYFALEAICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPV
Sbjct: 205 MAQLFYYFALEAICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPV 264
 
Query: 181 LPFWKRYLDGWNAIFSFGKKLIDEKLEDMEAQLQAEGPDGVQVSGYLHFLLASGQLSPRE 240
           LPFWKRYLDGWNAIFSFGKKLIDEKLEDMEAQLQA GPDG+QVSGYLHFLLASGQLSPRE
Sbjct: 265 LPFWKRYLDGWNAIFSFGKKLIDEKLEDMEAQLQAAGPDGIQVSGYLHFLLASGQLSPRE 324
 
Query: 241 AMGSLPELLMAGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHLP 300
           AMGSLPELLMAGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAH+P
Sbjct: 325 AMGSLPELLMAGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHMP 384
 
Query: 301 LLKAVLKETLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTTFSEPESF 360
           LLKAVLKETLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPT FSEPESF
Sbjct: 385 LLKAVLKETLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTAFSEPESF 444
 
Query: 361 QPHRWLRSSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPE 420
           QPHRWLR+SQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPE
Sbjct: 445 QPHRWLRNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPE 504
 
Query: 421 TGELKSVARIVLVPNKKVGLQFLQRQC 447
           TGELKSVARIVLVPNKKVGLQFLQRQC
Sbjct: 505 TGELKSVARIVLVPNKKVGLQFLQRQC 531

 

>CYP27B1 (Xin Liu)
MTQTLKYASRVFHRVRWAPELGASLGYREYDSARRSLADIPGPSTPSFLAELFCKGGLSR

LHELQVQGAARFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRRR

QRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAETLDNVVRDLVRRLRCQRGRGTGPP

ALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPH

WLRRLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNRGQPDEDLESGAHLTHFLFQE

ELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALGPGSSAHPP

ATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPA

QFPEPNSFHPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILIHFEVQP

EPGAAPIRPMTRTVLVPERSINLQFLDR
 
>CYP27B1 NM_000785
          Length = 508
 
 Score =  985 bits (2547), Expect = 0.0
 Identities = 489/508 (96%), Positives = 495/508 (97%)
 
Query: 1   MTQTLKYASRVFHRVRWAPELGASLGYREYDSARRSLADIPGPSTPSFLAELFCKGGLSR 60
           MTQTLKYASRVFHRVRWAPELGASLGYREY SARRSLADIPGPSTPSFLAELFCKGGLSR
Sbjct: 1   MTQTLKYASRVFHRVRWAPELGASLGYREYHSARRSLADIPGPSTPSFLAELFCKGGLSR 60
 
Query: 61  LHELQVQGAARFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRRR 120
           LHELQVQGAA FGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRR R
Sbjct: 61  LHELQVQGAAHFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRCR 120
 
Query: 121 QRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAETLDNVVRDLVRRLRCQRGRGTGPP 180
           QRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYA TL+NVV DLVRRLR QRGRGTGPP
Sbjct: 121 QRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLNNVVCDLVRRLRRQRGRGTGPP 180
 
Query: 181 ALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPH 240
           ALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPH
Sbjct: 181 ALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPH 240
 
Query: 241 WLRRLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNRGQPDEDLESGAHLTHFLFQE 300
           WLR LVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRN GQP++DLESGAHLTHFLF+E
Sbjct: 241 WLRHLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNGGQPEKDLESGAHLTHFLFRE 300
 
Query: 301 ELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALGPGSSAHPP 360
           ELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAAL PGSSA+P 
Sbjct: 301 ELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALSPGSSAYPS 360
 
Query: 361 ATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPA 420
           ATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPA
Sbjct: 361 ATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPA 420
 
Query: 421 QFPEPNSFHPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILIHFEVQP 480
           QFPEPNSF PARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQIL HFEVQP
Sbjct: 421 QFPEPNSFRPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILTHFEVQP 480
 
Query: 481 EPGAAPIRPMTRTVLVPERSINLQFLDR 508
           EPGAAP+RP TRTVLVPERSINLQFLDR
Sbjct: 481 EPGAAPVRPKTRTVLVPERSINLQFLDR 508

 

>CYP27C1 (F.Zhang) partial
MQTSAMALLARILRAGLRPAPERGGLLGGAAPRRPQPAGARLPAGARAEDKGAGRPGAPP

AGGRAEGPRLAAMPGPRTLANLAEFFYRDGFSRIHEIQQKHT

QEYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA

EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLF

FKYSMEGVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIP

KPWREFCRSWDGLFKF
 
>CYP27C1 AC027142 43% identical to 27A1 assembled gene
          Length = 542
 
 Score =  586 bits (1511), Expect = e-170
 Identities = 293/301 (97%), Positives = 296/301 (98%), Gaps = 1/301 (0%)
 
Query: 1   MQTSAMALLARILRAGLRPAPERGGLLGGAAPRRPQPAGARLPAGARAEDKGAGRPGAPP 60
           MQTSAMALLARILRAGLRPAPERGGLLGG APRRPQPAGARLPAGARAEDKGAGRPG+PP
Sbjct: 1   MQTSAMALLARILRAGLRPAPERGGLLGGGAPRRPQPAGARLPAGARAEDKGAGRPGSPP 60
 
Query: 61  AGGRAEGPR-LAAMPGPRTLANLAEFFYRDGFSRIHEIQQKHTQEYGKIFKSHFGPQFVV 119
            GGRAEGPR LAAMPGPRTLANLAEFF RDGFSRIHEIQQKHT+EYGKIFKSHFGPQFVV
Sbjct: 61  GGGRAEGPRSLAAMPGPRTLANLAEFFCRDGFSRIHEIQQKHTREYGKIFKSHFGPQFVV 120
 
Query: 120 SIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISAEGEQWLKMRSVLRQRIL 179
           SIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISAEGEQWLKMRSVLRQRIL
Sbjct: 121 SIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISAEGEQWLKMRSVLRQRIL 180
 
Query: 180 KPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSMEGVATILYESRL 239
           KPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSMEGVATILYESRL
Sbjct: 181 KPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSMEGVATILYESRL 240
 
Query: 240 GCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFCRSWDGLFKFK 299
           GCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFCRSWDGLFKF 
Sbjct: 241 GCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFCRSWDGLFKFS 300
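
The summary lines above each alignment (Identities, Positives, Gaps) can be checked mechanically. The following is a minimal Python sketch, not part of the original analysis; the function names `parse_blast_stats` and `is_truncated_percent` are hypothetical. It assumes the reported percentages are integer-truncated, which is consistent with the values shown in this file but is not stated by the source.

```python
import re

# Matches fields such as "Identities = 293/301 (97%)" in a BLAST summary line.
STAT_RE = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")


def parse_blast_stats(line):
    """Return {field: (count, total, reported_percent)} for one summary line."""
    return {
        name: (int(count), int(total), int(percent))
        for name, count, total, percent in STAT_RE.findall(line)
    }


def is_truncated_percent(count, total, reported):
    """True if the reported percentage equals floor(100 * count / total).

    Assumption: percentages are truncated rather than rounded.
    """
    return (100 * count) // total == reported


# Summary line copied from the CYP27C1 alignment above.
line = " Identities = 293/301 (97%), Positives = 296/301 (98%), Gaps = 1/301 (0%)"
stats = parse_blast_stats(line)
for name, (count, total, reported) in stats.items():
    print(name, count, total, reported, is_truncated_percent(count, total, reported))
```

The same check can be run on the other summary lines in this file by passing each line to `parse_blast_stats`.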
 

>CYP39 (Q.Tran)

MELIFPTVIIILGCLALFLLLQRKNLRRPPCIRGWIPWIGVGFEFGKAPLEFIEKARIK (0)

YGPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYHT (1)

ASIPKNVFLALHEKLYIMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLV (2)

RHLLYPVTVNTLFNKSWFPTNKKKIKEFHQYFQAYDEDFEYGSQLPECLL (2)

RNWSKSKKWFLELFEKNIPDIKACKSAKDNSM (0)

TLLQATLDIVETETSKENSPNYGLLLLWASLSNAVP(0)

VAFWTLAYVLSHPDIHKAVMEGISSVFGTA (1)

GKDKIKVSEDDLEKLLLIKWCVLETIRLKAPGVITRKVVKPVEIL (0)

NYIIPSGDLLMLSPFWLHRNPKYFPEPELFKP (0)

ERWKKANLEKHSFLDCFVAFGSGKFQCPG (2)

RWFALLEVQMCVILILYKYDCSLLDPLPKQ (0)

SSLHLVGVPQPEGQCRIEYKQRI

 

>CYP39A1 AC008104 AL035670 note heme region exon corrected 1/18/02
          Length = 469
 
 Score =  934 bits (2413), Expect = 0.0
 Identities = 455/469 (97%), Positives = 459/469 (97%)
 
Query: 1   MELIFPTVIIILGCLALFLLLQRKNLRRPPCIRGWIPWIGVGFEFGKAPLEFIEKARIKY 60
           MELI PTVIIILGCLALFLLLQRKNLRRPPCI+GWIPWIGVGFEFGKAPLEFIEKARIKY
Sbjct: 1   MELISPTVIIILGCLALFLLLQRKNLRRPPCIKGWIPWIGVGFEFGKAPLEFIEKARIKY 60
 
Query: 61  GPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYHTASIPKNVFLALHEKLY 120
           GPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVY TASIPKNVFLALHEKLY
Sbjct: 61  GPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYRTASIPKNVFLALHEKLY 120
 
Query: 121 IMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVRHLLYPVTVNTLFNKSWF 180
           IMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVRHLLYPVTVN LFNKS F
Sbjct: 121 IMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVRHLLYPVTVNMLFNKSLF 180
 
Query: 181 PTNKKKIKEFHQYFQAYDEDFEYGSQLPECLLRNWSKSKKWFLELFEKNIPDIKACKSAK 240
            TNKKKIKEFHQYFQ YDEDFEYGSQLPECLLRNWSKSKKWFLELFEKNIPDIKACKSAK
Sbjct: 181 STNKKKIKEFHQYFQVYDEDFEYGSQLPECLLRNWSKSKKWFLELFEKNIPDIKACKSAK 240
 
Query: 241 DNSMTLLQATLDIVETETSKENSPNYGLLLLWASLSNAVPVAFWTLAYVLSHPDIHKAVM 300
           DNSMTLLQATLDIVETETSKENSPNYGLLLLWASLSNAVPVAFWTLAYVLSHPDIHKA+M
Sbjct: 241 DNSMTLLQATLDIVETETSKENSPNYGLLLLWASLSNAVPVAFWTLAYVLSHPDIHKAIM 300
 
Query: 301 EGISSVFGTAGKDKIKVSEDDLEKLLLIKWCVLETIRLKAPGVITRKVVKPVEILNYIIP 360
           EGISSVFG AGKDKIKVSEDDLE LLLIKWCVLETIRLKAPGVITRKVVKPVEILNYIIP
Sbjct: 301 EGISSVFGKAGKDKIKVSEDDLENLLLIKWCVLETIRLKAPGVITRKVVKPVEILNYIIP 360
 
Query: 361 SGDLLMLSPFWLHRNPKYFPEPELFKPERWKKANLEKHSFLDCFVAFGSGKFQCPGRWFA 420
           SGDLLMLSPFWLHRNPKYFPEPELFKPERWKKANLEKHSFLDCF+AFGSGKFQCP RWFA
Sbjct: 361 SGDLLMLSPFWLHRNPKYFPEPELFKPERWKKANLEKHSFLDCFMAFGSGKFQCPARWFA 420
 
Query: 421 LLEVQMCVILILYKYDCSLLDPLPKQSSLHLVGVPQPEGQCRIEYKQRI 469
           LLEVQMC+ILILYKYDCSLLDPLPKQS LHLVGVPQPEGQCRIEYKQRI
Sbjct: 421 LLEVQMCIILILYKYDCSLLDPLPKQSYLHLVGVPQPEGQCRIEYKQRI 469
 
>CYP46 C-term part (Ramy.Naguib)
RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP

 
GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME

 
VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPTPPPPPC
 
Alignment of the two proteins Identities = 173/174 (99%), Positives = 173/174 (99%), Gaps = 0/174 (0%)
 
Query  1    RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP  60
            RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP
Sbjct  327  RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP  386
 
Query  61   GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME  120
            GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME
Sbjct  387  GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME  446
 
Query  121  VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPTPPPPPC  174
            VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQP PPPPPC
Sbjct  447  VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC  500
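
The identity count reported for this alignment can be recomputed from the aligned rows themselves. The sketch below is illustrative only; the helper name `count_identities` is hypothetical, and it assumes that gap columns are written as `-` and never count as identities. Columns 1-120 are written once because the Query and Sbjct rows are identical there in the alignment above.

```python
def count_identities(query, sbjct, gap="-"):
    """Count identical, non-gap columns in two aligned rows of equal length.

    Returns (identities, alignment_length).
    """
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != gap)
    return identities, len(query)


# Columns 1-120: identical in both rows of the CYP46 alignment above.
shared = (
    "RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP"
    "GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME"
)
# Columns 121-174: the rows differ at a single position (T in rhesus, A in human).
query = shared + "VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPTPPPPPC"
sbjct = shared + "VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC"

identities, length = count_identities(query, sbjct)
print(f"Identities = {identities}/{length}")
```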
 
>CYP51 (Y.Peng) partial
AAAAGMMLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYLFRLAAGHLVQLPAG (0)
KSPPYIFSPIPFLGHAIAFGKSPVEFLENAYEK (0)
VFGKGVAYDVPNP (0)
VFLEQKKMLKSGLNIAHFKQHVSIIEKETKEYFQSWGESGEK (1)
VFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSF (2)
RRRDRAHREIKNIFYKAIQKRRQSQEKIDDILQTLLDATYK (0)
DGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCYLEQKT 

VCGENLPPLTYDQ (0)
TVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGE

KFAYVPFGA (1)
GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPVIRYKRRSK
 
Score = 2176 (766.0 bits), Expect = 5.2e-231, P = 5.2e-231
 Identities = 422/428 (98%), Positives = 423/428 (98%)
 
Query:     1 AAAAGMLLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYL-RLAAGHLVQLP 59
             AAAAGM+LLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYL RLAAGHLVQLP
Sbjct:     1 AAAAGMMLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYLFRLAAGHLVQLP 60
 
Query:    60 AGKSPPYIFSPIPFLGHAIAFGKSP-EFLENAYEKVFGKGVAYDVPNPVFLEQKKMLKSG 118
             AGKSPPYIFSPIPFLGHAIAFGKSP EFLENAYEKVFGKGVAYDVPNPVFLEQKKMLKSG
Sbjct:    61 AGKSPPYIFSPIPFLGHAIAFGKSPVEFLENAYEKVFGKGVAYDVPNPVFLEQKKMLKSG 120
 
Query:   119 LNIAHFKQHVSIIEKETKEYF-SWGESGEKVFEALSELIILTASHCLHGKEIRSQLNEKV 177
             LNIAHFKQHVSIIEKETKEYF SWGESGEKVFEALSELIILTASHCLHGKEIRSQLNEKV
Sbjct:   121 LNIAHFKQHVSIIEKETKEYFQSWGESGEKVFEALSELIILTASHCLHGKEIRSQLNEKV 180
 
Query:   178 AQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIK-IFYKAIQKRRQSQEKIDDILQ 236
             AQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIK IFYKAIQKRRQSQEKIDDILQ
Sbjct:   181 AQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAIQKRRQSQEKIDDILQ 240
 
Query:   237 TLLDATYKDGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQ-KCYLEQKT 295
             TLLDATYKDGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQ KCYLEQKT
Sbjct:   241 TLLDATYKDGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCYLEQKT 300
 
Query:   296 VCGENLPPLTYDQTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGE 355
             VCGENLPPLTYDQTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGE
Sbjct:   301 VCGENLPPLTYDQTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGE 360
 
Query:   356 KFAYVPFGAGRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPV 415
             KFAYVPFGAGRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPV
Sbjct:   361 KFAYVPFGAGRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPV 420
 
Query:   416 IRYKRRSK 423
             IRYKRRSK
Sbjct:   421 IRYKRRSK 428
 
 
 
FASTA format of assembled sequences
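
The records below are FASTA-like rather than strict FASTA: many are broken by blank lines, and some lines end in parenthesised markers such as (0), (1) or (2), which appear to mark exon boundaries (reading them as intron phases is an assumption, not something this file states). A minimal Python sketch for collapsing such a record into one continuous sequence follows. The name `parse_records` is hypothetical, and the sketch does not handle free-text note lines inside a record (for example "40aa gap"), which need manual removal first.

```python
import re

# Parenthesised markers such as "(0)", "(1)", "(2)", "(1?)" or "(?)".
MARKER_RE = re.compile(r"\([^)]*\)")


def parse_records(text):
    """Map each '>' header (without the '>') to its concatenated sequence.

    Blank lines are skipped, parenthesised markers are removed from sequence
    lines only, and internal whitespace is dropped. Header text is kept as-is.
    """
    records = {}
    name = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].strip()
            records[name] = ""
        elif name is not None:
            residues = MARKER_RE.sub("", line)
            records[name] += "".join(residues.split())
    return records


# Sample copied from the CYP24 record in this file.
sample = """>CYP24 (S.Jain) partial

MSSPISKSRSLAAFLQQLRSPRQPPRPVTSTAYTSPQPREVPVCPLTAGGETQNAAALPG
PTSWPLLGSLLQILWKGGLKKQHDTL(0)
VEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR
TESAYPQRLEIKPWKAYRDYRKEGYGLLIL
"""

records = parse_records(sample)
for header, sequence in records.items():
    print(header, len(sequence))
```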

 

>CYP1A1

MLFRISMSATEFLLASLIFCLVFWVIRASRPRVPKGLKNPPGPW

GWPLIGHILTLGKNPHLALSRMSQRYGDVLQIRIGSTPVLVLSGLDTIRQALVQQGDD

FKGRPNLYSFTLISNGQSMSFGPDSGPVWAARRRLAQNGLKSFSIASDPASSSSCYLE

EHVSKEAEVLISKLQEQMAGPGHFNPYRYVVISVANVICAICFGQRYDHNHQELLSLV

NLSNNFGEVVGSGNPADFIPILRYLPNRSLNGFKDLNEKFHSFMQKMIKEHYKTFEKG

HIRDITDSLIEHCQEKQLDENANIQLSDEKIVNVVLDLFGAGFDTVTTAISWSLMYLV

TNPRVQRKIQEELDTVIGRSRRPRLSDRSHLPYMEAFILETFRHSSFVPFTIPHSTTR

DTSLKGFYIPKGRCVFVNQWQINHDQKLWVNPSEFLPERFITPDGAIDKVLSEKVILF

GLGKRKCIGETIARWEVFLFLAILLQRVEFSVPPGVKVDMTPIYGLTMKHACCEHFQM

QLRS

 

>CYP1A2 (Aggarwal)

MALSQSVPFSATELLLASAIFCLVFWVLRGSRPRVPKGLKSPPEPWGWPLLGHVLTLGKN
PHLALARMSQLYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGNDFKGRPDLYSFTFITDG
QSMSFSPDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAEALISRLQELM
AGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNSHEFVESASSANPVDFFP
ILRYLPNPALQRFKAFNQRFRRFLQKTVQEHYQDFDK (0)
NSVQDITGALFKHSKKGPRASGNLIPQEKTVNLVNDIFGA (1)
GFDTIATAISWSLMYLVTKPEIQRKIQKEL (1)
DAVIGRGRRPRLSDRPQLPYLEAFILETFRHSSFVPFTIPH (2)
STTRDTTLNGFYIPRECCVFINQWQVNHDP  (2)
QLWGDPSEFRPERFLTAEGTTINKPLSEKIMLFGLGKRRCIGEVLGKWEVFLFLAILLQQLE
FSVPPGVKVDLTPIYGLTMKHARCEHFQAR
 

>CYP1A8P ortholog, possibly a functional gene in rhesus

MILNLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQLSPPGPWSFPIIGNLLQLGEHPYL

TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLLKDGEHFAGRPNMHTFSFLAEGKSL

SFSVNYGESWKLHKKIASKALRTLSNAEAKSSTCSCLLEEHVTEEVSELVTVFVELSSKN

GGFDPRNAITCAVANVVCALCFGKRYDHSDEEFLKIVKTNDDLLKASSAANPADFIPCLR

YLPLQIINAPREFYRALNGFIALHVQDHLATYDK (0)

DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVNDLFGA (1)

GFETVSTCLYWSFLYLIHYPEIQAKIQEEI (1)

DGNIGLKPPRFEDRKILPYTEAFISEVFRHASFLPFTIPHC (2)

TTADTTLNGYFIPRKTCTFINMYQVNHDE (2)

TIWDNPSLFRPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITAVLQQL

KLKKCPRAKLDLTPTYGLVMRPKPYQLEAERRSSGSSSA

 

>CYP1B1

MGTGLSPKDPWPLNLLSTQQTTLLLLLSVLVAVHVGQWLLRQRRRQLGSTPPGPFAWPLI

GNAAAVGQASHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPSF

ASFRVISGGRSMAFGHYSEHWKVQRRAAHSTMRNFSTRQLRSRQVLEGHVLSEARELVAL

LVRGSADGAFLDPRQLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL

VDVMPWLQYFPNPMRTAFREFEQLNRNFSNFVLDKFLRHCESLRPGAAPRDMMDAFILSA

EKKAARDSDDGGARLDLENVPATVTDIFGASQDTLSTALQWLLLLFIR (2)

YPDVQARVQAELDQVVGRDRLPCMDDQPNLPYVLAFLYEAMRFSSFVPVTIPHATNANTS

VLGYHIPKDTVIFVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKR

RCIGEELSKMQLFLFISILAHQCNFRANPNGPEMNFSYGLTIKPKSFKVNVTLRESMELL

DSAVQKLQAEETCQ

 

>CYP2A24 AY635460

MLASGLLLVALLACLTVMVLMSVWQQRNSKGKLPPGPTPLPFIG

NYLQLNTEQMCNSLMKISERYGPVFTIHLGPRRVVVLCGYDAVKEALVDQAEEFSGRG

EQATFDWVFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIEAL

RDTHGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLGMMLGSFQFTSTSTGQ

LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNQHTLDPNSPRDFIDSFLIRMQ

EEEKNPNTEFYLKNLMMTTLNLFIAGTETVSTTLRYGFLLLMKYPEVEAKVHEEIDRV

IGKNRQPKFEDRVKMPYMEAVIHEIQRFGDVIPMSLARRVNKDTKFRDFFLPKGTEVF

PMLGSVLRDPRFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSIGKRNCFGEGLARMELF

LFFTTIMQNFRFKSPQLPKDIDVSPKHVGFATIPPNYTMSFLPR

 

>CYP2A23 AY635459.1

MLASGLLLVALLACLTVMVLMSVWQQRNSKGKLPPGPTPLPFIG

NYLQLNTEQMYNSIMKISERYGPVFTIHLGPRRIVVLCGYDAVKEALVDQAEEFSGRG

EQATFDWLFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIEAL

RDTQGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSAGQ

LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNRRTLDPNSPRDFIDSFLIRMQ

EEEKNPNTEFHLKNLVLTSLNLFFGGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRV

IGKNRQPKFEDQARMPYMEAVIHEIQRFGDMLPLGVAHRVIKDTKFRDFFLPKGTEVF

PMLGSVLKDPKFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSIGKRNCFGEGLARMELF

LFFTTIMQNFRFKSPQSPKDIDVSPKHVGFATIPPNYTMSFLPR

 

>CYP2C43 AB212264.1

MDSLVVLVLCLSCLLLLSLWRQRSGRGKLPPGPTPLPVIGNILK

IGIKDVSKSLTNLSKVYGPVFTLYFGLERMVVLHGYEAVKEALIDLGEEFSGRGHFPL

FERANRRFGLVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK

ASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKFNENAKILSSPWIQIYNN

FSPIIDYFPGTHNKLLKNIAFVKSYILEKVKEHQESMDMNNPRDFIDCFLIKMEKEKH

NQQSEFNIENLENTAVDLFAAGTETTSTTLRYALLLLLKHPEVAAKVQEEIEHVIGRN

RSPCMQDRSRMPYTDAVVHEIQRYIDLLPTSVPHAVTCDVKFRNYLIPKGTTILISLT

SVLRDNKEFPNPEMFDPRHFLDEGGNFKNSNYFMPFSAGKRICVGEALARMELFLFLT

SILQNFNLKSLVDLKDLDTTPVFNGFVSVPPIYQLCFIPV

 

>CYP2C74 variant (S.Sarva) missing exon 1, 4 aa diffs to 2C74 AY635462.1
FSKVYGPVFTVYFGMNPVVVLHGYETVKEALIDNAEEFSGRGILPISERITNGL (1)
GIISSNGKRWKETRRFSLTTLRNFGMGKRSIEDRVQEEARCLVEELRKTK (1)
ASPCDPTFILGCAPCNVICSVVFQKRFDYKDENFLTLMKRFTVNFRILTSPWIQ (0)
VCNNFPLLIDCFPGTHNKLLKNVALTKSYIRKKVKEHQATLDVNNPRDFIDCFLIKMEQ (0)
EKDNQQSEFTIENLVGTVADLFVAGTETTSTTLRYGLLLLLKHPEVT (1)
AKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVIHEIQRYIDLVPTGVPHAVTTDIKFRNYLIPK (0)
GTIIITLLTSVLQDDKEFPNPKIFDPGHFLDENGNFKKSDYFMPFSA (1)
GKRICAGEGLARMELFLFLTTILQNFNLKSVADLKNLNTTSATRGIISLPPSYQICFIPV

 

>CYP2C75 AY635463.1

MDSLVVLVLCLSCLLLLSLWRQRSGRGKFPPGPTPLPVIGNILQ

IDIKDVSKSLTNLSKVYGPVFTLYFGLERMVVLHGYEAVKEALIDLGEEFSGRGHFPL

ADRANRGFGIVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK

GSPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLKVMEKLNENVKILSSPWIQICNN

FPPFIDYFPGAHNKLLKNIAFLKSYILEKVKEHQESMDMNNPRDFIDCFLMKMEKEKH

NQQSEFTIENLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRN

RSPCMQDRSHMPYTDAVVHEIQRYIDLLPTNLPHAVTCDVKFRNYLIPKGTTILISLT

SVLHDNKEFPNPEMFDPRHFLDEGGNFKKSNYFMPFSAGKRICVGEALARMELFLFLT

SVLQNFNLKSLVDPKDLDTTPVVNGFASVPPFYQLCFIPV

 

>searched with 2C29 differs from 2C43, 2C74, 2C75 (Liao)

MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNFSKV

YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW

KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKTNASPCDPTFILGCAPCNVICS

VIFHNRFDYKDQRFLNLMEKFNENLRILSSPWIQ

 

>CYP2B30 (Puljic) also AY635461.1

MELSVLLFLALLTGLLLLLVQRHPNAHGRLPPGPCPLPLLGNLL

QMDRRGLLRSFLRFREKYGDVFTVYLGPRPVVMLCGVEAIREALVDNAEAFSGRGKIA

ITDPVFQGYGVVFANGNRWKVLRRFSLTTMRDFGMGKRSVEERIQEEAQCLIEELRKS

KGALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFE

LLSGFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEK

SNPHSEFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAERIYKEIEQVIGP

HRPPALDDRAKMPYTEAVIHEIQRFADLLPMGVPHIVTQQTSFRGYIIPKDTEVFPLL

STALHDPHYFEKPDTFNPDHFLDANGALKKNEAFIPFSLGRRMCLGEGIARNELFLFF

TTILQNFSVASPVAPEDIDLTPQESGVGKIPPTYQIRFLPR

 

>CYP2F6 AY952296.1

MDSISTAILLLLLALVCLLLTLSSRDKXKLPPGPRPLPLLGNLLLLRSQNMLTSLTQ

LSKEYGSVYTVHLGPRRVVVLSGYQAVKEALVDQGEEFSGRGDYPVFFNFTKGN

GIAFSNGDRWKVLRRFSIQILRNFGMGKRSIEERILEEGSFLLAELRKTE

GEPFDPTFVLSRSVSNIICSVLFGSRFDYDDERLLTVIRLINDNFQIMSSPWGE (0)

LYNIFPSLLDWVPGPHQRIFQNFKRLRDLIAHRVHDQQASLDPRSPRDFIDCFLTKMAE

EKEDPLSHFHMDTLLMTTHNLLFGGTETVGTTLRHAFLALMKYPKVQ

ARVQEEIDLVVGRTRLPTLEDRAAMPYTDAVIHEVQRFADIIPMNLPHRVIRDTAFRXFLIPK

GTDIITLLNTVHYDPSQFLXPQEFNPEHFLDANQSFKKSPAFMPFSA

GRRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRSFQLCLCPR

 

>CYP2D42 (Vasser) also AY635464.1

MELDALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ (0)

LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRPPVPINQVLGVGPRSQ (1)

GVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFTDQA (1)

GRPFRPNSLLDKAVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESGFLRE (0)

VLNAVPLLLRIPGLAGKVLRSQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEK (0)

AKGNPESSFNEENLRIVVADLFSAGMVTTSTTLAWGLLLMILHPDVQR (1)

RVQQEIDNVIGQVRRPEMGDQARMPYTTAVIHEVQRFGDIVPLGVTHMTSRDIELQGFLIPK (0)

GTTLFTNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVKPEAFLPFSA (1)

GRRACLGEPLARMELFLFFTCLLQRFSFSVPAGQPRPSHHGVFAFLVTPSPYELCAVPR*

 

>CYP2E1 AY635465.1

MSALGVSVALLVWVAVLLLVSIWRQVHSSWNLPPGPFPLPIIGN

LFQLELKNIPKSFTRLAQRFGPVFTLYVGSRRVVVVHGYKAVREVLLDHKDEFSGRGD

IPAFHAHRDRGIIFNNGPTWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRK

TQGQPFDPTFLIGCAPCNVIADILFRKHFDYNDEKFLRLMYLFNENFQLLSTPWLQLY

NNFPSLLHYLPGSHRKVMKNVAEIKEYVSERVKEHLQSLDPNCPRDLTDCLLVEMEKE

KHSAERLYTMDGITVTVADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG

PSRIPAIKDRQEMPYMDAVVHEIQRFITLVPSNLPHEATRDTIFRGYIIPKGTVIVPT

LDSVLYDNQEFPDPEKFKPEHFLDESGKFKYSDYFKPFSAGKRVCAGEGLARMELFLL

LSAILQHFNLKPLVDPKDIDISPVNIGFGCIPPRFKLCVIPRS

 

>CYP2G2P best hit (Li Chen) Note this does not look like a pseudogene
MELGGAVTIFLALCLSCLLVLIAWKRMNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK (0)
LKEKYGPLFTVYMGLWPVVVLCGHEAVKEALIDQADEFSGRGKLASIEQNFQGH (1)
GVALANGERWRILRRFSLTILRDFGMGKRSIEERILEEASYLLEEFRKTK (1)
GAPIDPTFLLSRTVSNVISSVVFGSRFDYEDKQFLNLLRLINESFIEMSTPWAQ (0)
LYDMYSGIMQYLPGRHNRVYYLIEQLKDFIASRVKINEASFDSQNPRDFIDCFLIKMHQ (0)
DKNNPRTEFNLKNLVLTALNLFFAGTETVSSTLRYGFLLLMKHPEVE (1)
ARIHEEINQVIGPHRLPSVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNVIRDTQFRGYLLPK (0)
GTDVFPLLGSVLKDPKYFRYPEAFYPQHFLDEQGRFKKNEAFVPFSS (1)
GKRICLGEAMARMELFLYFTSILQNFSPRSLVPPADIDITPKLSGFGNIPPTYELCLVAR

 

>CYP2J2 best hit (Z. Zhang) partial

GLIMSSGQIWKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKI
NNAVSNIICSITFGERFDYQDSQFQELLKLLDEVTYLEASKTCQLYNIFPWLMKFLPGPH
QTLFSNWEKLKLFVSHMIEKHRKDWNPAETRDFIDAYLKEMSKHTGNSTSSFHEENLICS
TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQAEIDRVIGQGQQPSTAARESMPYTNA
VIHEVQRMGNIVPLNVPREVTVDTTLAGYHLPKRACLGEQLARTELFIFFTSLVQKFTFR
PPNNEKLSLKFRMGITISPVSHHLC

 

>CYP2R1 (G.Zhu) partial

IFSLDLGGISTVVLNGYDVVKECLVHQSGIFADRPCLPLFMKMTKMGGLLNSRYGQGWVE
HRRLAVNSFRYFGYGQKSFESKILEETKFFTDAIETYKGRPFDFKQLITSAVSNITNLII
FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNASVVYD
FLSRLIEKASVNRKPQLPQHFVDAYFDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT
TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKFKMPYTEAVLHEVLRFCNIV
PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK
EALVPFSLGRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL
ICAERR

 

>CYP2S1 AC011510 one exon per line 78% to mouse 2s1 49% to 2B6 47% to 2A13

exons 2,3 from CO649282.1

MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMR (0)

LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH (1)

GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE (1)

GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAMVRAAGGTLLGVSSRGGQ (0)

TYEMCSWFLWPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ (0)

EEQNPDTEFTNKNMLMTVIYLLFAGTMTVSATVGYTLLLLMKYPHVQ (1)

KRVREELTQELGSGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ (0)

GTEVFPLLGSILHDPSIFKHPEEFNPDHFLDADGRFRKHEAFLPFSL (1)

GKRVCLGEGLAKAELFLFFTTILQAFSLESPCPLDSLSLKPTISGLFNIPPAFQLQVRPTDLHSTTQTT*

 

>CYP2T2P ortholog, SCAFFOLD100362 (+) 38209-41795

frameshift in exon 4 after VIC, numerous other defects

MIAGIAALLLWLLVLALARWG*GGCRARMRGSLPPRPRPLPLLGNLQLQSGGTDHALHS (?)

LSGRWGPVFTAQLGPRPAVVLCGYAALRDALVLQADAFSGRGSMAVFERFTRGH (1)

GIFLSNGPRWWTLRNFAVGALKELGLGTRTIQAHVLEEAACLLDEMQATI (1)

GAPFDPMRLLDNAVSNVICX

LVFGNRYGYGDPEFLRLLNLFSDNFRIMSSRWGE (0)

(?) SLMDWLPGRHRRIFRNF

SELWVFISEQIQQHWQMRQPAEPRDFINCLTRWVRRGSQ

QDPESHFQEETSVMMTHLFFGGTETSTTLCYGLLVLLKYPEVA (1)

AKVQELDPVVGWRCAPSPDDHQRLPYTNAVLLQIQRFISVVPLGLPRX

TLNTHLHSHCLPK (1)

GTFVIPLLVTAHXDRTQFKDPDCFNATNFLDKGKSQGNDPFMPFAS (1)

(?) GKQMCLGAGLAHLEIFLFLTATLPRFRLLPVVNPGTINLT

QFTGLGSVPPAFQLQLVAC

 

>CYP2U1 (Li Chen) note GC boundary between exons 7,8
MSSPGPPQPPAEDPPWPARLLRAPLGLLRMDPSGDALLLCGLVAVLGWSWLRRRRARGIP
PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVVGPQVLLAHLARVYGSIFSF
FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEK (1)
GVVFAHYGPIWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSI
ISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLMVNICPWLYYLPFGP
FKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLF
YIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ (1)
EKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSGNT (1)
VLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGI (1)

GKRVCMGEQLAKMELFLMFVSLMQSFAFALPEKSKKPLLTGRFGLTLAPHPFNITISRR*

 

>CYP2W1 Macaca mulatta rhesus monkey (Mahrous)

LSERYGPVFTVHLGCQKTVVLTGFEVVKEALAGPGQELADRPPIAIFQLIQRGG (1)

GIFFSSGARWRAARQFTVRALHSLGVGRKPVADKILQELKCLLGQLDGYR (1)

GQPFPLALLGWAPSNITFTLLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQ (0)

LFNVYPWLGALLQLHRPVLRKIEEVRAILRTLLEARRPHMRPGDPVCSYVDALIQQGQ (0)

GDDPEGLFAEDNAVACTLDMVMAGTETTSATLQWAALLMGRHPDVQ (1)

GRVQEELDRVLRRGRPPQPEDQQVLPYTSAVLHEVQRFITLLPHVPRCTATDMQLGGFLLPK (0)

GTPVIPLLTSVLLDETQWQTPDQFNPGHFLDADGHFVKQEAFLPFSA (1)

GRRVCVGERLARTELFLLFAGLLQKYYLLPPPGVSPASLDTTPAQAFTMRPRAQALCAVPRP

 

>CYP2AB1P SCAFFOLD46808:34-204, SCAFFOLD101629:758-8003 (no ESTs found)

MLSLLSGLALLAISFLLLKLGTFCWDRNRLPPGPFPFPILGNLWQLRFQLHPETLLQ (0)

LAQTHG

VCLFTVWVSPIPIVVLSGFRAVKEALVSNSEQFSGRPLTSLFQDLFGEQ (1)

GIVCSRRHMWWQQRRFCLVTLQGLGLGKLALEVQLQKQAAELVEAFRQEL (1)

SRSFDPQVSIVRSTVRVIGALVFGHHFLSEDPIFQELTQAIDFGLALVRTVWHW (0)

LHDVFPRALCHLPGSHREIFRYQGVVRSFTRREITGRKLKALEALKDFINCSLAQISK (0)

AMDEPVSTFHEENLVQVVIDLFLGGTNTTATTQRWALVYMIQHGAVQ (1?)

 

GTIILPLCRGSVLYDPECWETPPQFNPGHFLDKDGNFVANEAFLPFSA (1)

GHCVCPGDQLARMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTLQPQPQEICAVPR

 

>CYP2AC1P SCAF55146 (+) 91919-103589, SCAF70822:43519-43602, SCAF103481:817-966

1 MSEFDASAILPIRVLILIFILSIKKFMTEASKQLSPPGPRPLLVIGNLYFLNLKRPYQTMLE (0)

 

3 GIAFFHGETWKTMRWFSLTTLQNFGMDEWIIEDTIIEECQNLIQNSEFHR

4 GKSFEMKTIMNASVVNIIVLVLPGKWFDYQDSQFLRLLALIGENVKLIGGLRIAVN (1)

5 SFQYVSFWGVLLKSHKTVFRNRDELFSFIRMIFLDHCHKLDKNDPRSFTDAFLVTQQE (0)

6 ENDTFADHFSDENLMALVNNLFTTGTETTASTLPWGILLVICLRSRV (1)

7 KKVHNEVTKVARSAQP*LAHQTQMPHTDAVSHEVQRFANILPTSLPHATPTNIFKNYYIPK (0)

8 ATEVIILLASVRRDQAQWEKPDTFNPEHFLTSKGKFIKREAFLPFTV (1)

9 GRRMCAGESSAR MELFLFFTSLLQ KFTFQPPLGVSHLDLDLSLDIGFTT*

 

>CYP3A64 AY582531.1

MDLIPDLAVETWLLLAVTLVLLYLYGTHSHGLFKKLGIPGPTPL

PLLGNILSYRKGFWTFDMECYKKYGKVWGFYDGRQPVLAITDPNMIKTVLVKECYSVF

TNRRPFGPVGFMKNAISIAEDEEWKRIRSLLSPTFTSGKLKEMVPIIAKYGDVLVRNL

RREAETGKPVTLKDVFGAYSMDVITSTSFGVNIDSLNNPQDPFVENTKKLLRFDFLDP

FFLSITIFPFIIPILEVLNISIFPREVTSFLRKSVKRIKESRLKDTQKHRVDFLQLMI

DSQNSKETESHKALSDQELVAQSIIFIFAGYETTSSVLSFIIYELATHPDVQQKLQEE

IDTVLPNKAPPTYDTVLQMEYLDMVVNETLRIFPIAMRLERVCKKDVEINGIFIPKGV

VVMIPSYALHHDPKYWPEPEKFLPERFSKKNNDNIDPYIYTPFGSGPRNCIGMRFALM

NMKLAIIRVLQNFSFKPCKETQIPLKLRLGGLLQTEKPIVLKIESRDGTVSGA

 

>CYP3A43 ortholog? partial assembly (Aggarwal) 72% to 3A64
MDLIPNFAMETWVLVATSLVLL (2)
YIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLR (0)
GLWKFDRECNEKYGEMWG (2)
LYEGQQPMLVIMDPDMIKTVLVKECYSVFTNRM (0)
PLGPMGFMKSALSFAEDEEWKRIRTLLSPAFTSVKFKE  (0)
MVPIISQCGDMLVRSLRREAENSKPTNLKE

 

>CYP4A11 match (Ramy.Naguib, Puljic) (exon 1 added later)
MSVSVLSPSRLLGGVSGILQVASLLILLLLLIKAAQLYLHRQWLLKAFQQFPCSPSHWLFGHKQE (0)
FQQDQELQRIRKWVEMFPSACPLWLWGGKARVQLHDPDYMKVILGRS (1)
DPKSQDPYRFLAPWI (1)
GYGLLLLNGQTWFQHRRMLTPAFHYDILKAYVALMADSVRVML (0)
DKWEKLLGQDSPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDR (2)
DSQSYIQAISDLNNLVFSRVRNVFHQNDTIYSLTSTGRWTHRACQLAHQHT (1)
DQVIQLRKAQLQKEGELEKVKRKKHLDFLDILLLAK (0)
MENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHGLLGDGASITW (2)
NHLDQMPYTTMCIKEALRLYPPVPGISRELSTPVTFPDGRSLPK (1)
GITVMLSIYGLHHNPKVWPNPE (0)
VFDPSRFAPGSAQHSHAFLPFSGGSR (2)
NCIGKQFAMNELKVATALTLLRFELLPDPTRIPIPMARLVLKSKNGIHLRLRRLPNPCEDKDQL

 

>CYP4B1

MVPSFLSLRLSCLGLWASGLILVLGFLKLIRLLLRRQRLAKAMGNFPGPPTHWLFGHALE (0)

IQQTGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRG (1)

DPKAPDVYDFFLQWI (1)

GRGLLVLEGPKWFQHRKLLTPGFHYDVLKPYVALFAESTRVML (0)

DKWEEKAQEGKSFDIFCDVGHMALNTLMKCTFGRGDTGLGHR (2)

DSSYYLAVSDLTLLMQQRLVSFHYHNDFIYWLTPHGRRFLRACQVAHDHT (1)

DQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGAR (0)

DEDDSKLSDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDSFQW (2)

DDLGKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPA (1)

GSLISMHIYALHRNSAVWPDPE (0)

VFDPLRFSTENASKRHPFAFMPFSAGPR (2)

NCIGQQFAMSEMKVVTAMCLLHFEFSLDPSRLPIKMLQLVLRSKNGIHLHLKPLGPGSGK

 

>CYP4V2 (S.Sarva)
MAGIWLGLVWQKLLLWGAASAVSLAGASLVLSLLQRVASYVRKWQQMRPIPTVARAYPLV 
GHALLMKRDGR (1)
EFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVE (0)
VILTSSKQIDKSSMYKFLEPWLGLGLLTS (2)
TGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKHVNQEAFNCFVYITLCALDIIC (1)
ETAMGKNIGAQSNDDSEYVRAVYR (2)
MSEMIFRRIKMPWLWLDLWYLMFKEGWEHKKSLKILHAFTNN (0)
VIAERANEMNVDEDCRGDGRDSAPSKNKRRAFLDLLLSVTDDEGNRLSHEDIREEVDTFMFE (0)
GHDTTAAAMNWSLYLLGSNPEVQKKVDHELDDVF (1)
GRTDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEV (1)
AGYRVLKGTEAV
IIPYALHRDPRYFPNPEEFRPERFFPENAQGRHPYAYVPFSAGPRNCI (1)
GQKFAVMEEKTILSCILRHFWIESNQKREELGLEGQLILRPTNGIWIKLKRRNADE 1551

 

>CYP4X1 (Vasser) missing exon 1

FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFFYIYDPDYAKTFLSRT (1)

DPKSQYLQKFLPPLI (1)

GKGLLALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKTML (0)

DKWEKICSTQNTSVEVYEHINLMSLDIIMKCAFSKETNCQTN (2)

STHDPYVKAIFELGKIIFHRLYSFLYHSDIIFKLSPQGYRFQKLSRVLNQYT (1)

DAIIQERKKSLQAGEKQDNTQKRKYQDFLDIVLSAK (0)

DENGSSFSDTDVHSEVSMFLLGGHDSLAASISWILYCLALNPEHQERCREEVRGILGDGCSITW (2)

DQLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA*

 

>CYP5A1 (Z. Zhang) partial

ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMTPLISQACDLLLAHLKRYAE
SGDAFDIQRCYCNYTTDVVASVAFGTPVDSQQAPEDPFVKHCKRFFEFCIPRPILVLLLS
FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP
VGVQDFDMVGDVFSSTRCKPNPSRQHQAGPMARPLTVDEIVGQAFIFLIAGYEIVTNTLS
FATYLLATNPDCQEKLLREVDLFKEKHMVPEFCSLEEGLPYLDMVIAETLRMYPPAF

 

>CYP7A1 (A.Bolen)

MMTTSLIWGIAIAACCCLWLILGIRRR (2)

QTGEPPLENGLIPYLGCALQFGANPLEFLRANQRKHGHVFTCKLMGKYVHFITNPLSY

HKVLCHGKYFDWKKFHFATSAK (0)

AFGHRSIDPMDGNTTENINDTFIKTLQGHALNSLTESMMENLQRIMRPPVSSN

SKTAAWVTEGMYSFCYRVMFEAGYLTIFGRDLTRRDTQKAHILNNLDNFKQFDKVFPA

LVAGLPIHMFRTAHNAREKLAESLRHENLQKRESISELISLRMFLNDTLSTFDDLEKA

KTHLVVLWASQANTIPATFWSLFQMIR (2)

NPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQAELNDLPVL (1)

DSIIKESLRLSSASLNIRTAKEDFTLHLEDGSYNIRKDSIIALYPQLMHLDPEIYPDPL (0)

TFKYDRYLDENGKTKTTFYCNGLKLKYYYMPFGSGATICPGRLFA

IHEIKQFLILMLSYFELELIEGQAKCPPLDQSRAGLGILPPLNDIEFKYKFKHL*

 

>CYP7B1 (G.Zhu) partial

RRPGEPPLIKGWLPYLGVVLKLRKDPLSFMKTLQKQHGDTFTVLLG

GKYITFILDPFQYQLVIKNHKQLSFRLFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN

LKQVFESQLLKTTSWDTAQLYPFCSSIIFEITFTTIYGKVLVCDNKFISELRDDFLKFDD
KFAYLVSNIPIELLGNVKSIRKKIIKCLSSENLAKMQGWSEVFQSRQDVLEKYYVHEDLE
IGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIHL
TREQLDSLICLESTIFEALRLSSYSTTIRFVEEDLTLSAQTGDYCVRKGDLGAIFPPILH
GDPEIFEAPDSKEFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALMEI
KQLLVILLTYFDLEIIDDKPIGLNYNRLLFGIQYPDSDVLFRYKVKS

 

>CYP8A1 partial (Lin Zhu)

GDKDHMCSVKSRLWKLLSPARLATRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWATQ (0)

GNMGPAAFWLLLFLLKNPEALAAVRGELESILWEAEQPVSQMTTLPQKVLDGTPVL (1)

DSVLSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPE (0)

VFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGKSYAVNSIKQ (2)

FVFLVLVHLDLELINADVEIPEFDLSRYGFGLMQPEHDVPIRYRIRP

 

>CYP8B1 SCAFFOLD114862:8-613, SCAFFOLD39206:3-626 no introns

BB882888.1 Macaca fascicularis lower case

MVLWGPVLGALLVVIAGYLCLPGMLRQRRPREPPLDKGTVPWLGYAMAFRKNMFEFLKRM

RSKHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH

EMIHSASTKHLRGDGLKDLNETMLDSLSFVMLKSKGWSLDASCWHEDSLFHFCYYILFTA

GYLSLFGYTKDKEQDLLQAGEL

40aa gap

shsqxkegisnwlcnmlqflreqgvpsamqdkfnfmmlwasqgntgpts

FWALLFLLKHPEAIRAVRQETTQVLGEARLETKQSFAFKLSALQHTPVLDSVVEETLRLR

AAPTLLRLVHEDYTLKMASGQEYLFRRGDILALFPYLSVHVDPDIHPEPTIFKYDRFLNP

NGSRKVDFFKAGKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTP

LPHVDPQRWGFGTMQPSHDVRFRYRLRP

 

>CYP11A1 N-term = DQ228169.1 Macaca fascicularis = lower case (Mahrous)

mlakglpprsvlvkgcqtflsapkerlghlrvptsegagistrs

prpfneipspgdngwlnlyhfwretgthkvhlhhvqnfqkydpiy

REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLK (2)

KSAAWKKDRVALNQEVMAPETTKNFLPLLDAVSRDFVSVLHRRIKKAGSGNFSGDISDDLFRFAFE (1)

SITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAAWDVIFSK (1)

ADMYTENFHWELRQKGNVHHDYRGILYRLLGDSKMSFEDIKANVTEMLAGGVDT (0)

TSMTLQWHLYEMARNLKVQDMLRAEVLAARRQAQGDMATMLQLVPLLKASIKETLR (2)

LHPISVTLQRYLVNDLVLRGYMIPAK (0)

TLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITYFRNLGFGWGVRQCLGRRIAELEMTIFLIN (0)

MLENFRVEIQHLSDVGTTFNLILMPEKPISFTFWPFNQEATQ

 

>CYP11B1 (Lin Zhu)

MALRAKAEVCMAAPWLSLQRARALGTRATRVPRTVLPFEAMPRRPGNRWLRLLQIWREQG
YEHLHLEVHQTFQELGPIFR (2)
YDLGGAGMVCVMLPEDVEKLQQVDSLNPRRMSLEPWVAYRQHRGHKCGVFLL (2)
NGPEWRFNRLRLNPDVLSPRAVQRFLPMVDAVARDFSQALRKKVLQNARGSLTLDVQPSIFHYTIE (1)
ASNLALFGERLGLVGHSPSSASLSFLHALEVMFKSTVQLMFMPRSLSRWTSPKVWKEHFEAWDCIFQY (1)
GDNCIQKIYQELALSRPQQYTSIVAELLLNAELSPDAIKANSMELTAGSVDT (0) 
TVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATTELPLLRAALKETLR (2)
LYPVGLFLERVVSSDLVLQNYHIPAG (0)
TLVRVFLYSLGRNPALFPRPERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHH (0)
VLKHLQVETLTQEDIKMVYSFILRPSTFPLLTFRAIN

 

>CYP17 AY746983.1 and AF458332.1

MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLP

RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQVTTL

DILSNNRKGIAFADYGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH

NGQTIDISFPVFVAITNVISLICFNISYKNGDPELKIVHNYNEGIIDSLGKESLVDLF

PWLKVFPNKTLEKLKRHVKTRNDLLTKIFENYKEKFHSDSITNMLDVLMQAKMNSDNG

NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWIVAFLLHNPQVKKKLYEEIDQ

NVGFSRTPTISDRNRLLLLEATIREVLRIRPVAPMLIPHKANVDSSIGEFAVDKGTHV

IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSLSYLPFGAGPRSCIGEILARQ

ELFLIMAWLLQRFDLEVPDDGQLPSLEGNPKVVFLIDSFKVKIKVRQAWREAQAEGST

 

>CYP19 (Iyer)

MVLEMLNPMHYNITSMVPEAMPAATMPILLLTGLFLLVWNYEGTSSIP (1)

GPGYCMGIGPLISHGRFLWMGIGSACNYYNQVYGEFMRVWISGEETLIISK (2)

SSSMFHIMKHNHYSSRFGSKLGLQCIGMHEKGIIFNNNPDLWKTTRPFFMK (1)

ALSGPGLVRMVTVCAESLKTHLDRLEEVTNESGYVDVLTLLRRVMLDTSNMLFLRIPLD (1)

ESAIVVKIQGYFDAWQALLIKPDIFFKISWLYKKYEKSV (2)

KDLKDAIEVLIAEKRRRISTEEKLEECMDFATELILAE (0)

KRGDLTRENVNQCILEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIMKEIQTVV (1)

GERDVKIDDMQKLKVMENFIYESMRYQPVVDLVMRKALEDDVIDGYPVKKG

TNIILNIGRMHRLEFFPKPNEFTLENFAKN (0)

VPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVERIQ

KIHDLSSHPDETKNMLEMIFTPRNSDRCLEH

 

>CYP21 (Blackwell) partial

LVSKNYPDLSLGDYSLLWKAHKKLTRSALLLGMRDSMEPVVEQLTQEFCERMRAQAGTPV

AIEEEFSLLTCSIICHLTFGDKIKDNLVPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFFP

NPGLRRLKQAIEKRDHIVEKQLRQHKESLVAGQWRDMMDYMLQVVAQPSMEEGSGQLLEG

HVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPSASSSRVPYKDR

ARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDEMVWE

RPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPPGDALP

SLQPLPHCSVILKMQPFQVWLQPRGLGVHSLGQSQ

 

>CYP24 (S.Jain) partial

MSSPISKSRSLAAFLQQLRSPRQPPRPVTSTAYTSPQPREVPVCPLTAGGETQNAAALPG
PTSWPLLGSLLQILWKGGLKKQHDTL(0)
VEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR
TESAYPQRLEIKPWKAYRDYRKEGYGLLIL

 

>CYP26A1 (Liao, Iyer)

MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM

VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGEHRLVSVHWPASVRTIL

GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE

VKRLMFRIAMRILLGCEPQLAGDGDAEQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR

NLIHARIEQNIRAKICGLRASEAGRGCKDALQLLIEHSWERGERLDMQALKQSSTELLFG

GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI

KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM

LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV

YPVDNLPARFTHFHGEI

 

>CYP26B1 (S.Jain) partial
VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIP
EEDLGHLFEVYQQFVENVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDY
SDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLR
EELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDG
FQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLG
KHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNKILPE
TEAMLSATV

 

>CYP26C1 (C.Blackwell) missing an exon

MFPWGLSCLSVLGAAGTAVLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFGETLHWLV(0)

QGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQSAHILLGSHTLLGAV

GEPHRRRRK (0)

VLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVSVYDAAKALTFRMAARILLGLRLDEAQCATL

ARTFEQLVENLFSLPLDVPFSGLRK (0)

SAVELLFAAFFTTASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGCVGPPPDCG
CEPDLSLAALGSLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD(0)

GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGAAREDSRGASSRFHYIPFGGGARSCLGQEL

AQTVLQLLAVELVRTARWELATPAFPALQTVPIVHPVDGLRLFFHPLAPLVAGDGLCL

 

>CYP27A1 (Q.Tran)

QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDLHDLTYGPFTT(2)

EGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMIRLDQLRAESASGNQVSDTAQLFYYFALE (1)

AICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWNAIFSF (1)

GKKLIDEKLEDMEAQLQAEGPDGVQVSGYLHFLLASGQLSPREAMGSLPELLMAGVDT (0)

TSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHLPLLKAVLKETLR (2)

LYPVVPTNSRIIEKEIEVDGFLFPKN (0)

TQFVFCHYVVSRDPTTFSEPESFQPHRWLRSSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLAR (0)

LIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQC

 

>CYP27B1 (Xin Liu)
MTQTLKYASRVFHRVRWAPELGASLGYREYDSARRSLADIPGPSTPSFLAELFCKGGLSR
LHELQVQGAARFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRRR
QRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAETLDNVVRDLVRRLRCQRGRGTGPP
ALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPH
WLRRLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNRGQPDEDLESGAHLTHFLFQE
ELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALGPGSSAHPP
ATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPA
QFPEPNSFHPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILIHFEVQP
EPGAAPIRPMTRTVLVPERSINLQFLDR
 
>CYP27C1 (F.Zhang) partial
MQTSAMALLARILRAGLRPAPERGGLLGGAAPRRPQPAGARLPAGARAEDKGAGRPGAPP
AGGRAEGPRLAAMPGPRTLANLAEFFYRDGFSRIHEIQQKHT
QEYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA
EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLF
FKYSMEGVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIP
KPWREFCRSWDGLFKF

 

>CYP39 (Q.Tran)

MELIFPTVIIILGCLALFLLLQRKNLRRPPCIRGWIPWIGVGFEFGKAPLEFIEKARIK (0)

YGPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYHT (1)

ASIPKNVFLALHEKLYIMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLV (2)

RHLLYPVTVNTLFNKSWFPTNKKKIKEFHQYFQAYDEDFEYGSQLPECLL (2)

RNWSKSKKWFLELFEKNIPDIKACKSAKDNSM (0)

TLLQATLDIVETETSKENSPNYGLLLLWASLSNAVP(0)

VAFWTLAYVLSHPDIHKAVMEGISSVFGTA (1)

GKDKIKVSEDDLEKLLLIKWCVLETIRLKAPGVITRKVVKPVEIL (0)

NYIIPSGDLLMLSPFWLHRNPKYFPEPELFKP (0)

ERWKKANLEKHSFLDCFVAFGSGKFQCPG (2)

RWFALLEVQMCVILILYKYDCSLLDPLPKQ (0)

SSLHLVGVPQPEGQCRIEYKQRI

 

>CYP46 C-term part (Ramy.Naguib)
RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP
GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME
VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPTPPPPPC

 

>CYP51 (Y.Peng) partial
AAAAGMMLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYLFRLAAGHLVQLPAG (0)
KSPPYIFSPIPFLGHAIAFGKSPVEFLENAYEK (0)
VFGKGVAYDVPNP (0)
VFLEQKKMLKSGLNIAHFKQHVSIIEKETKEYFQSWGESGEK (1)
VFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSF (2)
RRRDRAHREIKNIFYKAIQKRRQSQEKIDDILQTLLDATYK (0)
DGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCYLEQKT 
VCGENLPPLTYDQ (0)
TVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGE
KFAYVPFGA (1)
GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPVIRYKRRSK