Rhesus monkey P450s (Macaca mulatta)
(in progress, 49 seqs so far)
Jan. 27, 2006
Each rhesus sequence is followed by a BLAST alignment, with the Macaca sequence on top (Query) and the human sequence below (Sbjct). A FASTA-format collection of the rhesus P450s is given at the bottom of this file. One surprising finding is that two genes that are pseudogenes in humans are not pseudogenes in rhesus monkey: CYP2G2P and CYP1A8P are both full-length, intact genes.
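As a rough illustration of how the Identities and Gaps figures in the BLAST reports below relate to the aligned Query and Sbjct rows, here is a minimal Python sketch. The names identity_stats and percent are hypothetical helpers, not part of BLAST. The truncating percentage is an assumption based on the reports in this file, which appear to truncate rather than round (472/510 is shown as 92%).

```python
def identity_stats(query: str, sbjct: str) -> tuple[int, int, int]:
    """Return (identities, alignment_length, gap_columns) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have equal length")
    # A column is an identity when both rows carry the same residue (not a gap).
    identities = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    # A column is a gap column when either row carries '-'.
    gaps = sum(1 for q, s in zip(query, sbjct) if q == "-" or s == "-")
    return identities, len(query), gaps


def percent(count: int, total: int) -> int:
    """Integer percentage, truncated (assumed to match the reports in this file)."""
    return 100 * count // total


# Fragment of the CYP1A8P alignment (rhesus on top, human below).
ident, length, gaps = identity_stats("IQEEIDGNIGLKPPRF", "IQEEI------RPPRF")
print(ident, length, gaps, percent(ident, length))  # 9 16 6 56
```

The same function can be applied to the full Query and Sbjct rows of any alignment below once the rows of each block are concatenated.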
>CYP1A1
AY635458.1
MLFRISMSATEFLLASLIFCLVFWVIRASRPRVPKGLKNPPGPW
GWPLIGHILTLGKNPHLALSRMSQRYGDVLQIRIGSTPVLVLSGLDTIRQALVQQGDD
FKGRPNLYSFTLISNGQSMSFGPDSGPVWAARRRLAQNGLKSFSIASDPASSSSCYLE
EHVSKEAEVLISKLQEQMAGPGHFNPYRYVVISVANVICAICFGQRYDHNHQELLSLV
NLSNNFGEVVGSGNPADFIPILRYLPNRSLNGFKDLNEKFHSFMQKMIKEHYKTFEKG
HIRDITDSLIEHCQEKQLDENANIQLSDEKIVNVVLDLFGAGFDTVTTAISWSLMYLV
TNPRVQRKIQEELDTVIGRSRRPRLSDRSHLPYMEAFILETFRHSSFVPFTIPHSTTR
DTSLKGFYIPKGRCVFVNQWQINHDQKLWVNPSEFLPERFITPDGAIDKVLSEKVILF
GLGKRKCIGETIARWEVFLFLAILLQRVEFSVPPGVKVDMTPIYGLTMKHACCEHFQM
QLRS
>CYP1A2
(Aggarwal)
MALSQSVPFSATELLLASAIFCLVFWVLRGSRPRVPKGLKSPPEPWGWPLLGHVLTLGKNPHLALARMSQLYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGNDFKGRPDLYSFTFITDGQSMSFSPDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAEALISRLQELMAGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNSHEFVESASSANPVDFFPILRYLPNPALQRFKAFNQRFRRFLQKTVQEHYQDFDK
(0)
NSVQDITGALFKHSKKGPRASGNLIPQEKTVNLVNDIFGA
(1)
GFDTIATAISWSLMYLVTKPEIQRKIQKEL
(1)
DAVIGRGRRPRLSDRPQLPYLEAFILETFRHSSFVPFTIPH
(2)
STTRDTTLNGFYIPRECCVFINQWQVNHDP
(2)
QLWGDPSEFRPERFLTAEGTTINKPLSEKIMLFGLGKRRCIGEVLGKWEVFLFLAILLQQLE
FSVPPGVKVDLTPIYGLTMKHARCEHFQAR
>CYP1A2 NM_000761
Length = 515
Score = 968 bits (2503), Expect = 0.0
Identities = 472/510 (92%), Positives = 493/510 (96%)
Query: 1   MALSQSVPFSATELLLASAIFCLVFWVLRGSRPRVPKGLKSPPEPWGWPLLGHVLTLGKN 60
           MALSQSVPFSATELLLASAIFCLVFWVL+G RPRVPKGLKSPPEPWGWPLLGHVLTLGKN
Sbjct: 1   MALSQSVPFSATELLLASAIFCLVFWVLKGLRPRVPKGLKSPPEPWGWPLLGHVLTLGKN 60
Query: 61  PHLALARMSQLYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGNDFKGRPDLYSFTFITDG 120
           PHLAL+RMSQ YGDVLQIRIGSTPVLVLS LDTIRQALVRQG+DFKGRPDLY+ T ITDG
Sbjct: 61  PHLALSRMSQRYGDVLQIRIGSTPVLVLSRLDTIRQALVRQGDDFKGRPDLYTSTLITDG 120
Query: 121 QSMSFSPDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAEALISRLQELM 180
           QS++FS DSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEA+ALISRLQELM
Sbjct: 121 QSLTFSTDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAKALISRLQELM 180
Query: 181 AGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNSHEFVESASSANPVDFFP 240
           AGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKN+HEFVE+ASS NP+DFFP
Sbjct: 181 AGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNTHEFVETASSGNPLDFFP 240
Query: 241 ILRYLPNPALQRFKAFNQRFRRFLQKTVQEHYQDFDKNSVQDITGALFKHSKKGPRASGN 300
           ILRYLPNPALQRFKAFNQRF  FLQKTVQEHYQDFDKNSV+DITGALFKHSKKGPRASGN
Sbjct: 241 ILRYLPNPALQRFKAFNQRFLWFLQKTVQEHYQDFDKNSVRDITGALFKHSKKGPRASGN 300
Query: 301 LIPQEKTVNLVNDIFGAGFDTIATAISWSLMYLVTKPEIQRKIQKELDAVIGRGRRPRLS 360
           LIPQEK VNLVNDIFGAGFDT+ TAISWSLMYLVTKPEIQRKIQKELD VIGR RRPRLS
Sbjct: 301 LIPQEKIVNLVNDIFGAGFDTVTTAISWSLMYLVTKPEIQRKIQKELDTVIGRERRPRLS 360
Query: 361 DRPQLPYLEAFILETFRHSSFVPFTIPHSTTRDTTLNGFYIPRECCVFINQWQVNHDPQL 420
           DRPQLPYLEAFILETFRHSSF+PFTIPHSTTRDTTLNGFYIP++CCVF+NQWQVNHDP+L
Sbjct: 361 DRPQLPYLEAFILETFRHSSFLPFTIPHSTTRDTTLNGFYIPKKCCVFVNQWQVNHDPEL 420
Query: 421 WGDPSEFRPERFLTAEGTTINKPLSEKIMLFGLGKRRCIGEVLGKWEVFLFLAILLQQLE 480
           W DPSEFRPERFLTA+GT INKPLSEK+MLFG+GKRRCIGEVL KWE+FLFLAILLQQLE
Sbjct: 421 WEDPSEFRPERFLTADGTAINKPLSEKMMLFGMGKRRCIGEVLAKWEIFLFLAILLQQLE 480
Query: 481 FSVPPGVKVDLTPIYGLTMKHARCEHFQAR 510
           FSVPPGVKVDLTPIYGLTMKHARCEH QAR
Sbjct: 481 FSVPPGVKVDLTPIYGLTMKHARCEHVQAR 510
>CYP1A8P
ortholog, possibly a functional gene in rhesus
MILNLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQLSPPGPWSFPIIGNLLQLGEHPYL
TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLLKDGEHFAGRPNMHTFSFLAEGKSL
SFSVNYGESWKLHKKIASKALRTLSNAEAKSSTCSCLLEEHVTEEVSELVTVFVELSSKN
GGFDPRNAITCAVANVVCALCFGKRYDHSDEEFLKIVKTNDDLLKASSAANPADFIPCLR
YLPLQIINAPREFYRALNGFIALHVQDHLATYDK
(0)
DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVNDLFGA
(1)
GFETVSTCLYWSFLYLIHYPEIQAKIQEEI
(1)
DGNIGLKPPRFEDRKILPYTEAFISEVFRHASFLPFTIPHC
(2)
TTADTTLNGYFIPRKTCTFINMYQVNHDE
(2)
TIWDNPSLFRPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITAVLQQL
KLKKCPRAKLDLTPTYGLVMRPKPYQLEAERRSSGSSSA
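The parenthesized digits between sequence segments, (0), (1) and (2), look like intron phase markers at exon boundaries. This is an assumption, since the file does not define them. A minimal Python sketch for recovering the continuous protein sequence and the list of markers from such a record follows; the helper name split_exons is hypothetical.

```python
import re

# Matches the boundary markers "(0)", "(1)", "(2)" used in the records.
PHASE = re.compile(r"\(([012])\)")


def split_exons(record_body: str) -> tuple[str, list[int]]:
    """Return (protein, phases) from sequence text that contains boundary markers."""
    phases = [int(p) for p in PHASE.findall(record_body)]
    # Drop the markers, then remove line breaks and spaces.
    protein = re.sub(r"\s+", "", PHASE.sub("", record_body))
    return protein, phases


protein, phases = split_exons("MILN\n(0)\nDHIR (1) GFET")
print(protein, phases)  # MILNDHIRGFET [0, 1]
```

The joined sequence is what would go into a FASTA collection such as the one mentioned at the top of this file.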
>CYP1A8P
NT_008580.9|Hs9_8737 chromosome 9 Pseudogene 43% to 1A2
Length = 508
Score = 940 bits (2430), Expect = 0.0
Identities = 472/510 (92%), Positives = 486/510 (95%), Gaps = 2/510 (0%)
Query:
1
MILNLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQLSPPGPWSFPIIGNLLQLGEHPYL 60
MIL+LAVTPGEVTTSLIILVMVFVFVRALRSKGRKQ+SPPGP SFPII NLLQLG+HPYL
Sbjct:
1
MILDLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQVSPPGP*SFPIIENLLQLGDHPYL 60
Query:
61
TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLLKDGEHFAGRPNMHTFSFLAEGKSL 120
TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVL KDGEHFAGRPNMHTFSFLAEGKSL
Sbjct:
61 TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLHKDGEHFAGRPNMHTFSFLAEGKSL
120
Query:
121 SFSVNYGESWKLHKKIASKALRTLSNAEAKSSTCSCLLEEHVTEEVSELVTVFVELSSKN
180
SFSVNYGESWKLHKKIASKAL T SNAEAKSSTCSC LEEHVTEE+SELVTVFVEL+SKN
Sbjct:
121 SFSVNYGESWKLHKKIASKAL*TFSNAEAKSSTCSCSLEEHVTEEISELVTVFVELTSKN
180
Query:
181 GGFDPRNAITCAVANVVCALCFGKRYDHSDEEFLKIVKTNDDLLKASSAANPADFIPCLR 240
G
FDPRNAITC VAN+VCALCFGKR DHSDEEFL+IVKTNDDLLKASSAANPADFIPCL
Sbjct:
181 GSFDPRNAITCVVANIVCALCFGKR*DHSDEEFLRIVKTNDDLLKASSAANPADFIPCLH 240
Query:
241 YLPLQIINAPREFYRALNGFIALHVQDHLATYDKDHIRDITDALINVCHNKYAATKTDTL 300
YLPL+IINAP EFY+ALNGFIALHVQDHLATY KDHIRDITDALINVCHNKYAATKTDTL
Sbjct:
241 YLPLKIINAPLEFYQALNGFIALHVQDHLATYGKDHIRDITDALINVCHNKYAATKTDTL 300
Query:
301 NDSEIISTVNDLFGAGFETVSTCLYWSFLYLIHYPEIQAKIQEEIDGNIGLKPPRFEDRKILPY 360
NDSEIISTV+DLFGAGFETVSTCL WSFLYLIHYPEIQA+IQEEI +PPRFEDRKILPY
Sbjct:
301 NDSEIISTVSDLFGAGFETVSTCLCWSFLYLIHYPEIQARIQEEI------RPPRFEDRKILPY 358
Query:
361 TEAFISEVFRHASFLPFTIPHCTTADTTLNGYFIPRKTCTFINMYQVNHDETIWDNPSLF 420
TEAF+SEVFRHASFLPFTIPH TTADTTLNGYFIPRKTCTFINMYQVNHDETIWDN SLF
Sbjct:
359 TEAFVSEVFRHASFLPFTIPHSTTADTTLNGYFIPRKTCTFINMYQVNHDETIWDNHSLF 418
Query:
421 RPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITAVLQQLKLKKCPRAK 480
RPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFIT VLQQ KLKK PRAK
Sbjct:
419 RPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITTVLQQFKLKK*PRAK 478
Query:
481 LDLTPTYGLVMRPKPYQLEAERRSSGSSSA 510
LDLTPTYGLVMRPK YQL+AE
SGSSSA
Sbjct:
479 LDLTPTYGLVMRPKLYQLQAELHPSGSSSA 508
>CYP1B1
MGTGLSPKDPWPLNLLSTQQTTLLLLLSVLVAVHVGQWLLRQRRRQLGSTPPGPFAWPLI
GNAAAVGQASHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPSF
ASFRVISGGRSMAFGHYSEHWKVQRRAAHSTMRNFSTRQLRSRQVLEGHVLSEARELVAL
LVRGSADGAFLDPRQLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL
VDVMPWLQYFPNPMRTAFREFEQLNRNFSNFVLDKFLRHCESLRPGAAPRDMMDAFILSA
EKKAARDSDDGGARLDLENVPATVTDIFGASQDTLSTALQWLLLLFIR
(2)
YPDVQARVQAELDQVVGRDRLPCMDDQPNLPYVLAFLYEAMRFSSFVPVTIPHATNANTS
VLGYHIPKDTVIFVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKR
RCIGEELSKMQLFLFISILAHQCNFRANPNGPEMNFSYGLTIKPKSFKVNVTLRESMELL
DSAVQKLQAEETCQ
>CYP1B1
NM_000104
Length = 543
Score = 1021 bits (2639), Expect = 0.0
Identities = 509/543 (93%), Positives = 519/543 (95%), Gaps = 1/543 (0%)
Query:
1
MGTGLSPKDPWPLNLLSTQQTTLLLLLSVLVAVHVGQWLLRQRRRQLGSTPPGPFAWPLI 60
MGT
LSP DPWPLN LS QQTTLLLLLSVL VHVGQ
LLRQRRRQL S PPGPFAWPLI
Sbjct:
1
MGTSLSPNDPWPLNPLSIQQTTLLLLLSVLATVHVGQRLLRQRRRQLRSAPPGPFAWPLI 60
Query:
61
GNAAAVGQASHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPSF 120
GNAAAVGQA+HLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRP+F
Sbjct:
61
GNAAAVGQAAHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPAF 120
Query:
121 ASFRVISGGRSMAFGHYSEHWKVQRRAAHSTMRNFSTRQLRSRQVLEGHVLSEARELVAL 180
ASFRV+SGGRSMAFGHYSEHWKVQRRAAHS MRNF TRQ RSRQVLEGHVLSEARELVAL
Sbjct:
121 ASFRVVSGGRSMAFGHYSEHWKVQRRAAHSMMRNFFTRQPRSRQVLEGHVLSEARELVAL 180
Query:
181 LVRGSADGAFLDPRQLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL 240
LVRGSADGAFLDPR LTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL
Sbjct:
181 LVRGSADGAFLDPRPLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL 240
Query:
241 VDVMPWLQYFPNPMRTAFREFEQLNRNFSNFVLDKFLRHCESLRPGAAPRDMMDAFILSA 300
VDVMPWLQYFPNP+RT FREFEQLNRNFSNF+LDKFLRHCESLRPGAAPRDMMDAFILSA
Sbjct:
241 VDVMPWLQYFPNPVRTVFREFEQLNRNFSNFILDKFLRHCESLRPGAAPRDMMDAFILSA 300
Query:
301 EKKAARDSDDGGARLDLENVPATVTDIFGASQDTLSTALQWLLLLFIRYPDVQARVQAEL 360
EKKAA
DS
GGARLDLENVPAT+TDIFGASQDTLSTALQWLLLLF RYPDVQ RVQAEL
Sbjct:
301 EKKAAGDSHGGGARLDLENVPATITDIFGASQDTLSTALQWLLLLFTRYPDVQTRVQAEL 360
Query:
361 DQVVGRDRLPCMDDQPNLPYVLAFLYEAMRFSSFVPVTIPHATNANTSVLGYHIPKDTVI 420
DQVVGRDRLPCM DQPNLPYVLAFLYEAMRFSSFVPVTIPHAT ANTSVLGYHIPKDTV+
Sbjct:
361 DQVVGRDRLPCMGDQPNLPYVLAFLYEAMRFSSFVPVTIPHATTANTSVLGYHIPKDTVV 420
Query:
421 FVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQL 480
FVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQL
Sbjct:
421 FVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQL 480
Query:
481 FLFISILAHQCNFRANPNGP-EMNFSYGLTIKPKSFKVNVTLRESMELLDSAVQKLQAEE
539
FLFISILAHQC+FRANPN P +MNFSYGLTIKPKSFKVNVTLRESMELLDSAVQ LQA+E
Sbjct:
481 FLFISILAHQCDFRANPNEPAKMNFSYGLTIKPKSFKVNVTLRESMELLDSAVQNLQAKE
540
Query:
540 TCQ 542
TCQ
Sbjct:
541 TCQ 543
>CYP2A23 AY635459.1
MLASGLLLVALLACLTVMVLMSVWQQRNSKGKLPPGPTPLPFIG
NYLQLNTEQMYNSIMKISERYGPVFTIHLGPRRIVVLCGYDAVKEALVDQAEEFSGRG
EQATFDWLFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIEAL
RDTQGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSAGQ
LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNRRTLDPNSPRDFIDSFLIRMQ
EEEKNPNTEFHLKNLVLTSLNLFFGGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRV
IGKNRQPKFEDQARMPYMEAVIHEIQRFGDMLPLGVAHRVIKDTKFRDFFLPKGTEVF
PMLGSVLKDPKFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSIGKRNCFGEGLARMELF
LFFTTIMQNFRFKSPQSPKDIDVSPKHVGFATIPPNYTMSFLPR
>CYP2A24
AY635460 (Y. Peng)
CYP2A6 best match = CYP2A24 (Y.Peng) partial
MLASGLLLVALLACLTVMVLMSVWQQRNSKGKLPPGPTPLPFIG
NYLQLNTEQMCNSLMKISERYGPVFTIHLGPRRVVVLCGYDAVKEALVDQAEEFSGRG
EQATFDWVFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIEAL
RDTHGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLGMMLGSFQFTSTSTGQ
LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNQHTLDPNSPRDFIDSFLIRMQE
EEKNPNTEFYLKNLMMTTLNLFIAGTETVSTTLRYGFLLLMKYPEVE
(1)
AKVHEEIDRVIGKNRQPKFEDRVKMPYMEAVIHEIQRFGDVIPMSLARRVNKDTKFRDFFLPK
(0)
GTEVFPMLGSVLRDPRFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSI
(1)
GKRNCFGEGLARMELFLFFTTIMQNFRFKSPQLPKDIDVSPKHVGFATIPPNYTMSFLPR
Score = 1078 (379.5 bits), Expect = 1.2e-114, P = 1.2e-114
Identities = 204/217 (94%), Positives = 209/217 (96%)
Query: 2   EEKNPNTEFYLKNLVMTTLNLFIGGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRVIGK 61
           EEKNPNTEFYLKNL+MTTLNLFI GTETVSTTLRYGFLLLMK+PEVEAKVHEEIDRVIGK
Sbjct: 1   EEKNPNTEFYLKNLMMTTLNLFIAGTETVSTTLRYGFLLLMKYPEVEAKVHEEIDRVIGK 60
Query: 62  NRQPKFEDRAKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPKGTEVYPMLGS 121
           NRQPKFEDR KMPYMEAVIHEIQRFGDVIPMSLARRV KDTKFRDFFLPKGTEV+PMLGS
Sbjct: 61  NRQPKFEDRVKMPYMEAVIHEIQRFGDVIPMSLARRVNKDTKFRDFFLPKGTEVFPMLGS 120
Query: 122 VLRDPSFFSNPQDFNPQHFLNEKGQFKKSDAFVPFSIGKRNCFGEGLARMELFLFFTTVM 181
           VLRDP FFSNPQDFNPQHFL+EKGQFKKSDAFVPFSIGKRNCFGEGLARMELFLFFTT+M
Sbjct: 121 VLRDPRFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSIGKRNCFGEGLARMELFLFFTTIM 180
Query: 182 QNFRLKSSQSPKDIDVSPKHVGFATIPRNYTMSFLPR 218
           QNFR KS Q PKDIDVSPKHVGFATIP NYTMSFLPR
Sbjct: 181 QNFRFKSPQLPKDIDVSPKHVGFATIPPNYTMSFLPR 217
>CYP2C43
AB212264.1
MDSLVVLVLCLSCLLLLSLWRQRSGRGKLPPGPTPLPVIGNILK
IGIKDVSKSLTNLSKVYGPVFTLYFGLERMVVLHGYEAVKEALIDLGEEFSGRGHFPL
FERANRRFGLVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK
ASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKFNENAKILSSPWIQIYNN
FSPIIDYFPGTHNKLLKNIAFVKSYILEKVKEHQESMDMNNPRDFIDCFLIKMEKEKH
NQQSEFNIENLENTAVDLFAAGTETTSTTLRYALLLLLKHPEVAAKVQEEIEHVIGRN
RSPCMQDRSRMPYTDAVVHEIQRYIDLLPTSVPHAVTCDVKFRNYLIPKGTTILISLT
SVLRDNKEFPNPEMFDPRHFLDEGGNFKNSNYFMPFSAGKRICVGEALARMELFLFLT
SILQNFNLKSLVDLKDLDTTPVFNGFVSVPPIYQLCFIPV
>CYP2C74 variant (S.Sarva) missing exon 1, 4 aa diffs to 2C74
FSKVYGPVFTVYFGMNPVVVLHGYETVKEALIDNAEEFSGRGILPISERITNGL
(1)
GIISSNGKRWKETRRFSLTTLRNFGMGKRSIEDRVQEEARCLVEELRKTK
(1)
ASPCDPTFILGCAPCNVICSVVFQKRFDYKDENFLTLMKRFTVNFRILTSPWIQ
(0)
VCNNFPLLIDCFPGTHNKLLKNVALTKSYIRKKVKEHQATLDVNNPRDFIDCFLIKMEQ
(0)
EKDNQQSEFTIENLVGTVADLFVAGTETTSTTLRYGLLLLLKHPEVT
(1)
AKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVIHEIQRYIDLVPTGVPHAVTTDIKFRNYLIPK
(0)
GTIIITLLTSVLQDDKEFPNPKIFDPGHFLDENGNFKKSDYFMPFSA
(1)
GKRICAGEGLARMELFLFLTTILQNFNLKSVADLKNLNTTSATRGIISLPPSYQICFIPV
>CYP2C8 M17397
Length = 490
Score = 823 bits (2126), Expect = 0.0
Identities = 398/434 (91%), Positives = 413/434 (95%)
Frame = +1
Query: 1    FSKVYGPVFTVYFGMNPVVVLHGYETVKEALIDNAEEFSGRGILPISERITNGLGIISSN 180
            FSKVYGPVFTVYFGMNP+VV HGYE VKEALIDN EEFSGRG  PIS+RIT GLGIISSN
Sbjct: 57   FSKVYGPVFTVYFGMNPIVVFHGYEAVKEALIDNGEEFSGRGNSPISQRITKGLGIISSN 116
Query: 181  GKRWKETRRFSLTTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCN 360
            GKRWKE RRFSLT LRNFGMGKRSIEDRVQEEA CLVEELRKTKASPCDPTFILGCAPCN
Sbjct: 117  GKRWKEIRRFSLTNLRNFGMGKRSIEDRVQEEAHCLVEELRKTKASPCDPTFILGCAPCN 176
Query: 361  VICSVVFQKRFDYKDENFLTLMKRFTVNFRILTSPWIQVCNNFPLLIDCFPGTHNKLLKN 540
            VICSVVFQKRFDYKD+NFLTLMKRF  NFRIL SPWIQVCNNFPLLIDCFPGTHNK+LKN
Sbjct: 177  VICSVVFQKRFDYKDQNFLTLMKRFNENFRILNSPWIQVCNNFPLLIDCFPGTHNKVLKN 236
Query: 541  VALTKSYIRKKVKEHQATLDVNNPRDFIDCFLIKMEQEKDNQQSEFTIENLVGTVADLFV 720
            VALT+SYIR+KVKEHQA+LDVNNPRDF+DCFLIKMEQEKDNQ+SEF IENLVGTVADLFV
Sbjct: 237  VALTRSYIREKVKEHQASLDVNNPRDFMDCFLIKMEQEKDNQKSEFNIENLVGTVADLFV 296
Query: 721  AGTETTSTTLRYGLLLLLKHPEVTAKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVIHEIQ 900
            AGTETTSTTLRYGLLLLLKHPEVTAKVQEEIDHVIGRHRSPCMQDRSHMPYTDAV+HEIQ
Sbjct: 297  AGTETTSTTLRYGLLLLLKHPEVTAKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVVHEIQ 356
Query: 901  RYIDLVPTGVPHAVTTDIKFRNYLIPKGTIIITLLTSVLQDDKEFPNPKIFDPGHFLDEN 1080
            RY DLVPTGVPHAVTTD KFRNYLIPKGT I+ LLTSVL DDKEFPNP IFDPGHFLD+N
Sbjct: 357  RYSDLVPTGVPHAVTTDTKFRNYLIPKGTTIMALLTSVLHDDKEFPNPNIFDPGHFLDKN 416
Query: 1081 GNFKKSDYFMPFSAGKRICAGEGLARMELFLFLTTILQNFNLKSVADLKNLNTTSATRGI 1260
            GNFKKSDYFMPFSAGKRICAGEGLARMELFLFLTTILQNFNLKSV DLKNLNTT+ T+GI
Sbjct: 417  GNFKKSDYFMPFSAGKRICAGEGLARMELFLFLTTILQNFNLKSVDDLKNLNTTAVTKGI 476
Query: 1261 ISLPPSYQICFIPV 1302
            +SLPPSYQICFIPV
Sbjct: 477  VSLPPSYQICFIPV 490
>CYP2C75 AY635463.1
MDSLVVLVLCLSCLLLLSLWRQRSGRGKFPPGPTPLPVIGNILQ
IDIKDVSKSLTNLSKVYGPVFTLYFGLERMVVLHGYEAVKEALIDLGEEFSGRGHFPL
ADRANRGFGIVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK
GSPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLKVMEKLNENVKILSSPWIQICNN
FPPFIDYFPGAHNKLLKNIAFLKSYILEKVKEHQESMDMNNPRDFIDCFLMKMEKEKH
NQQSEFTIENLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRN
RSPCMQDRSHMPYTDAVVHEIQRYIDLLPTNLPHAVTCDVKFRNYLIPKGTTILISLT
SVLHDNKEFPNPEMFDPRHFLDEGGNFKKSNYFMPFSAGKRICVGEALARMELFLFLT
SVLQNFNLKSLVDPKDLDTTPVVNGFASVPPFYQLCFIPV
>CYP2C19
best hit (N. Abdeltawab), same as Liao’s hit below
MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNVSKV
YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW
KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKTASPCDPTFILGCAPCNVICSV
IFHNRFDYKDQRFLNLMEKFNENLRILSSPWIQ (1)
CNNFPALIDYLPGSHNKVVKNFAYVKS
YVLERIKEHQESLDMDNPRDFIDCFLIKMEEKHNLQSEFTIESLIATVTDMFGAGTETTS
TTLRFGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYIDLIP
TNLPHAVTCDVKFRNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDRSGNFKKSD
YFMPFSAGKRMCVGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVPPLY
QLCFIPV
>CYP2C18
M61856
Length = 490
Score = 951 bits (2459), Expect = 0.0
Identities = 468/490 (95%), Positives = 481/490 (98%), Gaps = 3/490 (0%)
Query:
1
MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNVSKV 60
MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTN SKV
Sbjct:
1
MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNFSKV 60
Query:
61
YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW 120
YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGE+FSGRGSFPVAEKVNKGLGILFSNGKRW
Sbjct:
61
YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEEFSGRGSFPVAEKVNKGLGILFSNGKRW 120
Query:
121 KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKT-ASPCDPTFILGCAPCNVICS 179
KEIRRF
LMTLRNFGMGKRSIEDRVQEEA CLVEELRKT ASPCDPTFILGCAPCNVICS
Sbjct:
121 KEIRRFCLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTNASPCDPTFILGCAPCNVICS 180
Query:
180 VIFHNRFDYKDQRFLNLMEKFNENLRILSSPWIQ-CNNFPALIDYLPGSHNKVVKNFAYV 238
VIFH+RFDYKDQRFLNLMEKFNENLRILSSPWIQ CNNFPALIDYLPGSHNK+ +NFAY+
Sbjct:
181 VIFHDRFDYKDQRFLNLMEKFNENLRILSSPWIQVCNNFPALIDYLPGSHNKIAENFAYI 240
Query:
239 KSYVLERIKEHQESLDMDNPRDFIDCFLIKME-EKHNLQSEFTIESLIATVTDMFGAGTE 297
KSYVLERIKEHQESLDM++ RDFIDCFLIKME EKHN QSEFT+ESLIATVTDMFGAGTE
Sbjct:
241 KSYVLERIKEHQESLDMNSARDFIDCFLIKMEQEKHNQQSEFTVESLIATVTDMFGAGTE 300
Query:
298 TTSTTLRFGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYID 357
TTSTTLR+GLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYID
Sbjct:
301 TTSTTLRYGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYID 360
Query:
358 LIPTNLPHAVTCDVKFRNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDRSGNFK 417
L+PTNLPHAVTCDVKF+NYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLD+SGNFK
Sbjct:
361 LLPTNLPHAVTCDVKFKNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDKSGNFK 420
Query:
418 KSDYFMPFSAGKRMCVGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVP 477
KSDYFMPFSAGKRMC+GEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVP
Sbjct:
421 KSDYFMPFSAGKRMCMGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVP 480
Query:
478 PLYQLCFIPV 487
PLYQLCFIPV
Sbjct:
481 PLYQLCFIPV 490
>searched with 2C29; differs from 2C43, 2C74, 2C75 (Liao)
MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNFSKV
YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW
KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKTNASPCDPTFILGCAPCNVICS
VIFHNRFDYKDQRFLNLMEKFNENLRILSSPWIQ
(gap in seq, missing one exon)
EKHNLQSEFTIESLIATVTDMFGAG
TETTSTTLRFGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRY
IDLIPTNLPHAVTCDVKFRNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDRSGN
FKKSDYFMPFSAGKRMCVGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGR
VPPLYQLCFIPV
Length = 490
Score = 679 bits (1753), Expect = 0.0
Identities = 347/490 (70%), Positives = 386/490 (78%), Gaps = 58/490 (11%)
Query:
1
MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNFSKV 60
MD V LVLCLSCL
LLSLWRQSSGRG+LP GPTPLP+IGNILQ+ +KD+SKSLTN SKV
Sbjct:
1
MDSLVVLVLCLSCLLLLSLWRQSSGRGKLPPGPTPLPVIGNILQIGIKDISKSLTNLSKV 60
Query:
61
YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW 120
YGPVFT+YFGLKPIVVLHGYEAVKEALID
GE+FSGRG FP+AE+ N+G GI+FSNGK+W
Sbjct:
61
YGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAERANRGFGIVFSNGKKW 120
Query:
121 KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKTNASPCDPTFILGCAPCNVICS 180
KEIRRFSLMTLRNFGMGKRSIEDRVQEEA CLVEELRKT ASPCDPTFILGCAPCNVICS
Sbjct:
121 KEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICS 180
Query:
181 VIFH--------------------------------NRF----DY---------KDQRFL 195
+IFH
N F DY K+
F+
Sbjct:
181 IIFHKRFDYKDQQFLNLMEKLNENIKILSSPWIQICNNFSPIIDYFPGTHNKLLKNVAFM 240
Query:
196 N--LMEKFNENLRIL--SSPW---------IQVEKHNLQSEFTIESLIATVTDMFGAGTE 242
++EK E+ + ++P ++ EKHN SEFTIESL T D+FGAGTE
Sbjct:
241 KSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHNQPSEFTIESLENTAVDLFGAGTE 300
Query:
243 TTSTTLRFGLLLLLKYPEVTAKVQEEIECVVGRNRSPCMQDRSHMPYTDAVVHEIQRYID 302
TTSTTLR+ LLLLLK+PEVTAKVQEEIE V+GRNRSPCMQDRSHMPYTDAVVHE+QRYID
Sbjct:
301 TTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYID 360
Query:
303 LIPTNLPHAVTCDVKFRNYLIPKGTTIITSLTSVLHNDKEFPNPEMFDPGHFLDRSGNFK 362
L+PT+LPHAVTCD+KFRNYLIPKGTTI+ SLTSVLH++KEFPNPEMFDP HFLD GNFK
Sbjct:
361 LLPTSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFK 420
Query:
363 KSDYFMPFSAGKRMCVGEGLARMELFLFLTTILQNFNLKSQVDPKDIDITPIANAFGRVP 422
KS
YFMPFSAGKR+CVGE LA MELFLFLT+ILQNFNLKS VDPK++D TP+ N F VP
Sbjct:
421 KSKYFMPFSAGKRICVGEALAGMELFLFLTSILQNFNLKSLVDPKNLDTTPVVNGFASVP 480
Query:
423 PLYQLCFIPV 432
P YQLCFIPV
Sbjct:
481 PFYQLCFIPV 490
>CYP2B30
AY635461.1
MELSVLLFLALLTGLLLLLVQRHPNAHGRLPPGPCPLPLLGNLL
QMDRRGLLRSFLRFREKYGDVFTVYLGPRPVVMLCGVEAIREALVDNAEAFSGRGKIA
ITDPVFQGYGVVFANGNRWKVLRRFSLTTMRDFGMGKRSVEERIQEEAQCLIEELRKS
KGALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFE
LLSGFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEK
SNPHSEFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAERIYKEIEQVIGP
HRPPALDDRAKMPYTEAVIHEIQRFADLLPMGVPHIVTQQTSFRGYIIPKDTEVFPLL
STALHDPHYFEKPDTFNPDHFLDANGALKKNEAFIPFSLGRRMCLGEGIARNELFLFF
TTILQNFSVASPVAPEDIDLTPQESGVGKIPPTYQIRFLPR
>CYP2B6 search (M.Puljic) partial
GALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFELLS
GFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEKSNPHS
EFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAER
Identities = 162/162 (100%), Positives = 162/162 (100%), Gaps = 0/162 (0%)
Query 1   GALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFELLS 60
          GALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFELLS
Sbjct 162 GALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFELLS 221
Query 61  GFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEKSNPHS 120
          GFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEKSNPHS
Sbjct 222 GFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEKSNPHS 281
Query 121 EFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAER 162
          EFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAER
Sbjct 282 EFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAER 323
>CYP2F6 AY952296.1
MDSISTAILLLLLALVCLLLTLSSRDKXKLPPGPRPLPLLGNLLLLRSQNMLTSLTQ
LSKEYGSVYTVHLGPRRVVVLSGYQAVKEALVDQGEEFSGRGDYPVFFNFTKGN
GIAFSNGDRWKVLRRFSIQILRNFGMGKRSIEERILEEGSFLLAELRKTE
GEPFDPTFVLSRSVSNIICSVLFGSRFDYDDERLLTVIRLINDNFQIMSSPWGE
(0)
LYNIFPSLLDWVPGPHQRIFQNFKRLRDLIAHRVHDQQASLDPRSPRDFIDCFLTKMAE
EKEDPLSHFHMDTLLMTTHNLLFGGTETVGTTLRHAFLALMKYPKVQ
ARVQEEIDLVVGRTRLPTLEDRAAMPYTDAVIHEVQRFADIIPMNLPHRVIRDTAFRXFLIPK
GTDIITLLNTVHYDPSQFLXPQEFNPEHFLDANQSFKKSPAFMPFSA
GRRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRSFQLCLCPR
>CYP2D42 (Vasser) also AY635464.1
MELDALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ
(0)
LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRPPVPINQVLGVGPRSQ
(1)
GVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFTDQA
(1)
GRPFRPNSLLDKAVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESGFLRE
(0)
VLNAVPLLLRIPGLAGKVLRSQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEK
(0)
AKGNPESSFNEENLRIVVADLFSAGMVTTSTTLAWGLLLMILHPDVQR
(1)
RVQQEIDNVIGQVRRPEMGDQARMPYTTAVIHEVQRFGDIVPLGVTHMTSRDIELQGFLIPK
(0)
GTTLFTNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVKPEAFLPFSA
(1)
GRRACLGEPLARMELFLFFTCLLQRFSFSVPAGQPRPSHHGVFAFLVTPSPYELCAVPR*
>CYP2D6 NM_000106
Length = 497
Score = 940 bits (2430), Expect = 0.0
Identities = 465/497 (93%), Positives = 476/497 (95%)
Query: 1   MELDALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ 60
           M L+ALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDF+NTPYCFDQ
Sbjct: 1   MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFQNTPYCFDQ 60
Query: 61  LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRPPVPINQVLGVGPRSQGVF 120
           LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVT GEDTADRPPVPI Q+LG GPRSQGVF
Sbjct: 61  LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTHGEDTADRPPVPITQILGFGPRSQGVF 120
Query: 121 LARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFTDQAGRPFRPNSLLDK 180
           LARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAF + +GRPFRPN LLDK
Sbjct: 121 LARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFANHSGRPFRPNGLLDK 180
Query: 181 AVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESGFLREVLNAVPLLLRIPGLAGKV 240
           AVSNVIASLT GRRFEYDDPRFLRL DL  E LKEESGFLREVLNAVP+LL IP LAGKV
Sbjct: 181 AVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKEESGFLREVLNAVPVLLHIPALAGKV 240
Query: 241 LRSQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFNEENLRIVVA 300
           LR QKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFN+ENLRIVVA
Sbjct: 241 LRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFNDENLRIVVA 300
Query: 301 DLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEIDNVIGQVRRPEMGDQARMPYTTAVI 360
           DLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEID+VIGQVRRPEMGDQA MPYTTAVI
Sbjct: 301 DLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEIDDVIGQVRRPEMGDQAHMPYTTAVI 360
Query: 361 HEVQRFGDIVPLGVTHMTSRDIELQGFLIPKGTTLFTNLSSVLKDEAVWEKPFRFHPEHF 420
           HEVQRFGDIVPLG+THMTSRDIE+QGF IPKGTTL TNLSSVLKDEAVWEKPFRFHPEHF
Sbjct: 361 HEVQRFGDIVPLGMTHMTSRDIEVQGFRIPKGTTLITNLSSVLKDEAVWEKPFRFHPEHF 420
Query: 421 LDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFTCLLQRFSFSVPAGQPRPSHHGV 480
           LDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFT LLQ FSFSVP GQPRPSHHGV
Sbjct: 421 LDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFTSLLQHFSFSVPTGQPRPSHHGV 480
Query: 481 FAFLVTPSPYELCAVPR 497
           FAFLV+PSPYELCAVPR
Sbjct: 481 FAFLVSPSPYELCAVPR 497
>CYP2E1
AY635465.1
MSALGVSVALLVWVAVLLLVSIWRQVHSSWNLPPGPFPLPIIGN
LFQLELKNIPKSFTRLAQRFGPVFTLYVGSRRVVVVHGYKAVREVLLDHKDEFSGRGD
IPAFHAHRDRGIIFNNGPTWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRK
TQGQPFDPTFLIGCAPCNVIADILFRKHFDYNDEKFLRLMYLFNENFQLLSTPWLQLY
NNFPSLLHYLPGSHRKVMKNVAEIKEYVSERVKEHLQSLDPNCPRDLTDCLLVEMEKE
KHSAERLYTMDGITVTVADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG
PSRIPAIKDRQEMPYMDAVVHEIQRFITLVPSNLPHEATRDTIFRGYIIPKGTVIVPT
LDSVLYDNQEFPDPEKFKPEHFLDESGKFKYSDYFKPFSAGKRVCAGEGLARMELFLL
LSAILQHFNLKPLVDPKDIDISPVNIGFGCIPPRFKLCVIPRS
>CYP2E1
3 aa diffs, partial (F.Zhang)
MSALGVSVALLVWVAVLLLVSIWRQVHSSWNLPPGPFPLPIIGNLFQLEWKNIPKSFTRL
AQRFGPVFTLYVGSRRVVVVHGYKAVREVLLDHKDEFSGRGDIPAFHAHRDRGIIFNNRP
TWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRKTQGQPFDPTFLIGCSPCNVI
ADILFRKHFDYNDEKFLYNNFPSLLHYLPGSHRKVMKNVAEIKEYVSERVKEHLQSLDPN
CPRDLTDCLLVEMEKEKHSAERLYTMDGITVTVADLFFAGTETTSITLRYGLLILMKYPE
IE
>CYP2E1 J02843 Length = 493 Score = 576 bits (1484), Expect = e-167 Identities = 283/322 (87%), Positives = 294/322 (91%), Gaps = 20/322 (6%) Frame = -1 Query: 906 MSALGVSVALLVWVAVLLLVSIWRQVHSSWNLPPGPFPLPIIGNLFQLEWKNIPKSFTRL 727 MSALGV+VALLVW A LLLVS+WRQVHSSWNLPPGPFPLPIIGNLFQLE KNIPKSFTRLSbjct: 1 MSALGVTVALLVWAAFLLLVSMWRQVHSSWNLPPGPFPLPIIGNLFQLELKNIPKSFTRL 60 Query: 726 AQRFGPVFTLYVGSRRVVVVHGYKAVREVLLDHKDEFSGRGDIPAFHAHRDRGIIFNNRP 547 AQRFGPVFTLYVGS+R+VV+HGYKAV+E LLD+KDEFSGRGD+PAFHAHRDRGIIFNN PSbjct: 61 AQRFGPVFTLYVGSQRMVVMHGYKAVKEALLDYKDEFSGRGDLPAFHAHRDRGIIFNNGP 120 Query: 546 TWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRKTQGQPFDPTFLIGCSPCNVI 367 TWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRKTQGQPFDPTFLIGC+PCNVISbjct: 121 TWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRKTQGQPFDPTFLIGCAPCNVI 180 Query: 366 ADILFRKHFDYNDEKF--------------------LYNNFPSLLHYLPGSHRKVMKNVA 247 ADILFRKHFDYNDEKF LYNNFPS LHYLPGSHRKV+KNVASbjct: 181 ADILFRKHFDYNDEKFLRLMYLFNENFHLLSTPWLQLYNNFPSFLHYLPGSHRKVIKNVA 240 Query: 246 EIKEYVSERVKEHLQSLDPNCPRDLTDCLLVEMEKEKHSAERLYTMDGITVTVADLFFAG 67 E+KEYVSERVKEH QSLDPNCPRDLTDCLLVEMEKEKHSAERLYTMDGITVTVADLFFAGSbjct: 241 EVKEYVSERVKEHHQSLDPNCPRDLTDCLLVEMEKEKHSAERLYTMDGITVTVADLFFAG 300 Query: 66 TETTSITLRYGLLILMKYPEIE 1 TETTS TLRYGLLILMKYPEIESbjct: 301 TETTSTTLRYGLLILMKYPEIE 322
>CYP2G2P best hit (Li Chen)
Note: this does not look like a pseudogene; see red regions below.
exon 2 = trace archive file 456149111
MELGGAVTIFLALCLSCLLVLIAWKRMNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK
(0)
LKEKYGPLFTVYMGLWPVVVLCGHEAVKEALIDQADEFSGRGKLASIEQNFQGH
(1)
GVALANGERWRILRRFSLTILRDFGMGKRSIEERILEEASYLLEEFRKTK
(1)
GAPIDPTFLLSRTVSNVISSVVFGSRFDYEDKQFLNLLRLINESFIEMSTPWAQ
(0)
LYDMYSGIMQYLPGRHNRVYYLIEQLKDFIASRVKINEASFDSQNPRDFIDCFLIKMHQ
(0)
DKNNPRTEFNLKNLVLTALNLFFAGTETVSSTLRYGFLLLMKHPEVE
(1)
ARIHEEINQVIGPHRLPSVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNVIRDTQFRGYLLPK
(0)
GTDVFPLLGSVLKDPKYFRYPEAFYPQHFLDEQGRFKKNEAFVPFSS
(1)
GKRICLGEAMARMELFLYFTSILQNFSPRSLVPPADIDITPKLSGFGNIPPTYELCLVAR
Score = 933 bits (2411), Expect = 0.0
Identities = 463/496 (93%), Positives = 476/496 (95%), Gaps = 2/496 (0%)
Query:
1 MELGGAVTIFLALCLSCLLVLIAWKRMNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK
60
ME+GGAVTIFLALCLSCLL+LIAWK MNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK
Sbjct:
1 MEMGGAVTIFLALCLSCLLILIAWK*MNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK
60
Query:
61
LKEKYGPLFTVYMGLWPVVVLCGHEAVKEALIDQADEFSGRGKLASIEQNFQGHGVALAN 120
L+EKY
P+FTVYMG
PVVVLCGHEAVKEAL+DQADEFSGRG+LASI+QNFQGHGVALAN
Sbjct:
61
LREKYSPVFTVYMGPRPVVVLCGHEAVKEALVDQADEFSGRGELASIKQNFQGHGVALAN 120
Query:
121 GERWRILRRFSLTILRDFGMGKRSIEERILEEASYLLEEFRKTKGAPIDPTFLLSRTVSN
180
GERWRIL RF
LTILRDFGMGKRSIEERI EEASYLLEEFRKTKGAPIDP FLLSRTVSN
Sbjct:
121 GERWRIL*RFPLTILRDFGMGKRSIEERIQEEASYLLEEFRKTKGAPIDPIFLLSRTVSN
180
Query:
181 VISSVVFGSRFDYEDKQFLNLLRLINESFIEMSTPWAQLYDMYSGIMQYLPGRHNRVYYL 240
VISSVVF SRFDYEDKQFLNLLRLINESFIEMSTPWAQLYDMYSGIMQYLPGRHN +YYL
Sbjct:
181 VISSVVFRSRFDYEDKQFLNLLRLINESFIEMSTPWAQLYDMYSGIMQYLPGRHNLIYYL 240
Query:
241 IEQLKDFIASRVKINEASFDSQNPRDFIDCFLIKMHQDKNNPRTEFNLKNLVLTALNLFF 300
+E+LKDFIASRVKINEASFD QNPRDFIDCFLIKMHQDKNNPRTEFNLKNLVLT LNLFF
Sbjct:
241 VEELKDFIASRVKINEASFDPQNPRDFIDCFLIKMHQDKNNPRTEFNLKNLVLTTLNLFF 300
Query:
301 AGTETVSSTLRYGFLLLMKHPEVEARIHEEINQVIGPHRLPSVDDRVKMPYTDAVIHEIQ 360
AGTETVSSTLRYGFLLLMKHPEVEA+IHEEINQVIGPHRLP VDDRVKMPYTDAVIHEIQ
Sbjct:
301 AGTETVSSTLRYGFLLLMKHPEVEAKIHEEINQVIGPHRLPRVDDRVKMPYTDAVIHEIQ 360
Query:
361 RLVDIVPMGVPHNVIRDTQFRGYLLPKGTDVFPLLGSVLKDPKYFRYPEAFYPQHFLDEQ 420
RLVDIVPMGVPHN+IRDTQFRGYLLPKGTDVFPLLGSVLKDPKYFRYP+AFYPQHFLDEQ
Sbjct:
361 RLVDIVPMGVPHNLIRDTQFRGYLLPKGTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQ 420
Query:
421 GRFKKNEAFVPFSS--GKRICLGEAMARMELFLYFTSILQNFSPRSLVPPADIDITPKLS 478
GRFKKNEAFVPFSS GKRICLGEAM RMELFLYFTS
LQNFS SLVPP DIDITPKLS
Sbjct:
421 GRFKKNEAFVPFSSGRGKRICLGEAMDRMELFLYFTSTLQNFSLHSLVPPVDIDITPKLS 480
Query:
479 GFGNIPPTYELCLVAR 494
GFGNIPPTYELCLVAR
Sbjct:
481 GFGNIPPTYELCLVAR 496
>CYP2J2
best hit (Z. Zhang) partial
GLIMSSGQIWKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKI
NNAVSNIICSITFGERFDYQDSQFQELLKLLDEVTYLEASKTCQLYNIFPWLMKFLPGPH
QTLFSNWEKLKLFVSHMIEKHRKDWNPAETRDFIDAYLKEMSKHTGNSTSSFHEENLICS
TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQAEIDRVIGQGQQPSTAARESMPYTNA
VIHEVQRMGNIVPLNVPREVTVDTTLAGYHLPKRACLGEQLARTELFIFFTSLVQKFTFR
PPNNEKLSLKFRMGITISPVSHHLC
>CYP2J2 NM_000775 chr 1 Length = 497 Score = 618 bits (1594), Expect = e-180 Identities = 311/373 (83%), Positives = 319/373 (85%), Gaps = 48/373 (12%) Query: 1 GLIMSSGQIWKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKI 60 GLIMSSGQ WKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKISbjct: 125 GLIMSSGQAWKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKI 184 Query: 61 NNAVSNIICSITFGERFDYQDSQFQELLKLLDEVTYLEASKTCQLYNIFPWLMKFLPGPH 120 NNAVSNIICSITFGERF+YQDS FQ+LLKLLDEVTYLEASKTCQLYN+FPW+MKFLPGPHSbjct: 185 NNAVSNIICSITFGERFEYQDSWFQQLLKLLDEVTYLEASKTCQLYNVFPWIMKFLPGPH 244 Query: 121 QTLFSNWEKLKLFVSHMIEKHRKDWNPAETRDFIDAYLKEMSKHTGNSTSSFHEENLICS 180 QTLFSNW+KLKLFVSHMI+KHRKDWNPAETRDFIDAYLKEMSKHTGN TSSFHEENLICSSbjct: 245 QTLFSNWKKLKLFVSHMIDKHRKDWNPAETRDFIDAYLKEMSKHTGNPTSSFHEENLICS 304 Query: 181 TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQAEIDRVIGQGQQPSTAARESMPYTNA 240 TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQ EIDRVIGQGQQPSTAARESMPYTNASbjct: 305 TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQVEIDRVIGQGQQPSTAARESMPYTNA 364 Query: 241 VIHEVQRMGNIVPLNVPREVTVDTTLAGYHLP---------------------------- 272 VIHEVQRMGNI+P NVPREVTVDTTLAGYHLP Sbjct: 365 VIHEVQRMGNIIPQNVPREVTVDTTLAGYHLPKGTMILTNLTALHRDPTEWATPDTFNPD 424 Query: 273 --------------------KRACLGEQLARTELFIFFTSLVQKFTFRPPNNEKLSLKFR 312 KRACLGEQLARTELFIFFTSL+QKFTFRPPNNEKLSLKFRSbjct: 425 HFLENGQFKKREAFMPFSIGKRACLGEQLARTELFIFFTSLMQKFTFRPPNNEKLSLKFR 484 Query: 313 MGITISPVSHHLC 325 MGITISPVSH LCSbjct: 485 MGITISPVSHRLC 497
>CYP2R1
(G.Zhu) partial
IFSLDLGGISTVVLNGYDVVKECLVHQSGIFADRPCLPLFMKMTKMGGLLNSRYGQGWVE
HRRLAVNSFRYFGYGQKSFESKILEETKFFTDAIETYKGRPFDFKQLITSAVSNITNLII
FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNASVVYD
FLSRLIEKASVNRKPQLPQHFVDAYFDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT
TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKFKMPYTEAVLHEVLRFCNIV
PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK
EALVPFSLGRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL
ICAERR
Alignment of the two sequences, 97% identical
Query: 1   IFSLDLGGISTVVLNGYDVVKECLVHQSGIFADRPCLPLFMKMTKMGGLLNSRYGQGWVE 60
           IFSLDLGGISTVVLNGYDVVKECLVHQS IFADRPCLPLFMKMTKMGGLLNSRYG+GWV+
Sbjct: 76  IFSLDLGGISTVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMGGLLNSRYGRGWVD 135
Query: 61  HRRLAVNSFRYFGYGQKSFESKILEETKFFTDAIETYKGRPFDFKQLITSAVSNITNLII 120
           HRRLAVNSFRYFGYGQKSFESKILEETKFF DAIETYKGRPFDFKQLIT+AVSNITNLII
Sbjct: 136 HRRLAVNSFRYFGYGQKSFESKILEETKFFNDAIETYKGRPFDFKQLITNAVSNITNLII 195
Query: 121 FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNASVVYD 180
           FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNA+VVYD
Sbjct: 196 FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNAAVVYD 255
Query: 181 FLSRLIEKASVNRKPQLPQHFVDAYFDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT 240
           FLSRLIEKASVNRKPQLPQHFVDAY DEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT
Sbjct: 256 FLSRLIEKASVNRKPQLPQHFVDAYLDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT 315
Query: 241 TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKFKMPYTEAVLHEVLRFCNIV 300
           TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDK KMPYTEAVLHEVLRFCNIV
Sbjct: 316 TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKCKMPYTEAVLHEVLRFCNIV 375
Query: 301 PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK 360
           PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK
Sbjct: 376 PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK 435
Query: 361 EALVPFSLGRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL 420
           EALVPFSLGRRHCLGE LARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL
Sbjct: 436 EALVPFSLGRRHCLGEHLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL 495
Query: 421 ICAERR 426
           ICAERR
Sbjct: 496 ICAERR 501
>CYP2S1
exons 2,3 from CO649282.1, gene fragmented on multiple scaffolds
MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMR
(0)
LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH
(1)
GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE
(1)
GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAMVRAAGGTLLGVSSRGGQ
(0)
TYEMCSWFLWPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ
(0)
EEQNPDTEFTNKNMLMTVIYLLFAGTMTVSATVGYTLLLLMKYPHVQ
(1)
KRVREELTQELGSGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ
(0)
GTEVFPLLGSILHDPSIFKHPEEFNPDHFLDADGRFRKHEAFLPFSL
(1)
GKRVCLGEGLAKAELFLFFTTILQAFSLESPCPLDSLSLKPTISGLFNIPPAFQLQVRPTDLHSTTQTT*
>CYP2S1
AC011510 one exon per line 78% to mouse 2s1 49% to 2B6 47% to 2A13
Length = 504
Score = 970 bits (2508), Expect = 0.0
Identities = 485/503 (96%), Positives = 494/503 (98%)
Query:
1
MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMRL 60
MEATGTWALLLALALLLLLTLALSGTRARG LPPGPTPLPLLGNLLQLRPGALYSGLMRL
Sbjct:
1
MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMRL 60
Query:
61
SKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFSN 120
SKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFSN
Sbjct:
61
SKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGHGVFFSN 120
Query:
121 GERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATSN 180
GERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATSN
Sbjct:
121 GERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTEGRPFDPSLLLAQATSN 180
Query:
181 VVCSLLFGLRFSYEDKEFQAMVRAAGGTLLGVSSRGGQTYEMCSWFLWPLPGPHKQLLHH 240
VVCSLLFGLRFSYEDKEFQA+VRAAGGTLLGVSS+GGQTYEM SWFL PLPGPHKQLLHH
Sbjct:
181 VVCSLLFGLRFSYEDKEFQAVVRAAGGTLLGVSSQGGQTYEMFSWFLRPLPGPHKQLLHH 240
Query:
241 VSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQEEQNPDTEFTNKNMLMTVIYLL 300
VSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQEEQNP TEFTNKNMLMTVIYLL
Sbjct:
241 VSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQEEQNPGTEFTNKNMLMTVIYLL 300
Query:
301 FAGTMTVSATVGYTLLLLMKYPHVQKRVREELTQELGSGQAPSLGDRTRLPYTDAVLHEA 360
FAGTMTVS TVGYTLLLLMKYPHVQK VREEL +ELG+GQAPSLGDRTRLPYTDAVLHEA
Sbjct:
301 FAGTMTVSTTVGYTLLLLMKYPHVQKWVREELNRELGAGQAPSLGDRTRLPYTDAVLHEA 360
Query:
361 QRLLALVPMGIPRTLMRTTRFRGYTLPQGTEVFPLLGSILHDPSIFKHPEEFNPDHFLDA 420
QRLLALVPMGIPRTLMRTTRFRGYTLPQGTEVFPLLGSILH+P+IFKHPEEFNPD FLDA
Sbjct:
361 QRLLALVPMGIPRTLMRTTRFRGYTLPQGTEVFPLLGSILHEPNIFKHPEEFNPDRFLDA 420
Query:
421 DGRFRKHEAFLPFSLGKRVCLGEGLAKAELFLFFTTILQAFSLESPCPLDSLSLKPTISG 480
DGRFRKHEAFLPFSLGKRVCLGEGLAKAE+FLFFTTILQAFSLESPCP D+LSLKPT+SG
Sbjct:
421 DGRFRKHEAFLPFSLGKRVCLGEGLAKAEVFLFFTTILQAFSLESPCPPDTLSLKPTVSG 480
Query:
481 LFNIPPAFQLQVRPTDLHSTTQT 503
LFNIPPAFQLQVRPTDLHSTTQT
Sbjct:
481 LFNIPPAFQLQVRPTDLHSTTQT 503
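The Identities figure in the alignment header above is a count of alignment columns in which the Query and Sbjct residues are the same, divided by the total number of columns. A minimal Python sketch of that count follows, using the first 60-column block of the CYP2S1 alignment as input. The function name identity_counts is hypothetical and chosen only for illustration; the sketch assumes the two rows are already aligned and of equal length, and it does not reproduce the BLAST scoring itself.

```python
def identity_counts(query: str, sbjct: str) -> tuple[int, int]:
    """Return (identical columns, total columns) for two aligned rows."""
    if len(query) != len(sbjct):
        raise ValueError("aligned rows must have the same length")
    # A column counts as identical only if both rows carry the same residue;
    # a gap character ('-') is never counted as a match.
    identical = sum(1 for q, s in zip(query, sbjct) if q == s and q != "-")
    return identical, len(query)


# First block of the CYP2S1 alignment (rhesus Query, human Sbjct).
query = "MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMRL"
sbjct = "MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGNLLQLRPGALYSGLMRL"

same, total = identity_counts(query, sbjct)
print(f"{same}/{total} identical ({100 * same / total:.1f}%)")
```

Summing the per-block counts over every block of an alignment should give the numerator and denominator reported on the Identities line, provided the rows are copied without lost characters.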
>CYP2T2P
ortholog, SCAFFOLD100362 (+) 38209-41795
frameshift in exon 4 after VIC, numerous other defects
MIAGIAALLLWLLVLALARWG*GGCRARMRGSLPPRPRPLPLLGNLQLQSGGTDHALHS
(?)
LSGRWGPVFTAQLGPRPAVVLCGYAALRDALVLQADAFSGRGSMAVFERFTRGH
(1)
GIFLSNGPRWWTLRNFAVGALKELGLGTRTIQAHVLEEAACLLDEMQATI
(1)
GAPFDPMRLLDNAVSNVICX
LVFGNRYGYGDPEFLRLLNLFSDNFRIMSSRWGE
(0)
(?)
SLMDWLPGRHRRIFRNF
SELWVFISEQIQQHWQMRQPAEPRDFINCLTRWVRRGSQ
QDPESHFQEETSVMMTHLFFGGTETSTTLCYGLLVLLKYPEVA
(1)
AKVQELDPVVGWRCAPSPDDHQRLPYTNAVLLQIQRFISVVPLGLPRX
TLNTHLHSHCLPK
(1)
GTFVIPLLVTAHXDRTQFKDPDCFNATNFLDKGKSQGNDPFMPFAS
(1)
(?)
GKQMCLGAGLAHLEIFLFLTATLPRFRLLPVVNPGTINLT
QFTGLGSVPPAFQLQLVAC
>CYP2T2P
AC008537
Length = 457
Score = 2103 (740.3 bits), Expect = 7.1e-222, P = 7.1e-222
Identities = 413/482 (85%), Positives = 427/482 (88%)
Query: 1
MIAGIAALLLWLLVLALARWG*GGCRARMRGSLPPRPRPLPLLGNLQLQSGGTDHALHSL
60
M AGIAALLLWLLVLA A WG*GGCRA+MRGSLPPRPRPLPLLGNLQLQSGG D ALHSL
Sbjct: 1
MXAGIAALLLWLLVLAPAWWG*GGCRAQMRGSLPPRPRPLPLLGNLQLQSGGLDRALHSL
60
Query: 61
SGRWGPVFTAQLGPRPAVVLCGYAALRDALVLQADAFSGRGSMAVFERFTRGHGIFLSNG 120
SGRWG VFT +LGPRPAV LCGYAALRDALVLQADA SGRGSMAVFERFTRG+GI SN
Sbjct: 61
SGRWGRVFTVRLGPRPAVGLCGYAALRDALVLQADAVSGRGSMAVFERFTRGNGILFSNR 120
Query: 121
PRWWTLRNFAVGALKELGLGTRTIQAHVLEEAACLLDEMQATIGAPFDPMRLLDNAVSNV 180
P WWTLRNFA+GALK+ GLGTRT++A VLEEAACLLDE QATIGAPFDP+RLLDNAVSNV
Sbjct: 121
PCWWTLRNFALGALKKFGLGTRTVEARVLEEAACLLDEFQATIGAPFDPVRLLDNAVSNV 180
Query: 181 ICXLVFGNRYGYGDPEFLRLLNLFSDNFRIMSSRWGESLMDWLPGRHRRIFRNFSELWVF 240
IC LVFGNRY
YGDPEFLRLLNLFSDNF I+SSRWGESLMDWLPG H RIFRNFSEL V
Sbjct: 181 ICSLVFGNRYRYGDPEFLRLLNLFSDNFCIISSRWGESLMDWLPGPHHRIFRNFSELRV- 239
Query: 241
ISEQIQQHWQMRQPAEPRDFINCLTRWVRRGSQQDPESHFQEETSVMMTHLFFGGTET-S 299
ISEQIQ+HWQMRQPAEPRDFI+CLTRWVR G QQDPESHFQE TSVM TH FFG TET S
Sbjct: 240 ISEQIQRHWQMRQPAEPRDFIDCLTRWVRHG-QQDPESHFQE*TSVMTTHFFFGVTETTS 298
Query: 300
TTLCYGLLVLLKYPEVAAKVQELDPVVGWRCAPSPDDHQRLPYTNAVLLQIQRFISVVPL 359
TTLCYGLL+LLKY EVAAKVQELDPVVGWR APS D LPY NAVLL+IQ FISVVPL
Sbjct: 299
TTLCYGLLILLKYLEVAAKVQELDPVVGWRPAPSLDYRVCLPYANAVLLEIQCFISVVPL 358
Query: 360 GLPRX-TLNTHLHSHCLPKGTFVIPLLVTAHXDRTQFKDPDCFNATNFLDKGKSQGNDPF
418
GLPR TL+THLHSHCLPKGTFVIPLLVTAH D TQFKDPDCFN TNFLDKGK
QGND F
Sbjct: 359 GLPRTLTLDTHLHSHCLPKGTFVIPLLVTAHRDPTQFKDPDCFNPTNFLDKGKFQGNDAF
418
Query: 419
MPFASGKQMCLGAGLAHLEIFLFLTATLPRFRLLPVVNPGTINLTQFTGLGSVPPAFQLQ 478
MPFA KQMCLG GLAH IFLFLTATL RF LLPVV PGTINLTQ TGLGSVPP
FQLQ
Sbjct: 419
MPFAPAKQMCLGTGLAHSGIFLFLTATLQRFCLLPVVRPGTINLTQCTGLGSVPPDFQLQ 478
Query: 479 LVAC 482
VAC
Sbjct: 479 PVAC 482
>CYP2U1 (li Chen) note gc boundary between exons 7,8
MSSPGPPQPPAEDPPWPARLLRAPLGLLRMDPSGDALLLCGLVAVLGWSWLRRRRARGIP PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVVGPQVLLAHLARVYGSIFSF FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEK (1)GVVFAHYGPIWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSIISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLMVNICPWLYYLPFGPFKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ (1)EKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSGNT (1)VLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGI (1)
GKRVCMGEQLAKMELFLMFVSLMQSFAFALPEKSKKPLLTGRFGLTLAPHPFNITISRR*
Score = 1098 bits (2841), Expect = 0.0
Identities = 535/544 (98%), Positives = 540/544 (99%)
Query:
1
MSSPGPPQPPAEDPPWPARLLRAPLGLLRMDPSGDALLLCGLVAVLGWSWLRRRRARGIP 60
MSSPGP
QPPAEDPPWPARLLRAPLGLLR+DPSG ALLLCGLVA+LGWSWLRRRRARGIP
Sbjct:
1
MSSPGPSQPPAEDPPWPARLLRAPLGLLRLDPSGGALLLCGLVALLGWSWLRRRRARGIP 60
Query:
61 PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVVGPQVLLAHLARVYGSIFSF
120
PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSV+GPQVLLAHLARVYGSIFSF
Sbjct:
61
PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVIGPQVLLAHLARVYGSIFSF 120
Query:
121 FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEKGVVFAHYGPIWRQQRKF 180
FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEKGVVFAHYGP+WRQQRKF
Sbjct:
121 FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEKGVVFAHYGPVWRQQRKF 180
Query:
181 SHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSIISNAVSNIICSLCFGQR 240
SHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSIISNAVSNIICSLCFGQR
Sbjct:
181 SHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSIISNAVSNIICSLCFGQR 240
Query:
241 FDYTNSEFKKMLGFMSRGLEICLNSQVLMVNICPWLYYLPFGPFKELRQIEKDITSFLKK 300
FDYTNSEFKKMLGFMSRGLEICLNSQVL+VNICPWLYYLPFGPFKELRQIEKDITSFLKK
Sbjct:
241 FDYTNSEFKKMLGFMSRGLEICLNSQVLLVNICPWLYYLPFGPFKELRQIEKDITSFLKK 300
Query:
301 IIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLFYIIGDLFIAGTDTTTNS 360
IIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLFYIIGDLFIAGTDTTTNS
Sbjct:
301 IIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLFYIIGDLFIAGTDTTTNS 360
Query:
361 LLWCLLYMSLNPDVQEKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLA 420
LLWCLLYMSLNPDVQEKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLA
Sbjct:
361 LLWCLLYMSLNPDVQEKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLA 420
Query:
421 IPHMTSGNTVLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETF 480
IPHMTS
NTVLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETF
Sbjct:
421 IPHMTSENTVLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETF 480
Query:
481 IPFGIGKRVCMGEQLAKMELFLMFVSLMQSFAFALPEKSKKPLLTGRFGLTLAPHPFNIT 540
IPFGIGKRVCMGEQLAKMELFLMFVSLMQSFAFALPE SKKPLLTGRFGLTLAPHPFNIT
Sbjct:
481 IPFGIGKRVCMGEQLAKMELFLMFVSLMQSFAFALPEDSKKPLLTGRFGLTLAPHPFNIT 540
Query:
541 ISRR 544
ISRR
Sbjct:
541 ISRR 544
>CYP2W1
Macaca mulatta rhesus monkey (Mahrous)
LSERYGPVFTVHLGCQKTVVLTGFEVVKEALAGPGQELADRPPIAIFQLIQRGG (1)
GIFFSSGARWRAARQFTVRALHSLGVGRKPVADKILQELKCLLGQLDGYR (1)
GQPFPLALLGWAPSNITFTLLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQ (0)
LFNVYPWLGALLQLHRPVLRKIEEVRAILRTLLEARRPHMRPGDPVCSYVDALIQQGQ (0)
GDDPEGLFAEDNAVACTLDMVMAGTETTSATLQWAALLMGRHPDVQ (1)
GRVQEELDRVLRRGRPPQPEDQQVLPYTSAVLHEVQRFITLLPHVPRCTATDMQLGGFLLPK (0)
GTPVIPLLTSVLLDETQWQTPDQFNPGHFLDADGHFVKQEAFLPFSA (1)
GRRVCVGERLARTELFLLFAGLLQKYYLLPPPGVSPASLDTTPAQAFTMRPRAQALCAVPRP
Missing part = ggggatgaccccgagggctxgtttgctgaggacaacgcagtggcx
G D D P E G L F A E D N A V A
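The translation of the missing part above can be checked codon by codon. The following Python sketch is one way to do that, assuming the standard genetic code and assuming that "x" in the nucleotide string marks an unread base. The names translate, CODON_TABLE, and fragment are hypothetical and introduced only for this illustration. A codon containing an unread base is reported as a definite residue only when every possible base gives the same amino acid; otherwise it is reported as X.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate in frame 1; any character outside t/c/a/g is an unknown base."""
    dna = dna.lower()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        # Expand each unknown position to all four bases.
        options = [b if b in BASES else BASES for b in codon]
        residues = {CODON_TABLE["".join(c)] for c in product(*options)}
        # Keep the residue only if every expansion agrees on it.
        protein.append(residues.pop() if len(residues) == 1 else "X")
    return "".join(protein)


fragment = "ggggatgaccccgagggctxgtttgctgaggacaacgcagtggcx"
print(" ".join(translate(fragment)))
```

Under these assumptions the sketch gives G D D P E G X F A E D N A V A. The final codon gcx is alanine whatever the unread base is, while txg could encode L, S, W, or a stop. The L listed in the entry is therefore a choice that the nucleotide data alone do not fix; it agrees with the human sequence (GLFAE) in the alignment for this gene.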
>CYP2W1
AC073957.7 chromosome 7 clone RP11-449P15 40% to 2F1
Length = 490
Score = 785 bits (2027), Expect = 0.0
Identities = 395/432 (91%), Positives = 403/432 (93%), Gaps = 14/432 (3%)
Query:
1
LSERYGPVFTVHLGCQKTVVLTGFEVVKEALAGPGQELADRPPIAIFQLIQRGGGIFFSS 60
LSERYGPVFTVHLG QKTVVLTGFE VKEALAGPGQELADRPPIAIFQLIQRGGGIFFSS
Sbjct:
59
LSERYGPVFTVHLGRQKTVVLTGFEAVKEALAGPGQELADRPPIAIFQLIQRGGGIFFSS 118
Query:
61
GARWRAARQFTVRALHSLGVGRKPVADKILQELKCLLGQLDGYRGQPFPLALLGWAPSNI 120
GARWRAARQFTVRALHSLGVGR+PVADKILQELKCL GQLDGYRG+PFPLALLGWAPSNI
Sbjct:
119 GARWRAARQFTVRALHSLGVGREPVADKILQELKCLSGQLDGYRGRPFPLALLGWAPSNI 178
Query:
121 TFTLLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQLFNVYPWLGALLQLHRPVLRKIE 180
TF
LLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQLFNVYPWLGALLQLHRPVLRKIE
Sbjct:
179 TFALLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQLFNVYPWLGALLQLHRPVLRKIE 238
Query:
181 EVRAILRTLLEARRPHMRPGDPVCSYVDALIQQGQ--------------VCTLDMVMAGT 226
EVRAILRTLLEARRPH+ PGDPVCSYVDALIQQGQ
CTLDMVMAGT
Sbjct:
239 EVRAILRTLLEARRPHVCPGDPVCSYVDALIQQGQGDDPEGLFAEANAVACTLDMVMAGT 298
Query:
227 ETTSATLQWAALLMGRHPDVQGRVQEELDRVLRRGRPPQPEDQQVLPYTSAVLHEVQRFI 286
ETTSATLQWAALLMGRHPDVQGRVQEELDRVL
GR P+ EDQQ LPYTSAVLHEVQRFI
Sbjct:
299 ETTSATLQWAALLMGRHPDVQGRVQEELDRVLGPGRTPRLEDQQALPYTSAVLHEVQRFI 358
Query:
287 TLLPHVPRCTATDMQLGGFLLPKGTPVIPLLTSVLLDETQWQTPDQFNPGHFLDADGHFV 346
TLLPHVPRCTA D QLGGFLLPKGTPVIPLLTSVLLDETQWQTP QFNPGHFLDA+GHFV
Sbjct:
359 TLLPHVPRCTAADTQLGGFLLPKGTPVIPLLTSVLLDETQWQTPGQFNPGHFLDANGHFV 418
Query:
347 KQEAFLPFSAGRRVCVGERLARTELFLLFAGLLQKYYLLPPPGVSPASLDTTPAQAFTMR 406
K+EAFLPFSAGRRVCVGERLARTELFLLFAGLLQ+Y LLPPPGVSPASLDTTPA+AFTMR
Sbjct:
419 KREAFLPFSAGRRVCVGERLARTELFLLFAGLLQRYRLLPPPGVSPASLDTTPARAFTMR 478
Query:
407 PRAQALCAVPRP 418
PRAQALCAVPRP
Sbjct:
479 PRAQALCAVPRP 490
>CYP2AB1P
SCAFFOLD46808:34-204, SCAFFOLD101629:758-8003 (no ESTs found)
missing exons 2,3,7; exon 2 = Trace archive 650631246, 555635842
frameshift after THG and gc boundary in exon 2 (gc also in human)
exon 3 = 497888434
MLSLLSGLALLAISFLLLKLGTFCWDRNRLPPGPFPFPILGNLWQLRFQLHPETLLQ
(0)
LAQTHG
VCLFTVWVSPIPIVVLSGFRAVKEALVSNSEQFSGRPLTSLFQDLFGEQ
(1)
GIVCSRRHMWWQQRRFCLVTLQGLGLGKLALEVQLQKQAAELVEAFRQEL
(1)
SRSFDPQVSIVRSTVRVIGALVFGHHFLSEDPIFQELTQAIDFGLALVRTVWHW
(0)
LHDVFPRALCHLPGSHREIFRYQGVVRSFTRREITGRKLKALEALKDFINCSLAQISK
(0)
AMDEPVSTFHEENLVQVVIDLFLGGTNTTATTQRWALVYMIQHGAVQ
(1?)
GTIILPLCRGSVLYDPECWETPPQFNPGHFLDKDGNFVANEAFLPFSA
(1)
GHCVCPGDQLARMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTLQPQPQEICAVPR
>CYP2AB1
comparing only the known sequence, leaving out missing exon 7.
Length = 423, human has lost the KYG motif (THG in rhesus), lost heme Cys
Score = 1794 (631.5 bits), Expect = 3.6e-189, P = 3.6e-189
Identities = 353/428 (82%), Positives = 376/428 (87%)
Query: 1
MLSLLSGLALLAISFLLLKLGTFCWDRNRLPPGPFPFPILGNLWQLRFQLHPETLLQLAQ 60
MLSLLSGLALLAISFLLLKLGTFCWDR+ LPPGP PFPILGNLWQL FQLHPETLLQLAQ
Sbjct: 1
MLSLLSGLALLAISFLLLKLGTFCWDRSCLPPGPLPFPILGNLWQLCFQLHPETLLQLAQ 60
Query: 61 THGVCLFTVWVSPIPIVVLSGFRAVKEALVSNSEQFSGRPLTSLFQDLFGEQGIVCSRRH
120
+ +FTVWV PIP+ VLSGF+
VKEALVSNSEQFSGR LT LFQDLFGE+GI+CS
H
Sbjct: 61 S----VFTVWVGPIPVAVLSGFQVVKEALVSNSEQFSGRSLTPLFQDLFGERGIICSSGH
116
Query: 121 MWWQQRRFCLVTLQGLGLGKLALEVQLQKQAAELVEAFRQELSRSFDPQVSIVRSTVRVI
180
W Q+RRFCLV + GLGLGKLALEVQLQK+AAEL
EAFRQE R FDPQVSIVRSTVRVI
Sbjct:
117 TWRQKRRFCLVMI*GLGLGKLALEVQLQKEAAELAEAFRQEQGRPFDPQVSIVRSTVRVI
176
Query: 181
GALVFGHHFLSEDPIFQELTQAIDFGLALVRTVWHWLHDVFPRALCHLPGSHREIFRYQG 240
GALVFGHHFL EDPIFQELTQAIDFGLA V TVW
L+DVFP ALCHLPG H+EIFRYQ
Sbjct: 177 GALVFGHHFLLEDPIFQELTQAIDFGLAFVSTVWRRLYDVFPWALCHLPGPHQEIFRYQE
236
Query: 241
VVRSFTRREITGRKLKALEALKDFINCSLAQISKAMDEPVSTFHEENLVQVVIDLFLGGT 300
VV S +EIT KL+A EA +DFI+C LAQISKAMD+PVSTF++ENLV VVIDLFLGGT
Sbjct: 237
VVLSLIHQEITRHKLRAPEAPRDFISCYLAQISKAMDDPVSTFNQENLV*VVIDLFLGGT 296
Query: 301 NTTATTQRWALVYMIQHGAVQGTIILPLCRGSVLYDPECWETPPQFNPGHFLDKDGNFVA
360
+TTATT WAL++MIQHGAVQGTIILP SVLYDPECWETP QFNPGHF
DKDGNFVA
Sbjct: 297 DTTATTLCWALIHMIQHGAVQGTIILPNL-ASVLYDPECWETPRQFNPGHFSDKDGNFVA
355
Query: 361 NEAFLPFSAGHCVCPGDQLARMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTLQPQP
420
NEAFLPFSAGH V P
DQLA+MELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGT QPQP
Sbjct: 356 NEAFLPFSAGHRVYPADQLAQMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTWQPQP
415
Query: 421 QEICAVPR 428
QEICAVPR
Sbjct: 416 QEICAVPR 423
>CYP2AC1P
SCAFFOLD 55146 (+) 91919 103589
note: the upstream neighbor is the rhag gene, which flanks the Xenopus 31 gene cluster. This large gene cluster may have originated from the CYP2AC1 gene.
This gene is on human chromosome 6. In humans rhag is about 22 kb upstream of CYP2AC1.
Exon 1 = SCAFFOLD 70822:43519-43602
Exon 3 = SCAFFOLD 103481:817-966
1
MSEFDASAILPIRVLILIFILSIKKFMTEASKQLSPPGPRPLLVIGNLYFLNLKRPYQTMLE (0)
3
GIAFFHGETWKTMRWFSLTTLQNFGMDEWIIEDTIIEECQNLIQNSEFHR
4
GKSFEMKTIMNASVVNIIVLVLPGKWFDYQDSQFLRLLALIGENVKLIGGLRIAVN (1)
5
SFQYVSFWGVLLKSHKTVFRNRDELFSFIRMIFLDHCHKLDKNDPRSFTDAFLVTQQE (0)
6
ENDTFADHFSDENLMALVNNLFTTGTETTASTLPWGILLVICLRSRV (1)
7
KKVHNEVTKVARSAQP*LAHQTQMPHTDAVSHEVQRFANILPTSLPHATPTNIFKNYYIPK
(0)
8
ATEVIILLASVRRDQAQWEKPDTFNPEHFLTSKGKFIKREAFLPFTV (1)
9
GRRMCAGESSAR MELFLFFTSLLQ KFTFQPPLGVSHLDLDLSLDIGFTT*
>CYP3A64
AY582531.1
MDLIPDLAVETWLLLAVTLVLLYLYGTHSHGLFKKLGIPGPTPL
PLLGNILSYRKGFWTFDMECYKKYGKVWGFYDGRQPVLAITDPNMIKTVLVKECYSVF
TNRRPFGPVGFMKNAISIAEDEEWKRIRSLLSPTFTSGKLKEMVPIIAKYGDVLVRNL
RREAETGKPVTLKDVFGAYSMDVITSTSFGVNIDSLNNPQDPFVENTKKLLRFDFLDP
FFLSITIFPFIIPILEVLNISIFPREVTSFLRKSVKRIKESRLKDTQKHRVDFLQLMI
DSQNSKETESHKALSDQELVAQSIIFIFAGYETTSSVLSFIIYELATHPDVQQKLQEE
IDTVLPNKAPPTYDTVLQMEYLDMVVNETLRIFPIAMRLERVCKKDVEINGIFIPKGV
VVMIPSYALHHDPKYWPEPEKFLPERFSKKNNDNIDPYIYTPFGSGPRNCIGMRFALM
NMKLAIIRVLQNFSFKPCKETQIPLKLRLGGLLQTEKPIVLKIESRDGTVSGA
>CYP3A43 ortholog? partial assembly (Aggarwal) 72% to 3A64
MDLIPNFAMETWVLVATSLVLL (2)YIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLR (0)GLWKFDRECNEKYGEMWG (2)LYEGQQPMLVIMDPDMIKTVLVKECYSVFTNRM (0)PLGPMGFMKSALSFAEDEEWKRIRTLLSPAFTSVKFKE (0)MVPIISQCGDMLVRSLRREAENSKPTNLKE
>CYP3A43 AC011904 one exon per line
Length = 504
Score = 348 bits (893), Expect = 5e-99
Identities = 168/174 (96%), Positives = 171/174 (98%)
Query: 1 MDLIPNFAMETWVLVATSLVLLYIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLRGLWKF 60
MDLIPNFAMETWVLVATSLVLLYIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLRGLW F
Sbjct: 1 MDLIPNFAMETWVLVATSLVLLYIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLRGLWNF 60
Query: 61 DRECNEKYGEMWGLYEGQQPMLVIMDPDMIKTVLVKECYSVFTNRMPLGPMGFMKSALSF 120
DRECNEKYGEMWGLYEGQQPMLVIMDPDMIKTVLVKECYSVFTN+MPLGPMGF+KSALSF
Sbjct: 61 DRECNEKYGEMWGLYEGQQPMLVIMDPDMIKTVLVKECYSVFTNQMPLGPMGFLKSALSF 120
Query: 121 AEDEEWKRIRTLLSPAFTSVKFKEMVPIISQCGDMLVRSLRREAENSKPTNLKE 174
AEDEEWKRIRTLLSPAFTSVKFKEMVPIISQCGDMLVRSLR+EAENSK NLKE
Sbjct: 121 AEDEEWKRIRTLLSPAFTSVKFKEMVPIISQCGDMLVRSLRQEAENSKSINLKE 174
>CYP4A11 match (ramy.Naguib) partial seq. Missing exon 1 (added later)
MSVSVLSPSRLLGGVSGILQVASLLILLLLLIKAAQLYLHRQWLLKAFQQFPCSPSHWLFGHKQE (0)FQQDQELQRIRKWVEMFPSACPLWLWGGKARVQLHDPDYMKVILGRS (1)DPKSQDPYRFLAPWI (1)GYGLLLLNGQTWFQHRRMLTPAFHYDILKAYVALMADSVRVML (0)DKWEKLLGQDSPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDR (2)DSQSYIQAISDLNNLVFSRVRNVFHQNDTIYSLTSTGRWTHRACQLAHQHT (1)DQVIQLRKAQLQKEGELEKVKRKKHLDFLDILLLAK (0)MENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHGLLGDGASITW (2)NHLDQMPYTTMCIKEALRLYPPVPGISRELSTPVTFPDGRSLPK (1)GITVMLSIYGLHHNPKVWPNPE (0)VFDPSRFAPGSAQHSHAFLPFSGGSR (2)NCIGKQFAMNELKVATALTLLRFELLPDPTRIPIPMARLVLKSKNGIHLRLRRLPNPCEDKDQL
The alignment of the two proteins:
Score = 1010 bits (2612), Expect = 0.0
Identities = 491/519 (94%), Positives = 499/519 (96%)
Query:
1
MSVSVLSPSRLLGGVSGILQVASLLILLLLLIKAAQLYLHRQWLLKAFQQFPCSPSHWLF 60
MSVSVLSPSRLLG VSGILQ ASLLILLLLLIKA QLYLHRQWLLKA QQFPC PSHWLF
Sbjct:
1
MSVSVLSPSRLLGDVSGILQAASLLILLLLLIKAVQLYLHRQWLLKALQQFPCPPSHWLF 60
Query:
61
GHKQEFQQDQELQRIRKWVEMFPSACPLWLWGGKARVQLHDPDYMKVILGRSDPKSQDPY 120
GH QE
QQDQELQRI+KWVE FPSACP WLWGGK RVQL+DPDYMKVILGRSDPKS Y
Sbjct:
61
GHIQELQQDQELQRIQKWVETFPSACPHWLWGGKVRVQLYDPDYMKVILGRSDPKSHGSY 120
Query:
121 RFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDILKAYVALMADSVRVMLDKWEKLLGQD 180
RFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDILK YV LMADSVRVMLDKWE+LLGQD
Sbjct:
121 RFLAPWIGYGLLLLNGQTWFQHRRMLTPAFHYDILKPYVGLMADSVRVMLDKWEELLGQD 180
Query:
181 SPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDRDSQSYIQAISDLNNLVFSRVRNVFHQND 240
SPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDR+SQSYIQAISDLNNLVFSRVRN FHQND
Sbjct:
181 SPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDRNSQSYIQAISDLNNLVFSRVRNAFHQND 240
Query:
241 TIYSLTSTGRWTHRACQLAHQHTDQVIQLRKAQLQKEGELEKVKRKKHLDFLDILLLAKM 300
TIYSLTS GRWTHRACQLAHQHTDQVIQLRKAQLQKEGELEK+KRK+HLDFLDILLLAKM
Sbjct:
241 TIYSLTSAGRWTHRACQLAHQHTDQVIQLRKAQLQKEGELEKIKRKRHLDFLDILLLAKM 300
Query:
301 ENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHGLLGDGAS 360
ENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIH LLGDGAS
Sbjct:
301 ENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHSLLGDGAS 360
Query:
361 ITWNHLDQMPYTTMCIKEALRLYPPVPGISRELSTPVTFPDGRSLPKGITVMLSIYGLHH 420
ITWNHLDQMPYTTMCIKEALRLYPPVPGI RELSTPVTFPDGRSLPKGI V+LSIYGLHH
Sbjct:
361 ITWNHLDQMPYTTMCIKEALRLYPPVPGIGRELSTPVTFPDGRSLPKGIMVLLSIYGLHH 420
Query:
421 NPKVWPNPEVFDPSRFAPGSAQHSHAFLPFSGGSRNCIGKQFAMNELKVATALTLLRFEL 480
NPKVWPNPEVFDPSRFAPGSAQHSHAFLPFSGGSRNCIGKQFAMNELKVATALTLLRFEL
Sbjct:
421 NPKVWPNPEVFDPSRFAPGSAQHSHAFLPFSGGSRNCIGKQFAMNELKVATALTLLRFEL 480
Query:
481 LPDPTRIPIPMARLVLKSKNGIHLRLRRLPNPCEDKDQL 519
LPDPTRIPIP+ARLVLKSKNGIHLRLRRLPNPCEDKDQL
Sbjct:
481 LPDPTRIPIPIARLVLKSKNGIHLRLRRLPNPCEDKDQL 519
>CYP4A22
search (Puljic) same seq as the 4A11 hit above
MSVSVLSPSRLLGGVSGILQVASLLILLLLLIKAAQLYLHRQWLLKAFQQFPCSPSHWLF
GHKQEFQQDQELQRIRKWVEMFPSACPLWLWGGKARVQLHDPDYMKVILGRSGYGLLLLN
GQTWFQHRRMLTPAFHYDILKAYVALMADSVRVMLDKWEKLLGQDSPLEVFQHVSLMTLD
TIMKCAFSHQGSIQVDRDSQSYIQAISDLNNLVFSRVRNVFHQNDTIYSLTSTGRWTHRA
CQLAHQHTETK*SN*GRLNYRRRGSWRRSRGRSTWISWTSSSWPKWRMGASCQTRTSVPK
WTHSCSRATTPQPVGSPGFCMLWPHTPSIRRGAGKRSMASWVMEPPSPGENHLDQMPYTT
MCIKEALRLYPPVPGISRELSTPVTFPDGRSLPKGITVMLSIYGLHHNPKVWPNPEVFDP
SRFAPGSAQHSHAFLPFSGGSRNCIGKQFAMNELKVATALTLLRFELLPDPTRIPIPMAR
LVLKSKNGIHLRLRRLPNPCEDKDQL
>CYP4B1
MVPSFLSLRLSCLGLWASGLILVLGFLKLIRLLLRRQRLAKAMGNFPGPPTHWLFGHALE
(0)
IQQTGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRG
(1)
DPKAPDVYDFFLQWI
(1)
GRGLLVLEGPKWFQHRKLLTPGFHYDVLKPYVALFAESTRVML (0)
DKWEEKAQEGKSFDIFCDVGHMALNTLMKCTFGRGDTGLGHR
(2)
DSSYYLAVSDLTLLMQQRLVSFHYHNDFIYWLTPHGRRFLRACQVAHDHT
(1)
DQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGAR
(0)
DEDDSKLSDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDSFQW
(2)
DDLGKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPA
(1)
GSLISMHIYALHRNSAVWPDPE
(0)
VFDPLRFSTENASKRHPFAFMPFSAGPR
(2)
NCIGQQFAMSEMKVVTAMCLLHFEFSLDPSRLPIKMLQLVLRSKNGIHLHLKPLGPGSGK
>CYP4B1
NM_000779
Length = 511
Score = 1020 bits (2638), Expect = 0.0
Identities = 490/511 (95%), Positives = 495/511 (96%)
Query:
1
MVPSFLSLRLSCLGLWASGLILVLGFLKLIRLLLRRQRLAKAMGNFPGPPTHWLFGHALE 60
MVPSFLSL S
LGLWASGLILVLGFLKLI LLLRR+ LAKAM
FPGPPTHWLFGHALE
Sbjct:
1
MVPSFLSLSFSSLGLWASGLILVLGFLKLIHLLLRRRTLAKAMDKFPGPPTHWLFGHALE 60
Query:
61
IQQTGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRGDPKAPDVYDFFLQ 120
IQ+TGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRGDPKAPDVYDFFLQ
Sbjct:
61
IQETGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRGDPKAPDVYDFFLQ 120
Query:
121 WIGRGLLVLEGPKWFQHRKLLTPGFHYDVLKPYVALFAESTRVMLDKWEEKAQEGKSFDI 180
WIGRGLLVLEGPKW QHRKLLTPGFHYDVLKPYVA+F ESTR+MLDKWEEKA+EGKSFDI
Sbjct:
121 WIGRGLLVLEGPKWLQHRKLLTPGFHYDVLKPYVAVFTESTRIMLDKWEEKAREGKSFDI 180
Query:
181 FCDVGHMALNTLMKCTFGRGDTGLGHRDSSYYLAVSDLTLLMQQRLVSFHYHNDFIYWLT 240
FCDVGHMALNTLMKCTFGRGDTGLGHRDSSYYLAVSDLTLLMQQRLVSF YHNDFIYWLT
Sbjct:
181 FCDVGHMALNTLMKCTFGRGDTGLGHRDSSYYLAVSDLTLLMQQRLVSFQYHNDFIYWLT 240
Query:
241 PHGRRFLRACQVAHDHTDQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGARDEDDSKL 300
PHGRRFLRACQVAHDHTDQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGARDEDD KL
Sbjct:
241 PHGRRFLRACQVAHDHTDQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGARDEDDIKL 300
Query:
301 SDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDSFQWDDL 360
SDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQD FQWDDL
Sbjct:
301 SDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDFFQWDDL 360
Query:
361 GKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPAGSLISMHIYALHRNSAVWP 420
GKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPAGSLISMHIYALHRNSAVWP
Sbjct:
361 GKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPAGSLISMHIYALHRNSAVWP 420
Query:
421 DPEVFDPLRFSTENASKRHPFAFMPFSAGPRNCIGQQFAMSEMKVVTAMCLLHFEFSLDP 480
DPEVFD
LRFSTENASKRHPFAFMPFSAGPRNCIGQQFAMSEMKVVTAMCLL FEFSLDP
Sbjct:
421 DPEVFDSLRFSTENASKRHPFAFMPFSAGPRNCIGQQFAMSEMKVVTAMCLLRFEFSLDP 480
Query:
481 SRLPIKMLQLVLRSKNGIHLHLKPLGPGSGK 511
SRLPIKM QLVLRSKNG HLHLKPLGPGSGK
Sbjct:
481 SRLPIKMPQLVLRSKNGFHLHLKPLGPGSGK 511
>CYP4V2
(S.Sarva)
MAGIWLGLVWQKLLLWGAASAVSLAGASLVLSLLQRVASYVRKWQQMRPIPTVARAYPLV GHALLMKRDGR (1)EFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVE (0)VILTSSKQIDKSSMYKFLEPWLGLGLLTS (2)TGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKHVNQEAFNCFVYITLCALDIIC (1)ETAMGKNIGAQSNDDSEYVRAVYR (2)MSEMIFRRIKMPWLWLDLWYLMFKEGWEHKKSLKILHAFTNN (0)VIAERANEMNVDEDCRGDGRDSAPSKNKRRAFLDLLLSVTDDEGNRLSHEDIREEVDTFMFE (0)GHDTTAAAMNWSLYLLGSNPEVQKKVDHELDDVF (1)GRTDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEV (1)AGYRVLKGTEAVIIPYALHRDPRYFPNPEEFRPERFFPENAQGRHPYAYVPFSAGPRNCI (1)GQKFAVMEEKTILSCILRHFWIESNQKREELGLEGQLILRPTNGIWIKLKRRNADE 1551
Score = 1040 bits (2688), Expect = 0.0
Identities = 508/524 (96%), Positives = 517/524 (98%), Gaps = 1/524 (0%)
Frame = +2
Query 1
MAGIWLGLVWQKLLLWGAASAVSLAGASLVLSLLQRVASYVRKWQQMRPIPTVARAYPLV 60
MAG+WLGLVWQKLLLWGAASA+SLAGASLVLSLLQRVASY RKWQQMRPIPTVARAYPLV
Sbjct 305
MAGLWLGLVWQKLLLWGAASALSLAGASLVLSLLQRVASYARKWQQMRPIPTVARAYPLV 484
Query 61
GHALLMKRDGREFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVEVILTSSKQIDK 120
GHALLMK
DGREFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVEVILTSSKQIDK
Sbjct 485
GHALLMKPDGREFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVEVILTSSKQIDK 664
Query 121
SSMYKFLEPWLGLGLLTSTGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKH 180
SSMYKFLEPWLGLGLLTSTGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKH
Sbjct 665
SSMYKFLEPWLGLGLLTSTGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKH 844
Query 181
VNQEAFNCFVYITLCALDIICETAMGKNIGAQSNDDSEYVRAVYRMSEMIFRRIKMPWLW 240
+NQEAFNCF YITLCALDIICETAMGKNIGAQSNDDSEYVRAVYRMSEMIFRRIKMPWLW
Sbjct 845
INQEAFNCFFYITLCALDIICETAMGKNIGAQSNDDSEYVRAVYRMSEMIFRRIKMPWLW 1024
Query 241
LDLWYLMFKEGWEHKKSLKILHAFTNNVIAERANEMNVDEDCRGDGRDSAPSKNKRRAFL 300
LDLWYLMFKEGWEHKKSLKILH FTN+VIAERANEMN +EDCRGDGR SAPSKNKRRAFL
Sbjct 1025
LDLWYLMFKEGWEHKKSLKILHTFTNSVIAERANEMNANEDCRGDGRGSAPSKNKRRAFL 1204
Query 301
DLLLSVTDDEGNRLSHEDIREEVDTFMFEGHDTTAAAMNWSLYLLGSNPEVQKKVDHELD 360
DLLLSVTDDEGNRLSHEDIREEVDTFMFEGHDTTAAA+NWSLYLLGSNPEVQKKVDHELD
Sbjct
1205
DLLLSVTDDEGNRLSHEDIREEVDTFMFEGHDTTAAAINWSLYLLGSNPEVQKKVDHELD 1384
Query 361
DVFGRTDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEVAGYRVLKGTEAV 419
DVFG++DRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEVAGYRVLKGTEAV
Sbjct 1385 DVFGKSDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEVAGYRVLKGTEAV 1564
Query 420
IIPYALHRDPRYFPNPEEFRPERFFPENAQGRHPYAYVPFSAGPRNCIGQKFAVMEEKTI 479
IIPYALHRDPRYFPNPEEF+PERFFPENAQGRHPYAYVPFSAGPRNCIGQKFAVMEEKTI
Sbjct 1565 IIPYALHRDPRYFPNPEEFQPERFFPENAQGRHPYAYVPFSAGPRNCIGQKFAVMEEKTI 1744
Query 480 LSCILRHFWIESNQKREELGLEGQLILRPTNGIWIKLKRRNADE 523
LSCILRHFWIESNQKREELGLEGQLILRP+NGIWIKLKRRNADE
Sbjct 1745 LSCILRHFWIESNQKREELGLEGQLILRPSNGIWIKLKRRNADE 1876
>CYP4X1
(Vasser) missing exon 1
FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFFYIYDPDYAKTFLSRT
(1)
DPKSQYLQKFLPPLI (1)
GKGLLALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKTML
(0)
DKWEKICSTQNTSVEVYEHINLMSLDIIMKCAFSKETNCQTN
(2)
STHDPYVKAIFELGKIIFHRLYSFLYHSDIIFKLSPQGYRFQKLSRVLNQYT
(1)
DAIIQERKKSLQAGEKQDNTQKRKYQDFLDIVLSAK
(0)
DENGSSFSDTDVHSEVSMFLLGGHDSLAASISWILYCLALNPEHQERCREEVRGILGDGCSITW
(2)
DQLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA*
>CYP4X1 R56515, R53456, AA652746, AC026935
Length = 506
Score = 664 bits (1714), Expect = 0.0
Identities = 323/343 (94%), Positives = 327/343 (95%)
Query:
1
FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFFYIYDPDYAKTFLSRTDPKSQYLQKFLPP 60
FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFF IYDPDYAKT LSRTDPKSQYLQKF PP
Sbjct:
60
FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFFCIYDPDYAKTLLSRTDPKSQYLQKFSPP 119
Query:
61
LIGKGLLALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKTMLDKWEKICSTQNTSVE 120
L+GKGL
ALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVK MLDKWEKICSTQ+TSVE
Sbjct:
120 LLGKGLAALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKMMLDKWEKICSTQDTSVE 179
Query:
121 VYEHINLMSLDIIMKCAFSKETNCQTNSTHDPYVKAIFELGKIIFHRLYSFLYHSDIIFK 180
VYEHIN
MSLDIIMKCAFSKETNCQTNSTHDPY KAIFEL KIIFHRLYS LYHSDIIFK
Sbjct:
180 VYEHINSMSLDIIMKCAFSKETNCQTNSTHDPYAKAIFELSKIIFHRLYSLLYHSDIIFK 239
Query:
181 LSPQGYRFQKLSRVLNQYTDAIIQERKKSLQAGEKQDNTQKRKYQDFLDIVLSAKDENGS 240
LSPQGYRFQKLSRVLNQYTD IIQERKKSLQAG KQDNT KRKYQDFLDIVLSAKDE+GS
Sbjct:
240 LSPQGYRFQKLSRVLNQYTDTIIQERKKSLQAGVKQDNTPKRKYQDFLDIVLSAKDESGS 299
Query:
241 SFSDTDVHSEVSMFLLGGHDSLAASISWILYCLALNPEHQERCREEVRGILGDGCSITWD 300
SFSD
DVHSEVS FLL GHD+LAASISWILYCLALNPEHQERCREEVRGILGDG SITWD
Sbjct:
300 SFSDIDVHSEVSTFLLAGHDTLAASISWILYCLALNPEHQERCREEVRGILGDGSSITWD 359
Query:
301 QLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA 343
QLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA
Sbjct:
360 QLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA 402
>CYP5A1
(Z. Zhang, N.Adeltawab) partial
ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMTPLISQACDLLLAHLKRYAE
SGDAFDIQRCYCNYTTDVVASVAFGTPVDSQQAPEDPFVKHCKRFFEFCIPRPILVLLLS
FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP
VGVQDFDMVGDVFSSTRCKPNPSRQHQAGPMARPLTVDEIVGQAFIFLIAGYEIVTNTLS
FATYLLATNPDCQEKLLREVDLFKEKHMVPEFCSLEEGLPYLDMVIAETLRMYPPAF
>CYP5A1 NM_001061 this gene is 197000 bases long
Length = 534
Score = 576 bits (1484), Expect = e-167
Identities = 285/297 (95%), Positives = 289/297 (97%)
Query: 1 ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMTPLISQACDLLLAHLKRYAE 60
ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEM PLISQACDLLLAHLKRYAE
Sbjct: 113 ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMVPLISQACDLLLAHLKRYAE 172
Query: 61 SGDAFDIQRCYCNYTTDVVASVAFGTPVDSQQAPEDPFVKHCKRFFEFCIPRPILVLLLS 120
SGDAFDIQRCYCNYTTDVVASV FGTPVDS QAPEDPFVKHCKRFFEFCIPRPILVLLLS
Sbjct: 173 SGDAFDIQRCYCNYTTDVVASVPFGTPVDSWQAPEDPFVKHCKRFFEFCIPRPILVLLLS 232
Query: 121 FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP 180
FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP
Sbjct: 233 FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP 292
Query: 181 VGVQDFDMVGDVFSSTRCKPNPSRQHQAGPMARPLTVDEIVGQAFIFLIAGYEIVTNTLS 240
+GVQDFD+V DVFSST CKPNPSRQHQ PMARPLTVDEIVGQAFIFLIAGYEI+TNTLS
Sbjct: 293 MGVQDFDIVRDVFSSTGCKPNPSRQHQPSPMARPLTVDEIVGQAFIFLIAGYEIITNTLS 352
Query: 241 FATYLLATNPDCQEKLLREVDLFKEKHMVPEFCSLEEGLPYLDMVIAETLRMYPPAF 297
FATYLLATNPDCQEKLLREVD+FKEKHM PEFCSLEEGLPYLDMVIAETLRMYPPAF
Sbjct: 353 FATYLLATNPDCQEKLLREVDVFKEKHMAPEFCSLEEGLPYLDMVIAETLRMYPPAF 409
>CYP7A1
(A.Bolen)
MMTTSLIWGIAIAACCCLWLILGIRRR (2)QTGEPPLENGLIPYLGCALQFGANPLEFLRANQRKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHFATSAK (0)AFGHRSIDPMDGNTTENINDTFIKTLQGHALNSLTESMMENLQRIMRPPVSSNSKTAAWVTEGMYSFCYRVMFEAGYLTIFGRDLTRRDTQKAHILNNLDNFKQFDKVFPALVAGLPIHMFRTAHNAREKLAESLRHENLQKRESISELISLRMFLNDTLSTFDDLEKAKTHLVVLWASQANTIPATFWSLFQMIR (2)NPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQAELNDLPVL (1)DSIIKESLRLSSASLNIRTAKEDFTLHLEDGSYNIRKDSIIALYPQLMHLDPEIYPDPL (0)TFKYDRYLDENGKTKTTFYCNGLKLKYYYMPFGSGATICPGRLFA
IHEIKQFLILMLSYFELELIEGQAKCPPLDQSRAGLGILPPLNDIEFKYKFKHL*
>CYP7A1 NM_000780
Length = 504
Score = 998 bits (2581), Expect = 0.0
Identities = 485/504 (96%), Positives = 495/504 (98%)
Query: 1 MMTISLIWGIAIAACCCLWLILGIRRRQTGEPPLENGLIPYLGCALQFGANPLEFLRANQ 60
MMT SLIWGIAIAACCCLWLILGIRRRQTGEPPLENGLIPYLGCALQFGANPLEFLRANQ
Sbjct: 1 MMTTSLIWGIAIAACCCLWLILGIRRRQTGEPPLENGLIPYLGCALQFGANPLEFLRANQ 60
Query: 61 RKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHFATSAKAFGHRSIDPKDGN 120
RKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHFATSAKAFGHRSIDP DGN
Sbjct: 61 RKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHFATSAKAFGHRSIDPMDGN 120
Query: 121 TTENINNTFIKTLQGNALNSLTESMMENLQRIMRPPVFSNSKTAAWVTEGMYSFCYRVMF 180
TTENIN+TFIKTLQG+ALNSLTESMMENLQRIMRPPV SNSKTAAWVTEGMYSFCYRVMF
Sbjct: 121 TTENINDTFIKTLQGHALNSLTESMMENLQRIMRPPVSSNSKTAAWVTEGMYSFCYRVMF 180
Query: 181 EAGYLTIFGRDLTRQDTQKAHILNNLDNFKQFDKVFPALVAGLPIHMFRTAHSAREKLAE 240
EAGYLTIFGRDLTR+DTQKAHILNNLDNFKQFDKVFPALVAGLPIHMFRTAH+AREKLAE
Sbjct: 181 EAGYLTIFGRDLTRRDTQKAHILNNLDNFKQFDKVFPALVAGLPIHMFRTAHNAREKLAE 240
Query: 241 SLRHENLQKRESVSELIRLRMFLNDTLSTFDDLEKAKTHLVVLWASQANTIPATFWSLFQ 300
SLRHENLQKRES+SELI LRMFLNDTLSTFDDLEKAKTHLVVLWASQANTIPATFWSLFQ
Sbjct: 241 SLRHENLQKRESISELISLRMFLNDTLSTFDDLEKAKTHLVVLWASQANTIPATFWSLFQ 300
Query: 301 MIRNPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQTQLNDLPVLESIIKESLRLSSAS 360
MIRNPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQ +LNDLPVL SIIKESLRLSSAS
Sbjct: 301 MIRNPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQAELNDLPVLNSIIKESLRLSSAS 360
Query: 361 LNIRTAKEDFTLHLEDGSYNIRKDDIIALYPQLMHLDPEIYPDPLSFKYDRYLDENGKTK 420
LNIRTAKEDFTLHLEDGSYNIRKD IIALYPQLMHLDPEIYPDPL+FKYDRYLDENGKTK
Sbjct: 361 LNIRTAKEDFTLHLEDGSYNIRKDSIIALYPQLMHLDPEIYPDPLTFKYDRYLDENGKTK 420
Query: 421 TTFYCNGLKLKYYYMPFGSGATICPGRVFAIHEIKQFLVLMLSYFELELVEGQDKCPPLD 480
TTFYCNGLKLKYYYMPFGSGATICPGR+FAIHEIKQFL+LMLSYFELEL+EGQ KCPPLD
Sbjct: 421 TTFYCNGLKLKYYYMPFGSGATICPGRLFAIHEIKQFLILMLSYFELELIEGQAKCPPLD 480
Query: 481 QSRAGLGILPPLYDIEFKYKFKHL 504
QSRAGLGILPPL DIEFKYKFKHL
Sbjct: 481 QSRAGLGILPPLNDIEFKYKFKHL 504
>CYP7B1
(G.Zhu) partial
RRPGEPPLIKGWLPYLGVVLKLRKDPLSFMKTLQKQHGDTFTVLLG
GKYITFILDPFQYQLVIKNHKQLSFRLFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN
LKQVFESQLLKTTSWDTAQLYPFCSSIIFEITFTTIYGKVLVCDNKFISELRDDFLKFDD
KFAYLVSNIPIELLGNVKSIRKKIIKCLSSENLAKMQGWSEVFQSRQDVLEKYYVHEDLE
IGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIHL
TREQLDSLICLESTIFEALRLSSYSTTIRFVEEDLTLSAQTGDYCVRKGDLGAIFPPILH
GDPEIFEAPDSKEFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALMEI
KQLLVILLTYFDLEIIDDKPIGLNYNRLLFGIQYPDSDVLFRYKVKS
Alignment of the two sequences, 95% identical
Query: 1 RRPGEPPLIKGWLPYLGVVLKLRKDPLSFMKTLQKQHGDTFTVLLGGKYITFILDPFQYQ 60
RRPGEPPLIKGWLPYLGVVL LRKDPL FMKTLQKQHGDTFTVLLGGKYITFILDPFQYQ
Sbjct: 41 RRPGEPPLIKGWLPYLGVVLNLRKDPLRFMKTLQKQHGDTFTVLLGGKYITFILDPFQYQ 100
Query: 61 LVIKNHKQLSFRLFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN 120
LVIKNHKQLSFR+FSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN
Sbjct: 101 LVIKNHKQLSFRVFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN 160
Query: 121 LKQVFESQLLKTTSWDTAQLYPFCSSIIFEITFTTIYGKVLVCDN-KFISELRDDFLKFD 179
LKQVFE QLLKTTSWDTA+LYPFCSSIIFEITFTTIYGKV+VCDN KFISELRDDFLKFD
Sbjct: 161 LKQVFEPQLLKTTSWDTAELYPFCSSIIFEITFTTIYGKVIVCDNNKFISELRDDFLKFD 220
Query: 180 DKFAYLVSNIPIELLGNVKSIRKKIIKCLSSENLAKMQGWSEVFQSRQDVLEKYYVHEDL 239
DKFAYLVSNIPIELLGNVKSIR+KIIKC SSE LAKMQGWSEVFQSRQDVLEKYYVHEDL
Sbjct: 221 DKFAYLVSNIPIELLGNVKSIREKIIKCFSSEKLAKMQGWSEVFQSRQDVLEKYYVHEDL 280
Query: 240 EIGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIH 299
EIGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIH
Sbjct: 281 EIGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIH 340
Query: 300 LTREQLDSLICLESTIFEALRLSSYSTTIRFVEEDLTLSAQTGDYCVRKGDLGAIFPPIL 359
LTREQLDSLICLES+IFEALRLSSYSTTIRFVEEDLTLS++TGDYCVRKGDL AIFPP+L
Sbjct: 341 LTREQLDSLICLESSIFEALRLSSYSTTIRFVEEDLTLSSETGDYCVRKGDLVAIFPPVL 400
Query: 360 HGDPEIFEAPDSKEFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALME 419
HGDPEIFEAP+ EFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALME
Sbjct: 401 HGDPEIFEAPE--EFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALME 458
Query: 420 IKQLLVILLTYFDLEIIDDKPIGLNYNRLLFGIQYPDSDVLFRYKVKS 467
IKQLLVILLTYFDLEIIDDKPIGLNY+RLLFGIQYPDSDVLFRYKVKS
Sbjct: 459 IKQLLVILLTYFDLEIIDDKPIGLNYSRLLFGIQYPDSDVLFRYKVKS 506
>CYP8A1
partial (Lin Zhu)
GDKDHMCSVKSRLWKLLSPARLATRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWATQ
(0)
GNMGPAAFWLLLFLLKNPEALAAVRGELESILWEAEQPVSQMTTLPQKVLDGTPVL
(1)
DSVLSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPE
(0)
VFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGKSYAVNSIKQ
(2)
FVFLVLVHLDLELINADVEIPEFDLSRYGFGLMQPEHDVPIRYRIRP
Length = 500
Score = 553 bits (1424), Expect = e-160
Identities = 270/276 (97%), Positives = 273/276 (98%)
Query:
1
GDKDHMCSVKSRLWKLLSPARLATRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWAT 60
GDKDHMCSVKSRLWKLLSPARLA RAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWAT
Sbjct:
225 GDKDHMCSVKSRLWKLLSPARLARRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWAT 284
Query:
61
QGNMGPAAFWLLLFLLKNPEALAAVRGELESILWEAEQPVSQMTTLPQKVLDGTPVLDSV 120
QGNMGPAAFWLLLFLLKNPEALAAVRGELESILW+AEQPVSQ TTLPQKVLD TPVLDSV
Sbjct:
285 QGNMGPAAFWLLLFLLKNPEALAAVRGELESILWQAEQPVSQTTTLPQKVLDSTPVLDSV 344
Query:
121 LSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPEVF 180
LSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPEVF
Sbjct:
345 LSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPEVF 404
Query:
181 KYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGKSYAVNSIKQFVFLVLVHLDL 240
KYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLG+SYAVNSIKQFVFLVLVHLDL
Sbjct:
405 KYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGRSYAVNSIKQFVFLVLVHLDL 464
Query:
241 ELINADVEIPEFDLSRYGFGLMQPEHDVPIRYRIRP 276
ELINADVEIPEFDLSRYGFGLMQPEHDVP+RYRIRP
Sbjct:
465 ELINADVEIPEFDLSRYGFGLMQPEHDVPVRYRIRP 500
>CYP8B1
SCAFFOLD114862:8-613, SCAFFOLD39206:3-626 no introns
BB882888.1
Macaca fascicularis = lower case
MVLWGPVLGALLVVIAGYLCLPGMLRQRRPREPPLDKGTVPWLGYAMAFRKNMFEFLKRM
RSKHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH
EMIHSASTKHLRGDGLKDLNETMLDSLSFVMLKSKGWSLDASCWHEDSLFHFCYYILFTA
GYLSLFGYTKDKEQDLLQAGEL
40aa gap
shsqxkegisnwlcnmlqflreqgvpsamqdkfnfmmlwasqgntgpts
FWALLFLLKHPEAIRAVRQETTQVLGEARLETKQSFAFKLSALQHTPVLDSVVEETLRLR
AAPTLLRLVHEDYTLKMASGQEYLFRRGDILALFPYLSVHVDPDIHPEPTIFKYDRFLNP
NGSRKVDFFKAGKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTP
LPHVDPQRWGFGTMQPSHDVRFRYRLRP
Query:
1
MVLWGPVLGALLVVIAGYLCLPGMLRQRRPREPPLDKGTVPWLGYAMAFRKNMFEFLKRM 60
MVLWGPVLGALLVVIAGYLCLPGMLRQRRP EPPLDKGTVPWLG+AMAFRKNMFEFLKRM
Sbjct:
1
MVLWGPVLGALLVVIAGYLCLPGMLRQRRPWEPPLDKGTVPWLGHAMAFRKNMFEFLKRM 60
Query:
61
RSKHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH 120
R+KHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH
Sbjct:
61
RTKHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH 120
Query:
121 EMIHSASTKHLRGDGLKDLNETMLDSLSFVMLKSKGWSLDASCWHEDSLFHFCYYILFTA 180
EMIHSASTKHLRGDGLKDLNETMLDSLSFVML SKGWSLDASCWHEDSLF FCYYILFTA
Sbjct:
121 EMIHSASTKHLRGDGLKDLNETMLDSLSFVMLTSKGWSLDASCWHEDSLFRFCYYILFTA 180
Query:
181 GYLSLFGYTKDKEQDLLQAGEL-------------------------------------- 202
GYLSLFGYTKDKEQDLLQAGEL
Sbjct:
181 GYLSLFGYTKDKEQDLLQAGELFMEFRKFDLLFPRFVYSLLWPREWLEVGRLQHLFHKML 240
Query:
203 --SHSQXKEGISNWLCNMLQFLREQGVPSAMQDKFNFMMLWASQGNTGPTSFWALLFLLK 260
SHSQ KEGISNWL NMLQFLREQGVPSAMQDKFNFMMLWASQGNTGPTSFWALL+LLK
Sbjct:
241 SVSHSQEKEGISNWLGNMLQFLREQGVPSAMQDKFNFMMLWASQGNTGPTSFWALLYLLK 300
Query:
261 HPEAIRAVRQETTQVLGEARLETKQSFAFKLSALQHTPVLDSVVEETLRLRAAPTLLRLV 320
HPEAIRAVR+E TQVLGEARLETKQSFAFKL ALQHTPVLDSVVEETLRLRAAPTLLRLV
Sbjct:
301 HPEAIRAVREEATQVLGEARLETKQSFAFKLGALQHTPVLDSVVEETLRLRAAPTLLRLV 360
Query:
321 HEDYTLKMASGQEYLFRRGDILALFPYLSVHVDPDIHPEPTIFKYDRFLNPNGSRKVDFF 380
HEDYTLKM+SGQEYLFR GDILALFPYLSVH+DPDIHPEPT+FKYDRFLNPNGSRKVDFF
Sbjct:
361 HEDYTLKMSSGQEYLFRHGDILALFPYLSVHMDPDIHPEPTVFKYDRFLNPNGSRKVDFF 420
Query:
381 KAGKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTPLPHVDPQRW 440
K GKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTPLPHVDPQRW
Sbjct:
421 KTGKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTPLPHVDPQRW 480
Query:
441 GFGTMQPSHDVRFRYRLRP 459
GFGTMQPSHDVRFRYRL P
Sbjct:
481 GFGTMQPSHDVRFRYRLHP 499
>CYP11A1
N-term = DQ228169.1 Macaca fascicularis = lower case (Mahrous)
mlakglpprsvlvkgcqtflsapkerlghlrvptsegagistrs
prpfneipspgdngwlnlyhfwretgthkvhlhhvqnfqkydpiy
REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLK (2)
KSAAWKKDRVALNQEVMAPETTKNFLPLLDAVSRDFVSVLHRRIKKAGSGNFSGDISDDLFRFAFE (1)
SITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAAWDVIFSK (1)
ADMYTENFHWELRQKGNVHHDYRGILYRLLGDSKMSFEDIKANVTEMLAGGVDT (0)
TSMTLQWHLYEMARNLKVQDMLRAEVLAARRQAQGDMATMLQLVPLLKASIKETLR (2)
LHPISVTLQRYLVNDLVLRGYMIPAK (0)
TLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITYFRNLGFGWGVRQCLGRRIAELEMTIFLIN (0)
MLENFRVEIQHLSDVGTTFNLILMPEKPISFTFWPFNQEATQ
>CYP11A1
NM_000781
Length = 521
Score = 857 bits (2214), Expect = 0.0
Identities = 422/431 (97%), Positives = 428/431 (99%)
Query:
1
REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKK 60
REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKK
Sbjct:
90 REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLKKSAAWKK 149
Query:
61
DRVALNQEVMAPETTKNFLPLLDAVSRDFVSVLHRRIKKAGSGNFSGDISDDLFRFAFES 120
DRVALNQEVMAPE TKNFLPLLDAVSRDFVSVLHRRIKKAGSGN+SGDISDDLFRFAFES
Sbjct:
150 DRVALNQEVMAPEATKNFLPLLDAVSRDFVSVLHRRIKKAGSGNYSGDISDDLFRFAFES 209
Query:
121 ITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAA 180
ITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAA
Sbjct:
210 ITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAA 269
Query:
181 WDVIFSKADMYTENFHWELRQKGNVHHDYRGILYRLLGDSKMSFEDIKANVTEMLAGGVD 240
WDVIFSKAD+YT+NF+WELRQKG+VHHDYRG+LYRLLGDSKMSFEDIKANVTEMLAGGVD
Sbjct:
270 WDVIFSKADIYTQNFYWELRQKGSVHHDYRGMLYRLLGDSKMSFEDIKANVTEMLAGGVD 329
Query:
241 TTSMTLQWHLYEMARNLKVQDMLRAEVLAARRQAQGDMATMLQLVPLLKASIKETLRLHP 300
TTSMTLQWHLYEMARNLKVQDMLRAEVLAAR QAQGDMATMLQLVPLLKASIKETLRLHP
Sbjct:
330 TTSMTLQWHLYEMARNLKVQDMLRAEVLAARHQAQGDMATMLQLVPLLKASIKETLRLHP 389
Query:
301 ISVTLQRYLVNDLVLRGYMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITY 360
ISVTLQRYLVNDLVLR YMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITY
Sbjct:
390 ISVTLQRYLVNDLVLRDYMIPAKTLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITY 449
Query:
361 FRNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPISF 420
FRNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPISF
Sbjct:
450 FRNLGFGWGVRQCLGRRIAELEMTIFLINMLENFRVEIQHLSDVGTTFNLILMPEKPISF 509
Query:
421 TFWPFNQEATQ 431
TFWPFNQEATQ
Sbjct:
510 TFWPFNQEATQ 520
>CYP11B1
(Lin Zhu, S.Hill)
MALRAKAEVCMAAPWLSLQRARALGTRATRVPRTVLPFEAMPRRPGNRWLRLLQIWREQGYEHLHLEVHQTFQELGPIFR (2)YDLGGAGMVCVMLPEDVEKLQQVDSLNPRRMSLEPWVAYRQHRGHKCGVFLL (2)NGPEWRFNRLRLNPDVLSPRAVQRFLPMVDAVARDFSQALRKKVLQNARGSLTLDVQPSIFHYTIE (1)ASNLALFGERLGLVGHSPSSASLSFLHALEVMFKSTVQLMFMPRSLSRWTSPKVWKEHFEAWDCIFQY (1)GDNCIQKIYQELALSRPQQYTSIVAELLLNAELSPDAIKANSMELTAGSVDT (0) TVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATTELPLLRAALKETLR (2)LYPVGLFLERVVSSDLVLQNYHIPAG (0)TLVRVFLYSLGRNPALFPRPERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHH (0)VLKHLQVETLTQEDIKMVYSFILRPSTFPLLTFRAIN
Score = 984 bits (2543), Expect = 0.0 Identities = 489/503 (97%), Positives = 494/503 (98%) Query: 1 MALRAKAEVCMAAPWLSLQRARALGTRATRVPRTVLPFEAMPRRPGNRWLRLLQIWREQG 60 MALRAKAEVCMA PWLSLQRA+ALGTRA RVPRTVLPFEAMPRRPGNRWLRLLQIWREQGSbjct: 1 MALRAKAEVCMAVPWLSLQRAQALGTRAARVPRTVLPFEAMPRRPGNRWLRLLQIWREQG 60 Query: 61 YEHLHLEVHQTFQELGPIFRYDLGGAGMVCVMLPEDVEKLQQVDSLNPRRMSLEPWVAYR 120 YE LHLEVHQTFQELGPIFRYDLGGAGMVCVMLPEDVEKLQQVDSL+P RMSLEPWVAYRSbjct: 61 YEDLHLEVHQTFQELGPIFRYDLGGAGMVCVMLPEDVEKLQQVDSLHPHRMSLEPWVAYR 120 Query: 121 QHRGHKCGVFLLNGPEWRFNRLRLNPDVLSPRAVQRFLPMVDAVARDFSQALRKKVLQNA 180 QHRGHKCGVFLLNGPEWRFNRLRLNP+VLSP AVQRFLPMVDAVARDFSQAL+KKVLQNASbjct: 121 QHRGHKCGVFLLNGPEWRFNRLRLNPEVLSPNAVQRFLPMVDAVARDFSQALKKKVLQNA 180 Query: 181 RGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSSASLSFLHALEVMFKSTVQLMFM 240 RGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSSASL+FLHALEVMFKSTVQLMFMSbjct: 181 RGSLTLDVQPSIFHYTIEASNLALFGERLGLVGHSPSSASLNFLHALEVMFKSTVQLMFM 240 Query: 241 PRSLSRWTSPKVWKEHFEAWDCIFQYGDNCIQKIYQELALSRPQQYTSIVAELLLNAELS 300 PRSLSRWTSPKVWKEHFEAWDCIFQYGDNCIQKIYQELA SRPQQYTSIVAELLLNAELSSbjct: 241 PRSLSRWTSPKVWKEHFEAWDCIFQYGDNCIQKIYQELAFSRPQQYTSIVAELLLNAELS 300 Query: 301 PDAIKANSMELTAGSVDTTVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATT 360 PDAIKANSMELTAGSVDTTVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATTSbjct: 301 PDAIKANSMELTAGSVDTTVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATT 360 Query: 361 ELPLLRAALKETLRLYPVGLFLERVVSSDLVLQNYHIPAGTLVRVFLYSLGRNPALFPRP 420 ELPLLRAALKETLRLYPVGLFLERV SSDLVLQNYHIPAGTLVRVFLYSLGRNPALFPRPSbjct: 361 ELPLLRAALKETLRLYPVGLFLERVASSDLVLQNYHIPAGTLVRVFLYSLGRNPALFPRP 420 Query: 421 ERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHHVLKHLQVETLTQED 480 ERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHHVLKHLQVETLTQEDSbjct: 421 ERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHHVLKHLQVETLTQED 480 Query: 481 IKMVYSFILRPSTFPLLTFRAIN 503 IKMVYSFILRPS PLLTFRAINSbjct: 481 IKMVYSFILRPSMCPLLTFRAIN 503
>CYP17
AY746983.1 and AF458332.1
MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLP
RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQVTTL
DILSNNRKGIAFADYGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH
NGQTIDISFPVFVAITNVISLICFNISYKNGDPELKIVHNYNEGIIDSLGKESLVDLF
PWLKVFPNKTLEKLKRHVKTRNDLLTKIFENYKEKFHSDSITNMLDVLMQAKMNSDNG
NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWIVAFLLHNPQVKKKLYEEIDQ
NVGFSRTPTISDRNRLLLLEATIREVLRIRPVAPMLIPHKANVDSSIGEFAVDKGTHV
IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSLSYLPFGAGPRSCIGEILARQ
ELFLIMAWLLQRFDLEVPDDGQLPSLEGNPKVVFLIDSFKVKIKVRQAWREAQAEGST
>CYP17 (S. Hill)
partial
MWELVALLLFTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLPRHGHMHNNFFKLQKKY
GPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQXTTLDILSNNRKGIAFADYGAHW
QLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATHNGQTIDISFPVFVAITNVISL
ICFNISYKNGDPELKIVHNYNEGIIDSLGKESLVDLFPWLK
>CYP19 (Iyer)MVLEMLNPMHYNITSMVPEAMPAATMPILLLTGLFLLVWNYEGTSSIP (1)GPGYCMGIGPLISHGRFLWMGIGSACNYYNQVYGEFMRVWISGEETLIISK (2)SSSMFHIMKHNHYSSRFGSKLGLQCIGMHEKGIIFNNNPDLWKTTRPFFMK (1)ALSGPGLVRMVTVCAESLKTHLDRLEEVTNESGYVDVLTLLRRVMLDTSNMLFLRIPLD (1)ESAIVVKIQGYFDAWQALLIKPDIFFKISWLYKKYEKSV (2)KDLKDAIEVLIAEKRRRISTEEKLEECMDFATELILAE (0)KRGDLTRENVNQCILEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIMKEIQTVV (1)GERDVKIDDMQKLKVMENFIYESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENFAKN (0)VPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVERIQKIHDLSSHPDETKNMLEMIFTPRNSDRCLEH >CYP19 NM_000103 Length = 503 Score = 997 bits (2578), Expect = 0.0 Identities = 490/503 (97%), Positives = 499/503 (99%) Query: 1 MVLEMLNPMHYNITSMVPEAMPAATMPILLLTGLFLLVWNYEGTSSIPGPGYCMGIGPLI 60 MVLEMLNP+HYNITS+VPEAMPAATMP+LLLTGLFLLVWNYEGTSSIPGPGYCMGIGPLISbjct: 1 MVLEMLNPIHYNITSIVPEAMPAATMPVLLLTGLFLLVWNYEGTSSIPGPGYCMGIGPLI 60 Query: 61 SHGRFLWMGIGSACNYYNQVYGEFMRVWISGEETLIISKSSSMFHIMKHNHYSSRFGSKL 120 SHGRFLWMGIGSACNYYN+VYGEFMRVWISGEETLIISKSSSMFHIMKHNHYSSRFGSKLSbjct: 61 SHGRFLWMGIGSACNYYNRVYGEFMRVWISGEETLIISKSSSMFHIMKHNHYSSRFGSKL 120 Query: 121 GLQCIGMHEKGIIFNNNPDLWKTTRPFFMKALSGPGLVRMVTVCAESLKTHLDRLEEVTN 180 GLQCIGMHEKGIIFNNNP+LWKTTRPFFMKALSGPGLVRMVTVCAESLKTHLDRLEEVTNSbjct: 121 GLQCIGMHEKGIIFNNNPELWKTTRPFFMKALSGPGLVRMVTVCAESLKTHLDRLEEVTN 180 Query: 181 ESGYVDVLTLLRRVMLDTSNMLFLRIPLDESAIVVKIQGYFDAWQALLIKPDIFFKISWL 240 ESGYVDVLTLLRRVMLDTSN LFLRIPLDESAIVVKIQGYFDAWQALLIKPDIFFKISWLSbjct: 181 ESGYVDVLTLLRRVMLDTSNTLFLRIPLDESAIVVKIQGYFDAWQALLIKPDIFFKISWL 240 Query: 241 YKKYEKSVKDLKDAIEVLIAEKRRRISTEEKLEECMDFATELILAEKRGDLTRENVNQCI 300 YKKYEKSVKDLKDAIEVLIAEKR RISTEEKLEECMDFATELILAEKRGDLTRENVNQCISbjct: 241 YKKYEKSVKDLKDAIEVLIAEKRCRISTEEKLEECMDFATELILAEKRGDLTRENVNQCI 300 Query: 301 LEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIMKEIQTVVGERDVKIDDMQKLKVMENFI 360 LEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAI+KEIQTV+GERD+KIDD+QKLKVMENFISbjct: 301 LEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIIKEIQTVIGERDIKIDDIQKLKVMENFI 360 Query: 361 
YESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENFAK 420 YESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENFAKSbjct: 361 YESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNIGRMHRLEFFPKPNEFTLENFAK 420 Query: 421 NVPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVERIQKIHDLSSH 480 NVPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVE IQKIHDLS HSbjct: 421 NVPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVESIQKIHDLSLH 480 Query: 481 PDETKNMLEMIFTPRNSDRCLEH 503 PDETKNMLEMIFTPRNSDRCLEHSbjct: 481 PDETKNMLEMIFTPRNSDRCLEH 503
>CYP21
(Blackwell) partial
LVSKNYPDLSLGDYSLLWKAHKKLTRSALLLGMRDSMEPVVEQLTQEFCERMRAQAGTPV
AIEEEFSLLTCSIICHLTFGDKIKDNLVPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFFP
NPGLRRLKQAIEKRDHIVEKQLRQHKESLVAGQWRDMMDYMLQVVAQPSMEEGSGQLLEG
HVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPSASSSRVPYKDR
ARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDEMVWE
RPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPPGDALP
SLQPLPHCSVILKMQPFQVWLQPRGLGVHSLGQSQ
Query: 1
LVSKNYPDLSLGDYSLLWKAHKKLTRSALLLGMRDSMEPVVEQLTQEFCERMRAQAGTPV 60
LVS+NYPDLSLGDYSLLWKAHKKLTRSALLLG+RDSMEPVVEQLTQEFCERMRAQ GTPV
Sbjct: 100
LVSRNYPDLSLGDYSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCERMRAQPGTPV 159
Query: 61
AIEEEFSLLTCSIICHLTFGDKIKD-NLVPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFF 119
AIEEEFSLLTCSIIC+LTFGDKIKD NL+PAYYKCIQEVLKTWSHWSIQIVDVIPFLRFF
Sbjct: 160
AIEEEFSLLTCSIICYLTFGDKIKDDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFF 219
Query: 120
PNPGLRRLKQAIEKRDHIVEKQLRQHKESLVAGQWRDMMDYMLQVVAQPSMEEGSGQLLE 179
PNPGLRRLKQAIEKRDHIVE QLRQHKESLVAGQWRDMMDYMLQ VAQPSMEEGSGQLLE
Sbjct: 220 PNPGLRRLKQAIEKRDHIVEMQLRQHKESLVAGQWRDMMDYMLQGVAQPSMEEGSGQLLE 279
Query: 180
GHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPSASSSRVPYKD 239
GHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGP ASSSRVPYKD
Sbjct: 280
GHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPGASSSRVPYKD 339
Query: 240
RARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDEMVW 299
RARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDE VW
Sbjct: 340
RARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDETVW 399
Query: 300
ERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPPGDAL 359
ERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLP GDAL
Sbjct: 400
ERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPSGDAL 459
Query: 360
PSLQPLPHCSVILKMQPFQVWLQPRGLGVHSLGQSQ 395
PSLQPLPHCSVILKMQPFQV LQPRG+G HS GQ+Q
Sbjct: 460
PSLQPLPHCSVILKMQPFQVRLQPRGMGAHSPGQNQ 495
>CYP24
(S.Jain) partial
MSSPISKSRSLAAFLQQLRSPRQPPRPVTSTAYTSPQPREVPVCPLTAGGETQNAAALPG
PTSWPLLGSLLQILWKGGLKKQHDTL(0)VEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR
TESAYPQRLEIKPWKAYRDYRKEGYGLLIL
>CYP24
NM_000782
Length = 513
Score = 295 bits (754), Expect = 6e-83
Identities = 146/150 (97%), Positives = 146/150 (97%), Gaps = 1/150 (0%)
Query:
1 MSSPISKSRSLAAFLQQLRSPRQPPRPVTSTAYTSPQPREVPVCPLTAGGETQNAAALPG 60
MSSPISKSRSLAAFLQQLRSPRQPPR VTSTAYTSPQPREVPVCPLTAGGETQNAAALPG
Sbjct:
1
MSSPISKSRSLAAFLQQLRSPRQPPRLVTSTAYTSPQPREVPVCPLTAGGETQNAAALPG 60
Query:
61
PTSWPLLGSLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR 120
PTSWPLL SLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR
Sbjct:
61
PTSWPLLASLLQILWKGGLKKQHDTLVEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYR 120
Query:
121 TESAYPQRLEIKPWKAYRDYRKEGYGLLIL 150
TES PQRLEIKPWKAYRDYRKEGYGLLIL
Sbjct:
121 TESV-PQRLEIKPWKAYRDYRKEGYGLLIL 149
>CYP26A1
(Liao, Iyer)
MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM
VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGEHRLVSVHWPASVRTIL
GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE
VKRLMFRIAMRILLGCEPQLAGDGDAEQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR
NLIHARIEQNIRAKICGLRASEAGRGCKDALQLLIEHSWERGERLDMQALKQSSTELLFG
GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI
KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM
LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV
YPVDNLPARFTHFHGEI
Length = 497
Score = 1003 bits (2594), Expect = 0.0
Identities = 493/497 (99%), Positives = 496/497 (99%)
Query:
1
MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM 60
MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM
Sbjct:
1
MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM 60
Query:
61 VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGEHRLVSVHWPASVRTIL 120
VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLG+ RLVSVHWPASVRTIL
Sbjct:
61
VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGDDRLVSVHWPASVRTIL 120
Query:
121 GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE 180
GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE
Sbjct:
121 GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE 180
Query:
181 VKRLMFRIAMRILLGCEPQLAGDGDAEQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR 240
VKRLMFRIAMRILLGCEPQLAGDGD+EQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR
Sbjct:
181 VKRLMFRIAMRILLGCEPQLAGDGDSEQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR 240
Query:
241 NLIHARIEQNIRAKICGLRASEAGRGCKDALQLLIEHSWERGERLDMQALKQSSTELLFG 300
NLIHARIEQNIRAKICGLRASEAG+GCKDALQLLIEHSWERGERLDMQALKQSSTELLFG
Sbjct:
241 NLIHARIEQNIRAKICGLRASEAGQGCKDALQLLIEHSWERGERLDMQALKQSSTELLFG 300
Query:
301 GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI 360
GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI
Sbjct:
301 GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI 360
Query:
361 KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM 420
KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM
Sbjct:
361 KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM 420
Query:
421 LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV 480
LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV
Sbjct:
421 LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV 480
Query:
481 YPVDNLPARFTHFHGEI 497
YPVDNLPARFTHFHGEI
Sbjct:
481 YPVDNLPARFTHFHGEI 497
>CYP26B1
(S.Jain, Penmatsa) partial
VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIP
EEDLGHLFEVYQQFVENVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDY
SDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLR
EELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDG
FQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLG
KHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNKILPE
TEAMLSATV
>CYP26B1
AC007002
Length = 512
Score = 727 bits (1876), Expect = 0.0
Identities = 365/369 (98%), Positives = 368/369 (99%)
Query:
1 VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIP 60
VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIP
Sbjct:
144 VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIP 203
Query:
61
EEDLGHLFEVYQQFVENVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDY 120
EEDLGHLFEVYQQFV+NVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDY
Sbjct:
204 EEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDY 263
Query:
121 SDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLR 180
DALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLR
Sbjct:
264 LDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLR 323
Query:
181 EELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDG 240
+ELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDG
Sbjct:
324 DELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDG 383
Query:
241 FQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLG 300
FQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLG
Sbjct:
384 FQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLG 443
Query:
301 KHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNKILPE 360
KHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQN+ILPE
Sbjct:
444 KHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPE 503
Query:
361 TEAMLSATV 369
TEAMLSATV
Sbjct:
504 TEAMLSATV 512
>CYP26C1
(C.Blackwell) missing an exon
MFPWGLSCLSVLGAAGTAVLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFGETLHWLV(0)
QGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQSAHILLGSHTLLGAV
GEPHRRRRK (0)
VLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVSVYDAAKALTFRMAARILLGLRLDEAQCATL
ARTFEQLVENLFSLPLDVPFSGLRK
(0)
SAVELLFAAFFTTASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGCVGPPPDCG
CEPDLSLAALGSLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD(0)
GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGAAREDSRGASSRFHYIPFGGGARSCLGQEL
AQTVLQLLAVELVRTARWELATPAFPALQTVPIVHPVDGLRLFFHPLAPLVAGDGLCL
Query: 1
MFPWGLSCLSVLGAAGTAVLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFG 60
MFPWGLSCLSVLGAAGTA+LCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFG
Sbjct: 1
MFPWGLSCLSVLGAAGTALLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFG 60
Query: 61
ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQS 120
ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQS
Sbjct: 61
ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQS 120
Query: 121
AHILLGSHTLLGAVGEPHRRRRKVLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVS 180
AHILLGSHTLLGAVGEPHRRRRKVLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVS
Sbjct: 121
AHILLGSHTLLGAVGEPHRRRRKVLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVS 180
Query: 181
VYDAAKALTFRMAARILLGLRLDEAQCATLARTFEQLVENLFSLPLDVPFSGLRK----- 235
VYDA+KALTFRMAARILLGLRLDEAQCATLARTFEQLVENLFSLPLDVPFSGLRK
Sbjct: 181
VYDASKALTFRMAARILLGLRLDEAQCATLARTFEQLVENLFSLPLDVPFSGLRKGIRAR 240
Query: 236
------------------------------------------------SAVELLFAAFFT 247
SAVELLFAAFFT
Sbjct: 241 DQLHRHLEGAISEKLHEDKAAEPGDALDLIIHSARELGHEPSMQELKESAVELLFAAFFT 300
Query: 248
TASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGCVGPPPDCGCEPDLSL 307
TASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGG GPPPDCGCEPDLSL
Sbjct: 301
TASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGSEGPPPDCGCEPDLSL 360
Query: 308
AALGSLRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYSIRDTHETAAV 367
AALG LRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYSIRDTHETAAV
Sbjct: 361
AALGRLRYVDCVVKEVLRLLPPVSGGYRTALRTFELDGYQIPKGWSVMYSIRDTHETAAV 420
Query: 368
YRSPPEGFDPERFGAAREDSRGASSRFHYIPFGGGARSCLGQELAQTVLQLLAVELVRTA 427
YRSPPEGFDPERFGAAREDSRGASSR HYIPFGGGARSCLGQELAQ VLQLLAVELVRTA
Sbjct: 421
YRSPPEGFDPERFGAAREDSRGASSRLHYIPFGGGARSCLGQELAQAVLQLLAVELVRTA 480
Query:
428 RWELATPAFPALQTVPIVHPVDGLRLFFHPLAPLVAGDGLCL 469
RWELATPAFPA+QTVPIVHPVDGLRLFFHPL P VAG+GLCL
Sbjct: 481
RWELATPAFPAMQTVPIVHPVDGLRLFFHPLTPSVAGNGLCL 522
>CYP27A1
(Q.Tran)
QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDLHDLTYGPFTT(2)
EGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMIRLDQLRAESASGNQVSDTAQLFYYFALE
(1)
AICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWNAIFSF
(1)
GKKLIDEKLEDMEAQLQAEGPDGVQVSGYLHFLLASGQLSPREAMGSLPELLMAGVDT
(0)
TSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHLPLLKAVLKETLR (2)
LYPVVPTNSRIIEKEIEVDGFLFPKN
(0)
TQFVFCHYVVSRDPTTFSEPESFQPHRWLRSSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLAR
(0)
LIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQC
>CYP27A1 NM_000784 Length = 531 Score = 897 bits (2319), Expect = 0.0 Identities = 439/447 (98%), Positives = 442/447 (98%) Query: 1 QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDLHDLTY 60 QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRD HDLTYSbjct: 85 QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDQHDLTY 144 Query: 61 GPFTTEGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMIRLDQLRAESASGNQVSD 120 GPFTTEGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFM RLDQLRAESASGNQVSDSbjct: 145 GPFTTEGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMTRLDQLRAESASGNQVSD 204 Query: 121 TAQLFYYFALEAICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPV 180 AQLFYYFALEAICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVSbjct: 205 MAQLFYYFALEAICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPV 264 Query: 181 LPFWKRYLDGWNAIFSFGKKLIDEKLEDMEAQLQAEGPDGVQVSGYLHFLLASGQLSPRE 240 LPFWKRYLDGWNAIFSFGKKLIDEKLEDMEAQLQA GPDG+QVSGYLHFLLASGQLSPRESbjct: 265 LPFWKRYLDGWNAIFSFGKKLIDEKLEDMEAQLQAAGPDGIQVSGYLHFLLASGQLSPRE 324 Query: 241 AMGSLPELLMAGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHLP 300 AMGSLPELLMAGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAH+PSbjct: 325 AMGSLPELLMAGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHMP 384 Query: 301 LLKAVLKETLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTTFSEPESF 360 LLKAVLKETLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPT FSEPESFSbjct: 385 LLKAVLKETLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTAFSEPESF 444 Query: 361 QPHRWLRSSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPE 420 QPHRWLR+SQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPESbjct: 445 QPHRWLRNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPE 504 Query: 421 TGELKSVARIVLVPNKKVGLQFLQRQC 447 TGELKSVARIVLVPNKKVGLQFLQRQCSbjct: 505 TGELKSVARIVLVPNKKVGLQFLQRQC 531
>CYP27B1
(Xin Liu)
MTQTLKYASRVFHRVRWAPELGASLGYREYDSARRSLADIPGPSTPSFLAELFCKGGLSR
LHELQVQGAARFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRRR
QRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAETLDNVVRDLVRRLRCQRGRGTGPP
ALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPH
WLRRLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNRGQPDEDLESGAHLTHFLFQE
ELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALGPGSSAHPP
ATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPA
QFPEPNSFHPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILIHFEVQP
EPGAAPIRPMTRTVLVPERSINLQFLDR >CYP27B1 NM_000785 Length = 508 Score = 985 bits (2547), Expect = 0.0 Identities = 489/508 (96%), Positives = 495/508 (97%) Query: 1 MTQTLKYASRVFHRVRWAPELGASLGYREYDSARRSLADIPGPSTPSFLAELFCKGGLSR 60 MTQTLKYASRVFHRVRWAPELGASLGYREY SARRSLADIPGPSTPSFLAELFCKGGLSRSbjct: 1 MTQTLKYASRVFHRVRWAPELGASLGYREYHSARRSLADIPGPSTPSFLAELFCKGGLSR 60 Query: 61 LHELQVQGAARFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRRR 120 LHELQVQGAA FGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRR RSbjct: 61 LHELQVQGAAHFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRCR 120 Query: 121 QRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAETLDNVVRDLVRRLRCQRGRGTGPP 180 QRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYA TL+NVV DLVRRLR QRGRGTGPPSbjct: 121 QRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLNNVVCDLVRRLRRQRGRGTGPP 180 Query: 181 ALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPH 240 ALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPHSbjct: 181 ALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPH 240 Query: 241 WLRRLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNRGQPDEDLESGAHLTHFLFQE 300 WLR LVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRN GQP++DLESGAHLTHFLF+ESbjct: 241 WLRHLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNGGQPEKDLESGAHLTHFLFRE 300 Query: 301 ELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALGPGSSAHPP 360 ELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAAL PGSSA+P Sbjct: 301 ELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALSPGSSAYPS 360 Query: 361 ATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPA 420 ATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPASbjct: 361 ATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPA 420 Query: 421 QFPEPNSFHPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILIHFEVQP 480 QFPEPNSF PARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQIL HFEVQPSbjct: 421 QFPEPNSFRPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILTHFEVQP 480 Query: 481 EPGAAPIRPMTRTVLVPERSINLQFLDR 508 EPGAAP+RP TRTVLVPERSINLQFLDRSbjct: 481 EPGAAPVRPKTRTVLVPERSINLQFLDR 508
>CYP27C1
(F.Zhang) partial
MQTSAMALLARILRAGLRPAPERGGLLGGAAPRRPQPAGARLPAGARAEDKGAGRPGAPP
AGGRAEGPRLAAMPGPRTLANLAEFFYRDGFSRIHEIQQKHT
QEYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISA
EGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLF
FKYSMEGVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIP
KPWREFCRSWDGLFKF >CYP27C1 AC027142 43% identical to 27A1 assembled gene Length = 542 Score = 586 bits (1511), Expect = e-170 Identities = 293/301 (97%), Positives = 296/301 (98%), Gaps = 1/301 (0%) Query: 1 MQTSAMALLARILRAGLRPAPERGGLLGGAAPRRPQPAGARLPAGARAEDKGAGRPGAPP 60 MQTSAMALLARILRAGLRPAPERGGLLGG APRRPQPAGARLPAGARAEDKGAGRPG+PPSbjct: 1 MQTSAMALLARILRAGLRPAPERGGLLGGGAPRRPQPAGARLPAGARAEDKGAGRPGSPP 60 Query: 61 AGGRAEGPR-LAAMPGPRTLANLAEFFYRDGFSRIHEIQQKHTQEYGKIFKSHFGPQFVV 119 GGRAEGPR LAAMPGPRTLANLAEFF RDGFSRIHEIQQKHT+EYGKIFKSHFGPQFVVSbjct: 61 GGGRAEGPRSLAAMPGPRTLANLAEFFCRDGFSRIHEIQQKHTREYGKIFKSHFGPQFVV 120 Query: 120 SIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISAEGEQWLKMRSVLRQRIL 179 SIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISAEGEQWLKMRSVLRQRILSbjct: 121 SIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISAEGEQWLKMRSVLRQRIL 180 Query: 180 KPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSMEGVATILYESRL 239 KPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSMEGVATILYESRLSbjct: 181 KPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSMEGVATILYESRL 240 Query: 240 GCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFCRSWDGLFKFK 299 GCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFCRSWDGLFKF Sbjct: 241 GCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFCRSWDGLFKFS 300>CYP39
(Q.Tran)
MELIFPTVIIILGCLALFLLLQRKNLRRPPCIRGWIPWIGVGFEFGKAPLEFIEKARIK
(0)
YGPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYHT
(1)
ASIPKNVFLALHEKLYIMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLV
(2)
RHLLYPVTVNTLFNKSWFPTNKKKIKEFHQYFQAYDEDFEYGSQLPECLL
(2)
RNWSKSKKWFLELFEKNIPDIKACKSAKDNSM
(0)
TLLQATLDIVETETSKENSPNYGLLLLWASLSNAVP(0)
VAFWTLAYVLSHPDIHKAVMEGISSVFGTA (1)
GKDKIKVSEDDLEKLLLIKWCVLETIRLKAPGVITRKVVKPVEIL
(0)
NYIIPSGDLLMLSPFWLHRNPKYFPEPELFKP
(0)
ERWKKANLEKHSFLDCFVAFGSGKFQCPG
(2)
RWFALLEVQMCVILILYKYDCSLLDPLPKQ (0)
SSLHLVGVPQPEGQCRIEYKQRI
>CYP39A1 AC008104 AL035670 note heme region exon corrected 1/18/02 Length = 469 Score = 934 bits (2413), Expect = 0.0 Identities = 455/469 (97%), Positives = 459/469 (97%) Query: 1 MELIFPTVIIILGCLALFLLLQRKNLRRPPCIRGWIPWIGVGFEFGKAPLEFIEKARIKY 60 MELI PTVIIILGCLALFLLLQRKNLRRPPCI+GWIPWIGVGFEFGKAPLEFIEKARIKYSbjct: 1 MELISPTVIIILGCLALFLLLQRKNLRRPPCIKGWIPWIGVGFEFGKAPLEFIEKARIKY 60 Query: 61 GPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYHTASIPKNVFLALHEKLY 120 GPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVY TASIPKNVFLALHEKLYSbjct: 61 GPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYRTASIPKNVFLALHEKLY 120 Query: 121 IMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVRHLLYPVTVNTLFNKSWF 180 IMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVRHLLYPVTVN LFNKS FSbjct: 121 IMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLVRHLLYPVTVNMLFNKSLF 180 Query: 181 PTNKKKIKEFHQYFQAYDEDFEYGSQLPECLLRNWSKSKKWFLELFEKNIPDIKACKSAK 240 TNKKKIKEFHQYFQ YDEDFEYGSQLPECLLRNWSKSKKWFLELFEKNIPDIKACKSAKSbjct: 181 STNKKKIKEFHQYFQVYDEDFEYGSQLPECLLRNWSKSKKWFLELFEKNIPDIKACKSAK 240 Query: 241 DNSMTLLQATLDIVETETSKENSPNYGLLLLWASLSNAVPVAFWTLAYVLSHPDIHKAVM 300 DNSMTLLQATLDIVETETSKENSPNYGLLLLWASLSNAVPVAFWTLAYVLSHPDIHKA+MSbjct: 241 DNSMTLLQATLDIVETETSKENSPNYGLLLLWASLSNAVPVAFWTLAYVLSHPDIHKAIM 300 Query: 301 EGISSVFGTAGKDKIKVSEDDLEKLLLIKWCVLETIRLKAPGVITRKVVKPVEILNYIIP 360 EGISSVFG AGKDKIKVSEDDLE LLLIKWCVLETIRLKAPGVITRKVVKPVEILNYIIPSbjct: 301 EGISSVFGKAGKDKIKVSEDDLENLLLIKWCVLETIRLKAPGVITRKVVKPVEILNYIIP 360 Query: 361 SGDLLMLSPFWLHRNPKYFPEPELFKPERWKKANLEKHSFLDCFVAFGSGKFQCPGRWFA 420 SGDLLMLSPFWLHRNPKYFPEPELFKPERWKKANLEKHSFLDCF+AFGSGKFQCP RWFASbjct: 361 SGDLLMLSPFWLHRNPKYFPEPELFKPERWKKANLEKHSFLDCFMAFGSGKFQCPARWFA 420 Query: 421 LLEVQMCVILILYKYDCSLLDPLPKQSSLHLVGVPQPEGQCRIEYKQRI 469 LLEVQMC+ILILYKYDCSLLDPLPKQS LHLVGVPQPEGQCRIEYKQRISbjct: 421 LLEVQMCIILILYKYDCSLLDPLPKQSYLHLVGVPQPEGQCRIEYKQRI 469 >CYP46 C-term part (Ramy.Naguib)RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP
GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME
VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPTPPPPPC
Alignment of the two proteins
Identities = 173/174 (99%), Positives = 173/174 (99%), Gaps = 0/174 (0%)
Query 1 RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP 60
RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP
Sbjct 327 RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVP 386
Query 61 GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME 120
GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME
Sbjct 387 GNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQME 446
Query 121 VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPTPPPPPC 174
VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQP PPPPPC
Sbjct 447 VKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPAPPPPPC 500
>CYP51
(Y.Peng) partial
AAAAGMMLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYLFRLAAGHLVQLPAG (0)KSPPYIFSPIPFLGHAIAFGKSPVEFLENAYEK (0)VFGKGVAYDVPNP (0)VFLEQKKMLKSGLNIAHFKQHVSIIEKETKEYFQSWGESGEK (1)VFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSF (2)RRRDRAHREIKNIFYKAIQKRRQSQEKIDDILQTLLDATYK (0)DGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCYLEQKT
VCGENLPPLTYDQ (0)TVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGE
KFAYVPFGA (1)GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPVIRYKRRSK Score = 2176 (766.0 bits), Expect = 5.2e-231, P = 5.2e-231 Identities = 422/428 (98%), Positives = 423/428 (98%) Query: 1 AAAAGMLLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYL-RLAAGHLVQLP 59 AAAAGM+LLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYL RLAAGHLVQLPSbjct: 1 AAAAGMMLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYLFRLAAGHLVQLP 60 Query: 60 AGKSPPYIFSPIPFLGHAIAFGKSP-EFLENAYEKVFGKGVAYDVPNPVFLEQKKMLKSG 118 AGKSPPYIFSPIPFLGHAIAFGKSP EFLENAYEKVFGKGVAYDVPNPVFLEQKKMLKSGSbjct: 61 AGKSPPYIFSPIPFLGHAIAFGKSPVEFLENAYEKVFGKGVAYDVPNPVFLEQKKMLKSG 120 Query: 119 LNIAHFKQHVSIIEKETKEYF-SWGESGEKVFEALSELIILTASHCLHGKEIRSQLNEKV 177 LNIAHFKQHVSIIEKETKEYF SWGESGEKVFEALSELIILTASHCLHGKEIRSQLNEKVSbjct: 121 LNIAHFKQHVSIIEKETKEYFQSWGESGEKVFEALSELIILTASHCLHGKEIRSQLNEKV 180 Query: 178 AQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIK-IFYKAIQKRRQSQEKIDDILQ 236 AQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIK IFYKAIQKRRQSQEKIDDILQSbjct: 181 AQLYADLDGGFSHAAWLLPGWLPLPSFRRRDRAHREIKNIFYKAIQKRRQSQEKIDDILQ 240 Query: 237 TLLDATYKDGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQ-KCYLEQKT 295 TLLDATYKDGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQ KCYLEQKTSbjct: 241 TLLDATYKDGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCYLEQKT 300 Query: 296 VCGENLPPLTYDQTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGE 355 VCGENLPPLTYDQTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGESbjct: 301 VCGENLPPLTYDQTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGE 360 Query: 356 KFAYVPFGAGRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPV 415 KFAYVPFGAGRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPVSbjct: 361 KFAYVPFGAGRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPV 420 Query: 416 IRYKRRSK 423 IRYKRRSKSbjct: 421 IRYKRRSK 428 FASTA format of assembled sequences
>CYP1A1
MLFRISMSATEFLLASLIFCLVFWVIRASRPRVPKGLKNPPGPW
GWPLIGHILTLGKNPHLALSRMSQRYGDVLQIRIGSTPVLVLSGLDTIRQALVQQGDD
FKGRPNLYSFTLISNGQSMSFGPDSGPVWAARRRLAQNGLKSFSIASDPASSSSCYLE
EHVSKEAEVLISKLQEQMAGPGHFNPYRYVVISVANVICAICFGQRYDHNHQELLSLV
NLSNNFGEVVGSGNPADFIPILRYLPNRSLNGFKDLNEKFHSFMQKMIKEHYKTFEKG
HIRDITDSLIEHCQEKQLDENANIQLSDEKIVNVVLDLFGAGFDTVTTAISWSLMYLV
TNPRVQRKIQEELDTVIGRSRRPRLSDRSHLPYMEAFILETFRHSSFVPFTIPHSTTR
DTSLKGFYIPKGRCVFVNQWQINHDQKLWVNPSEFLPERFITPDGAIDKVLSEKVILF
GLGKRKCIGETIARWEVFLFLAILLQRVEFSVPPGVKVDMTPIYGLTMKHACCEHFQM
QLRS
>CYP1A2
(Aggarwal)
MALSQSVPFSATELLLASAIFCLVFWVLRGSRPRVPKGLKSPPEPWGWPLLGHVLTLGKNPHLALARMSQLYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGNDFKGRPDLYSFTFITDGQSMSFSPDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAEALISRLQELMAGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNSHEFVESASSANPVDFFPILRYLPNPALQRFKAFNQRFRRFLQKTVQEHYQDFDK (0)NSVQDITGALFKHSKKGPRASGNLIPQEKTVNLVNDIFGA (1)GFDTIATAISWSLMYLVTKPEIQRKIQKEL (1)DAVIGRGRRPRLSDRPQLPYLEAFILETFRHSSFVPFTIPH (2)STTRDTTLNGFYIPRECCVFINQWQVNHDP (2)QLWGDPSEFRPERFLTAEGTTINKPLSEKIMLFGLGKRRCIGEVLGKWEVFLFLAILLQQLEFSVPPGVKVDLTPIYGLTMKHARCEHFQAR
>CYP1A8P
ortholog, possibly a functional gene in rhesus
MILNLAVTPGEVTTSLIILVMVFVFVRALRSKGRKQLSPPGPWSFPIIGNLLQLGEHPYL
TLMEMRKKYGDVFLLKLGMVPVLVVNGMEMVKQVLLKDGEHFAGRPNMHTFSFLAEGKSL
SFSVNYGESWKLHKKIASKALRTLSNAEAKSSTCSCLLEEHVTEEVSELVTVFVELSSKN
GGFDPRNAITCAVANVVCALCFGKRYDHSDEEFLKIVKTNDDLLKASSAANPADFIPCLR
YLPLQIINAPREFYRALNGFIALHVQDHLATYDK
(0)
DHIRDITDALINVCHNKYAATKTDTLNDSEIISTVNDLFGA
(1)
GFETVSTCLYWSFLYLIHYPEIQAKIQEEI
(1)
DGNIGLKPPRFEDRKILPYTEAFISEVFRHASFLPFTIPHC
(2)
TTADTTLNGYFIPRKTCTFINMYQVNHDE
(2)
TIWDNPSLFRPDRFLNENRELNKSLVEKVLIFGMGIRKCLGEDVARNEIFIFITAVLQQL
KLKKCPRAKLDLTPTYGLVMRPKPYQLEAERRSSGSSSA
>CYP1B1
MGTGLSPKDPWPLNLLSTQQTTLLLLLSVLVAVHVGQWLLRQRRRQLGSTPPGPFAWPLI
GNAAAVGQASHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPSF
ASFRVISGGRSMAFGHYSEHWKVQRRAAHSTMRNFSTRQLRSRQVLEGHVLSEARELVAL
LVRGSADGAFLDPRQLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSL
VDVMPWLQYFPNPMRTAFREFEQLNRNFSNFVLDKFLRHCESLRPGAAPRDMMDAFILSA
EKKAARDSDDGGARLDLENVPATVTDIFGASQDTLSTALQWLLLLFIR
(2)
YPDVQARVQAELDQVVGRDRLPCMDDQPNLPYVLAFLYEAMRFSSFVPVTIPHATNANTS
VLGYHIPKDTVIFVNQWSVNHDPVKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKR
RCIGEELSKMQLFLFISILAHQCNFRANPNGPEMNFSYGLTIKPKSFKVNVTLRESMELL
DSAVQKLQAEETCQ
>CYP2A24
AY635460
MLASGLLLVALLACLTVMVLMSVWQQRNSKGKLPPGPTPLPFIG
NYLQLNTEQMCNSLMKISERYGPVFTIHLGPRRVVVLCGYDAVKEALVDQAEEFSGRG
EQATFDWVFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIEAL
RDTHGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLGMMLGSFQFTSTSTGQ
LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNQHTLDPNSPRDFIDSFLIRMQ
EEEKNPNTEFYLKNLMMTTLNLFIAGTETVSTTLRYGFLLLMKYPEVEAKVHEEIDRV
IGKNRQPKFEDRVKMPYMEAVIHEIQRFGDVIPMSLARRVNKDTKFRDFFLPKGTEVF
PMLGSVLRDPRFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSIGKRNCFGEGLARMELF
LFFTTIMQNFRFKSPQLPKDIDVSPKHVGFATIPPNYTMSFLPR
>CYP2A23
AY635459.1
MLASGLLLVALLACLTVMVLMSVWQQRNSKGKLPPGPTPLPFIG
NYLQLNTEQMYNSIMKISERYGPVFTIHLGPRRIVVLCGYDAVKEALVDQAEEFSGRG
EQATFDWLFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIEAL
RDTQGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSAGQ
LYEMFSSVMKHLPGPQQQAFKELQGLEDFIAKKVEHNRRTLDPNSPRDFIDSFLIRMQ
EEEKNPNTEFHLKNLVLTSLNLFFGGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRV
IGKNRQPKFEDQARMPYMEAVIHEIQRFGDMLPLGVAHRVIKDTKFRDFFLPKGTEVF
PMLGSVLKDPKFFSNPQDFNPQHFLDEKGQFKKSDAFVPFSIGKRNCFGEGLARMELF
LFFTTIMQNFRFKSPQSPKDIDVSPKHVGFATIPPNYTMSFLPR
>CYP2C43
AB212264.1
MDSLVVLVLCLSCLLLLSLWRQRSGRGKLPPGPTPLPVIGNILK
IGIKDVSKSLTNLSKVYGPVFTLYFGLERMVVLHGYEAVKEALIDLGEEFSGRGHFPL
FERANRRFGLVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK
ASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKFNENAKILSSPWIQIYNN
FSPIIDYFPGTHNKLLKNIAFVKSYILEKVKEHQESMDMNNPRDFIDCFLIKMEKEKH
NQQSEFNIENLENTAVDLFAAGTETTSTTLRYALLLLLKHPEVAAKVQEEIEHVIGRN
RSPCMQDRSRMPYTDAVVHEIQRYIDLLPTSVPHAVTCDVKFRNYLIPKGTTILISLT
SVLRDNKEFPNPEMFDPRHFLDEGGNFKNSNYFMPFSAGKRICVGEALARMELFLFLT
SILQNFNLKSLVDLKDLDTTPVFNGFVSVPPIYQLCFIPV
>CYP2C74 variant (S.Sarva) missing exon 1, 4 aa diffs to 2C74 AY635462.1
FSKVYGPVFTVYFGMNPVVVLHGYETVKEALIDNAEEFSGRGILPISERITNGL (1)GIISSNGKRWKETRRFSLTTLRNFGMGKRSIEDRVQEEARCLVEELRKTK (1)ASPCDPTFILGCAPCNVICSVVFQKRFDYKDENFLTLMKRFTVNFRILTSPWIQ (0)VCNNFPLLIDCFPGTHNKLLKNVALTKSYIRKKVKEHQATLDVNNPRDFIDCFLIKMEQ (0)EKDNQQSEFTIENLVGTVADLFVAGTETTSTTLRYGLLLLLKHPEVT (1)AKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVIHEIQRYIDLVPTGVPHAVTTDIKFRNYLIPK (0)
GTIIITLLTSVLQDDKEFPNPKIFDPGHFLDENGNFKKSDYFMPFSA (1)GKRICAGEGLARMELFLFLTTILQNFNLKSVADLKNLNTTSATRGIISLPPSYQICFIPV
>CYP2C75
AY635463.1
MDSLVVLVLCLSCLLLLSLWRQRSGRGKFPPGPTPLPVIGNILQ
IDIKDVSKSLTNLSKVYGPVFTLYFGLERMVVLHGYEAVKEALIDLGEEFSGRGHFPL
ADRANRGFGIVFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTK
GSPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLKVMEKLNENVKILSSPWIQICNN
FPPFIDYFPGAHNKLLKNIAFLKSYILEKVKEHQESMDMNNPRDFIDCFLMKMEKEKH
NQQSEFTIENLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRN
RSPCMQDRSHMPYTDAVVHEIQRYIDLLPTNLPHAVTCDVKFRNYLIPKGTTILISLT
SVLHDNKEFPNPEMFDPRHFLDEGGNFKKSNYFMPFSAGKRICVGEALARMELFLFLT
SVLQNFNLKSLVDPKDLDTTPVVNGFASVPPFYQLCFIPV
>searched with 2C29, differs from 2C43, 2C74, 2C75 (Liao)
MDPAVALVLCLSCLFLLSLWRQSSGRGRLPSGPTPLPIIGNILQLDVKDMSKSLTNFSKV
YGPVFTVYFGLKPIVVLHGYEAVKEALIDHGEKFSGRGSFPVAEKVNKGLGILFSNGKRW
KEIRRFSLMTLRNFGMGKRSIEDRVQEEALCLVEELRKTNASPCDPTFILGCAPCNVICS
VIFHNRFDYKDQRFLNLMEKFNENLRILSSPWIQ
>CYP2B30
(Puljic) also AY635461.1
MELSVLLFLALLTGLLLLLVQRHPNAHGRLPPGPCPLPLLGNLL
QMDRRGLLRSFLRFREKYGDVFTVYLGPRPVVMLCGVEAIREALVDNAEAFSGRGKIA
ITDPVFQGYGVVFANGNRWKVLRRFSLTTMRDFGMGKRSVEERIQEEAQCLIEELRKS
KGALVDPTFLFHSITANIICSIVFGKRFHYQDQEFLKILNLFYHTFSLASSMFGQLFE
LLSGFLKYFPGAHRQVYKNLQEINAYIGHSVEKHRETLDPSAPQDLIDSYLLQMEKEK
SNPHSEFSHRNLIINTLSLFFAGTETTSTTLRYGFLLMLKYPHVAERIYKEIEQVIGP
HRPPALDDRAKMPYTEAVIHEIQRFADLLPMGVPHIVTQQTSFRGYIIPKDTEVFPLL
STALHDPHYFEKPDTFNPDHFLDANGALKKNEAFIPFSLGRRMCLGEGIARNELFLFF
TTILQNFSVASPVAPEDIDLTPQESGVGKIPPTYQIRFLPR
>CYP2F6
AY952296.1
MDSISTAILLLLLALVCLLLTLSSRDKXKLPPGPRPLPLLGNLLLLRSQNMLTSLTQ
LSKEYGSVYTVHLGPRRVVVLSGYQAVKEALVDQGEEFSGRGDYPVFFNFTKGN
GIAFSNGDRWKVLRRFSIQILRNFGMGKRSIEERILEEGSFLLAELRKTE
GEPFDPTFVLSRSVSNIICSVLFGSRFDYDDERLLTVIRLINDNFQIMSSPWGE
(0)
LYNIFPSLLDWVPGPHQRIFQNFKRLRDLIAHRVHDQQASLDPRSPRDFIDCFLTKMAE
EKEDPLSHFHMDTLLMTTHNLLFGGTETVGTTLRHAFLALMKYPKVQ
ARVQEEIDLVVGRTRLPTLEDRAAMPYTDAVIHEVQRFADIIPMNLPHRVIRDTAFRXFLIPK
GTDIITLLNTVHYDPSQFLXPQEFNPEHFLDANQSFKKSPAFMPFSA
GRRLCLGESLARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRSFQLCLCPR
>CYP2D42
(Vasser) also AY635464.1
MELDALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFKNTPYCFDQ
(0)
LRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTCGEDTADRPPVPINQVLGVGPRSQ
(1)
GVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFTDQA
(1)
GRPFRPNSLLDKAVSNVIASLTYGRRFEYDDPRFLRLFDLTHEALKEESGFLRE
(0)
VLNAVPLLLRIPGLAGKVLRSQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEK
(0)
AKGNPESSFNEENLRIVVADLFSAGMVTTSTTLAWGLLLMILHPDVQR
(1)
RVQQEIDNVIGQVRRPEMGDQARMPYTTAVIHEVQRFGDIVPLGVTHMTSRDIELQGFLIPK
(0)
GTTLFTNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVKPEAFLPFSA
(1)
GRRACLGEPLARMELFLFFTCLLQRFSFSVPAGQPRPSHHGVFAFLVTPSPYELCAVPR*
>CYP2E1
AY635465.1
MSALGVSVALLVWVAVLLLVSIWRQVHSSWNLPPGPFPLPIIGN
LFQLELKNIPKSFTRLAQRFGPVFTLYVGSRRVVVVHGYKAVREVLLDHKDEFSGRGD
IPAFHAHRDRGIIFNNGPTWKDIRRFSLTTLRNYGMGKQGNESRIQREAHFLLEALRK
TQGQPFDPTFLIGCAPCNVIADILFRKHFDYNDEKFLRLMYLFNENFQLLSTPWLQLY
NNFPSLLHYLPGSHRKVMKNVAEIKEYVSERVKEHLQSLDPNCPRDLTDCLLVEMEKE
KHSAERLYTMDGITVTVADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIG
PSRIPAIKDRQEMPYMDAVVHEIQRFITLVPSNLPHEATRDTIFRGYIIPKGTVIVPT
LDSVLYDNQEFPDPEKFKPEHFLDESGKFKYSDYFKPFSAGKRVCAGEGLARMELFLL
LSAILQHFNLKPLVDPKDIDISPVNIGFGCIPPRFKLCVIPRS
>CYP2G2P best hit (Li Chen) Note this does not look like a pseudogene
MELGGAVTIFLALCLSCLLVLIAWKRMNKAGKLPPGPTPIPFLGNLLQVRTDATFQSFMK (0)LKEKYGPLFTVYMGLWPVVVLCGHEAVKEALIDQADEFSGRGKLASIEQNFQGH (1)
GVALANGERWRILRRFSLTILRDFGMGKRSIEERILEEASYLLEEFRKTK (1)GAPIDPTFLLSRTVSNVISSVVFGSRFDYEDKQFLNLLRLINESFIEMSTPWAQ (0)LYDMYSGIMQYLPGRHNRVYYLIEQLKDFIASRVKINEASFDSQNPRDFIDCFLIKMHQ (0)DKNNPRTEFNLKNLVLTALNLFFAGTETVSSTLRYGFLLLMKHPEVE (1)ARIHEEINQVIGPHRLPSVDDRVKMPYTDAVIHEIQRLVDIVPMGVPHNVIRDTQFRGYLLPK (0)GTDVFPLLGSVLKDPKYFRYPEAFYPQHFLDEQGRFKKNEAFVPFSS (1)GKRICLGEAMARMELFLYFTSILQNFSPRSLVPPADIDITPKLSGFGNIPPTYELCLVAR
>CYP2J2
best hit (Z. Zhang) partial
GLIMSSGQIWKEQRRFTLTALRNFGLGKKSLEERIQEEAQHLTEAIKEENGQPFDPHFKI
NNAVSNIICSITFGERFDYQDSQFQELLKLLDEVTYLEASKTCQLYNIFPWLMKFLPGPH
QTLFSNWEKLKLFVSHMIEKHRKDWNPAETRDFIDAYLKEMSKHTGNSTSSFHEENLICS
TLDLFFAGTETTSTTLRWALLYMALYPEIQEKVQAEIDRVIGQGQQPSTAARESMPYTNA
VIHEVQRMGNIVPLNVPREVTVDTTLAGYHLPKRACLGEQLARTELFIFFTSLVQKFTFR
PPNNEKLSLKFRMGITISPVSHHLC
>CYP2R1
(G.Zhu) partial
IFSLDLGGISTVVLNGYDVVKECLVHQSGIFADRPCLPLFMKMTKMGGLLNSRYGQGWVE
HRRLAVNSFRYFGYGQKSFESKILEETKFFTDAIETYKGRPFDFKQLITSAVSNITNLII
FGERFTYEDTDFQHMIELFSENVELAASASVFLYNAFPWIGILPFGKHQQLFRNASVVYD
FLSRLIEKASVNRKPQLPQHFVDAYFDEMDQGKNDPSSTFSKENLIFSVGELIIAGTETT
TNVLRWAILFMALYPNIQGQVQKEIDLIMGPNGKPSWDDKFKMPYTEAVLHEVLRFCNIV
PLGIFHATSEDAVVRGYSIPKGTTVITNLYSVHFDEKYWRDPEVFHPERFLDSSGYFAKK
EALVPFSLGRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPDLKPRLGMTLQPQPYL
ICAERR
>CYP2S1
AC011510 one exon per line 78% to mouse 2s1 49% to 2B6 47% to 2A13
exons 2,3 from CO649282.1
MEATGTWALLLALALLLLLTLALSGTRARGQLPPGPTPLPLLGNLLQLRPGALYSGLMR
(0)
LSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRGTVAMLEGTFDGH
(1)
GVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETFQGTE
(1)
GRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAMVRAAGGTLLGVSSRGGQ
(0)
TYEMCSWFLWPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKMAQ
(0)
EEQNPDTEFTNKNMLMTVIYLLFAGTMTVSATVGYTLLLLMKYPHVQ
(1)
KRVREELTQELGSGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQ
(0)
GTEVFPLLGSILHDPSIFKHPEEFNPDHFLDADGRFRKHEAFLPFSL
(1)
GKRVCLGEGLAKAELFLFFTTILQAFSLESPCPLDSLSLKPTISGLFNIPPAFQLQVRPTDLHSTTQTT*
>CYP2T2P
ortholog, SCAFFOLD100362 (+) 38209-41795
frameshift in exon 4 after VIC, numerous other defects
MIAGIAALLLWLLVLALARWG*GGCRARMRGSLPPRPRPLPLLGNLQLQSGGTDHALHS
(?)
LSGRWGPVFTAQLGPRPAVVLCGYAALRDALVLQADAFSGRGSMAVFERFTRGH
(1)
GIFLSNGPRWWTLRNFAVGALKELGLGTRTIQAHVLEEAACLLDEMQATI
(1)
GAPFDPMRLLDNAVSNVICX
LVFGNRYGYGDPEFLRLLNLFSDNFRIMSSRWGE
(0)
(?)
SLMDWLPGRHRRIFRNF
SELWVFISEQIQQHWQMRQPAEPRDFINCLTRWVRRGSQ
QDPESHFQEETSVMMTHLFFGGTETSTTLCYGLLVLLKYPEVA
(1)
AKVQELDPVVGWRCAPSPDDHQRLPYTNAVLLQIQRFISVVPLGLPRX
TLNTHLHSHCLPK
(1)
GTFVIPLLVTAHXDRTQFKDPDCFNATNFLDKGKSQGNDPFMPFAS
(1)
(?)
GKQMCLGAGLAHLEIFLFLTATLPRFRLLPVVNPGTINLT
QFTGLGSVPPAFQLQLVAC
>CYP2U1 (Li Chen) note gc boundary between exons 7,8
MSSPGPPQPPAEDPPWPARLLRAPLGLLRMDPSGDALLLCGLVAVLGWSWLRRRRARGIP PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVVGPQVLLAHLARVYGSIFSF FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEK (1)GVVFAHYGPIWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFSIISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLMVNICPWLYYLPFGPFKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYLFYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQ (1)EKVHEEIERVIGANRAPSLTDKAQMPYTEATIMEVQRLTVVVPLAIPHMTSGNT (1)VLQGYTIPKGTLILPNLWSVHRDPAIWEKPEDFYPNRFLDDQGQLIKKETFIPFGI (1)
GKRVCMGEQLAKMELFLMFVSLMQSFAFALPEKSKKPLLTGRFGLTLAPHPFNITISRR*
>CYP2W1
Macaca mulatta rhesus monkey (Mahrous)
LSERYGPVFTVHLGCQKTVVLTGFEVVKEALAGPGQELADRPPIAIFQLIQRGG
(1)
GIFFSSGARWRAARQFTVRALHSLGVGRKPVADKILQELKCLLGQLDGYR
(1)
GQPFPLALLGWAPSNITFTLLFGRRFDYRDPVFVSLLGLIDEVMVLLGSPGLQ
(0)
LFNVYPWLGALLQLHRPVLRKIEEVRAILRTLLEARRPHMRPGDPVCSYVDALIQQGQ
(0)
GDDPEGLFAEDNAVACTLDMVMAGTETTSATLQWAALLMGRHPDVQ
(1)
GRVQEELDRVLRRGRPPQPEDQQVLPYTSAVLHEVQRFITLLPHVPRCTATDMQLGGFLLPK
(0)
GTPVIPLLTSVLLDETQWQTPDQFNPGHFLDADGHFVKQEAFLPFSA
(1)
GRRVCVGERLARTELFLLFAGLLQKYYLLPPPGVSPASLDTTPAQAFTMRPRAQALCAVPRP
>CYP2AB1P
SCAFFOLD46808:34-204, SCAFFOLD101629:758-8003 (no ESTs found)
MLSLLSGLALLAISFLLLKLGTFCWDRNRLPPGPFPFPILGNLWQLRFQLHPETLLQ
(0)
LAQTHG
VCLFTVWVSPIPIVVLSGFRAVKEALVSNSEQFSGRPLTSLFQDLFGEQ
(1)
GIVCSRRHMWWQQRRFCLVTLQGLGLGKLALEVQLQKQAAELVEAFRQEL
(1)
SRSFDPQVSIVRSTVRVIGALVFGHHFLSEDPIFQELTQAIDFGLALVRTVWHW
(0)
LHDVFPRALCHLPGSHREIFRYQGVVRSFTRREITGRKLKALEALKDFINCSLAQISK
(0)
AMDEPVSTFHEENLVQVVIDLFLGGTNTTATTQRWALVYMIQHGAVQ
(1?)
GTIILPLCRGSVLYDPECWETPPQFNPGHFLDKDGNFVANEAFLPFSA
(1)
GHCVCPGDQLARMELFLMFATLLRTFRFQLPEGSPGLKLEYIFGGTLQPQPQEICAVPR
>CYP2AC1P
SCAF55146 (+) 91919-103589, SCAF70822:43519-43602, SCAF103481:817-966
1
MSEFDASAILPIRVLILIFILSIKKFMTEASKQLSPPGPRPLLVIGNLYFLNLKRPYQTMLE (0)
3
GIAFFHGETWKTMRWFSLTTLQNFGMDEWIIEDTIIEECQNLIQNSEFHR
4
GKSFEMKTIMNASVVNIIVLVLPGKWFDYQDSQFLRLLALIGENVKLIGGLRIAVN (1)
5
SFQYVSFWGVLLKSHKTVFRNRDELFSFIRMIFLDHCHKLDKNDPRSFTDAFLVTQQE (0)
6
ENDTFADHFSDENLMALVNNLFTTGTETTASTLPWGILLVICLRSRV (1)
7
KKVHNEVTKVARSAQP*LAHQTQMPHTDAVSHEVQRFANILPTSLPHATPTNIFKNYYIPK
(0)
8
ATEVIILLASVRRDQAQWEKPDTFNPEHFLTSKGKFIKREAFLPFTV (1)
9
GRRMCAGESSAR MELFLFFTSLLQ KFTFQPPLGVSHLDLDLSLDIGFTT*
>CYP3A64
AY582531.1
MDLIPDLAVETWLLLAVTLVLLYLYGTHSHGLFKKLGIPGPTPL
PLLGNILSYRKGFWTFDMECYKKYGKVWGFYDGRQPVLAITDPNMIKTVLVKECYSVF
TNRRPFGPVGFMKNAISIAEDEEWKRIRSLLSPTFTSGKLKEMVPIIAKYGDVLVRNL
RREAETGKPVTLKDVFGAYSMDVITSTSFGVNIDSLNNPQDPFVENTKKLLRFDFLDP
FFLSITIFPFIIPILEVLNISIFPREVTSFLRKSVKRIKESRLKDTQKHRVDFLQLMI
DSQNSKETESHKALSDQELVAQSIIFIFAGYETTSSVLSFIIYELATHPDVQQKLQEE
IDTVLPNKAPPTYDTVLQMEYLDMVVNETLRIFPIAMRLERVCKKDVEINGIFIPKGV
VVMIPSYALHHDPKYWPEPEKFLPERFSKKNNDNIDPYIYTPFGSGPRNCIGMRFALM
NMKLAIIRVLQNFSFKPCKETQIPLKLRLGGLLQTEKPIVLKIESRDGTVSGA
>CYP3A43 ortholog? partial assembly (Aggarwal) 72% to 3A64
MDLIPNFAMETWVLVATSLVLL (2)YIYGTHSHKLFKKLGIPGPTPLPFLGTILFYLR (0)GLWKFDRECNEKYGEMWG (2)LYEGQQPMLVIMDPDMIKTVLVKECYSVFTNRM (0)PLGPMGFMKSALSFAEDEEWKRIRTLLSPAFTSVKFKE (0)MVPIISQCGDMLVRSLRREAENSKPTNLKE
>CYP4A11 match (Ramy.Naguib, Puljic) (exon 1 added later)
MSVSVLSPSRLLGGVSGILQVASLLILLLLLIKAAQLYLHRQWLLKAFQQFPCSPSHWLFGHKQE (0)FQQDQELQRIRKWVEMFPSACPLWLWGGKARVQLHDPDYMKVILGRS (1)DPKSQDPYRFLAPWI (1)GYGLLLLNGQTWFQHRRMLTPAFHYDILKAYVALMADSVRVML (0)DKWEKLLGQDSPLEVFQHVSLMTLDTIMKCAFSHQGSIQVDR (2)DSQSYIQAISDLNNLVFSRVRNVFHQNDTIYSLTSTGRWTHRACQLAHQHT (1)DQVIQLRKAQLQKEGELEKVKRKKHLDFLDILLLAK (0)MENGSILSDKDLRAEVDTFMFEGHDTTASGISWILYALATHPKHQERCREEIHGLLGDGASITW (2)NHLDQMPYTTMCIKEALRLYPPVPGISRELSTPVTFPDGRSLPK (1)GITVMLSIYGLHHNPKVWPNPE (0)VFDPSRFAPGSAQHSHAFLPFSGGSR (2)NCIGKQFAMNELKVATALTLLRFELLPDPTRIPIPMARLVLKSKNGIHLRLRRLPNPCEDKDQL
>CYP4B1
MVPSFLSLRLSCLGLWASGLILVLGFLKLIRLLLRRQRLAKAMGNFPGPPTHWLFGHALE
(0)
IQQTGSLDKVVSWAHQFPYAHPLWFGQFIGFLNIYEPDYAKAVYSRG
(1)
DPKAPDVYDFFLQWI
(1)
GRGLLVLEGPKWFQHRKLLTPGFHYDVLKPYVALFAESTRVML (0)
DKWEEKAQEGKSFDIFCDVGHMALNTLMKCTFGRGDTGLGHR
(2)
DSSYYLAVSDLTLLMQQRLVSFHYHNDFIYWLTPHGRRFLRACQVAHDHT
(1)
DQVIRERKAALQDEKVRKKIQNRRHLDFLDILLGAR
(0)
DEDDSKLSDADLRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQHRCREEVREILGDQDSFQW
(2)
DDLGKMTYLTMCIKESFRLYPPVPQVYRQLSKPVTFVDGRSLPA
(1)
GSLISMHIYALHRNSAVWPDPE
(0)
VFDPLRFSTENASKRHPFAFMPFSAGPR
(2)
NCIGQQFAMSEMKVVTAMCLLHFEFSLDPSRLPIKMLQLVLRSKNGIHLHLKPLGPGSGK
>CYP4V2 (S.Sarva)
MAGIWLGLVWQKLLLWGAASAVSLAGASLVLSLLQRVASYVRKWQQMRPIPTVARAYPLV GHALLMKRDGR (1)EFFQQIIEYTEEYRHMPLLKLWVGPVPMVALYNAENVE (0)VILTSSKQIDKSSMYKFLEPWLGLGLLTS (2)TGNKWRSRRKMLTPTFHFTILEDFLDIMNEQANILVKKLEKHVNQEAFNCFVYITLCALDIIC (1)ETAMGKNIGAQSNDDSEYVRAVYR (2)MSEMIFRRIKMPWLWLDLWYLMFKEGWEHKKSLKILHAFTNN (0)VIAERANEMNVDEDCRGDGRDSAPSKNKRRAFLDLLLSVTDDEGNRLSHEDIREEVDTFMFE (0)GHDTTAAAMNWSLYLLGSNPEVQKKVDHELDDVF (1)GRTDRPATVEDLKKLRYLECVIKETLRLFPSVPLFARSVSEDCEV (1)AGYRVLKGTEAVIIPYALHRDPRYFPNPEEFRPERFFPENAQGRHPYAYVPFSAGPRNCI (1)GQKFAVMEEKTILSCILRHFWIESNQKREELGLEGQLILRPTNGIWIKLKRRNADE 1551
>CYP4X1
(Vasser) missing exon 1
FIQDDNMEKLEEIIEKYPRAFPFWIGPFQAFFYIYDPDYAKTFLSRT
(1)
DPKSQYLQKFLPPLI
(1)
GKGLLALDGPKWFQHRRLLTPGFHFNILKAYIEVMAHSVKTML
(0)
DKWEKICSTQNTSVEVYEHINLMSLDIIMKCAFSKETNCQTN
(2)
STHDPYVKAIFELGKIIFHRLYSFLYHSDIIFKLSPQGYRFQKLSRVLNQYT
(1)
DAIIQERKKSLQAGEKQDNTQKRKYQDFLDIVLSAK
(0)
DENGSSFSDTDVHSEVSMFLLGGHDSLAASISWILYCLALNPEHQERCREEVRGILGDGCSITW
(2)
DQLGEMSYTTMCIKETCRLIPAVPSISRDLSKPLTFPDGCTLPA*
>CYP5A1
(Z. Zhang) partial
ASGLEFKSVADSVLFLRDKRWEEVRGALMSAFSPEKLNEMTPLISQACDLLLAHLKRYAE
SGDAFDIQRCYCNYTTDVVASVAFGTPVDSQQAPEDPFVKHCKRFFEFCIPRPILVLLLS
FPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQAAEERRRDFLQMVLDARHSASP
VGVQDFDMVGDVFSSTRCKPNPSRQHQAGPMARPLTVDEIVGQAFIFLIAGYEIVTNTLS
FATYLLATNPDCQEKLLREVDLFKEKHMVPEFCSLEEGLPYLDMVIAETLRMYPPAF
>CYP7A1
(A.Bolen)
MMTTSLIWGIAIAACCCLWLILGIRRR
(2)
QTGEPPLENGLIPYLGCALQFGANPLEFLRANQRKHGHVFTCKLMGKYVHFITNPLSY
HKVLCHGKYFDWKKFHFATSAK
(0)
AFGHRSIDPMDGNTTENINDTFIKTLQGHALNSLTESMMENLQRIMRPPVSSN
SKTAAWVTEGMYSFCYRVMFEAGYLTIFGRDLTRRDTQKAHILNNLDNFKQFDKVFPA
LVAGLPIHMFRTAHNAREKLAESLRHENLQKRESISELISLRMFLNDTLSTFDDLEKA
KTHLVVLWASQANTIPATFWSLFQMIR
(2)
NPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQAELNDLPVL
(1)
DSIIKESLRLSSASLNIRTAKEDFTLHLEDGSYNIRKDSIIALYPQLMHLDPEIYPDPL
(0)
TFKYDRYLDENGKTKTTFYCNGLKLKYYYMPFGSGATICPGRLFA
IHEIKQFLILMLSYFELELIEGQAKCPPLDQSRAGLGILPPLNDIEFKYKFKHL*
>CYP7B1
(G.Zhu) partial
RRPGEPPLIKGWLPYLGVVLKLRKDPLSFMKTLQKQHGDTFTVLLG
GKYITFILDPFQYQLVIKNHKQLSFRLFSNKLLEKAFSISQLQKNHDMNDELHLCYQFLQGKSLDILLESMMQN
LKQVFESQLLKTTSWDTAQLYPFCSSIIFEITFTTIYGKVLVCDNKFISELRDDFLKFDD
KFAYLVSNIPIELLGNVKSIRKKIIKCLSSENLAKMQGWSEVFQSRQDVLEKYYVHEDLE
IGAHHLGFLWASVANTIPTMFWAMYYLLRHPEAMAAVRDEIDRLLQSTGQKKGSGFPIHL
TREQLDSLICLESTIFEALRLSSYSTTIRFVEEDLTLSAQTGDYCVRKGDLGAIFPPILH
GDPEIFEAPDSKEFRYDRFIEDGKKKTTFFKRGKKLKCYLMPFGTGTSKCPGRFFALMEI
KQLLVILLTYFDLEIIDDKPIGLNYNRLLFGIQYPDSDVLFRYKVKS
>CYP8A1
partial (Lin Zhu)
GDKDHMCSVKSRLWKLLSPARLATRAHRSKWLESYLLHLEEMGVSEEMQARALVLQLWATQ
(0)
GNMGPAAFWLLLFLLKNPEALAAVRGELESILWEAEQPVSQMTTLPQKVLDGTPVL
(1)
DSVLSESLRLTAAPFITREVVVDLAMPMADGREFNLRRGDRLLLFPFLSPQRDPEIYTDPE
(0)
VFKYNRFLNPDGSEKKDFYKDGKRLKNYNMPWGAGHNHCLGKSYAVNSIKQ
(2)
FVFLVLVHLDLELINADVEIPEFDLSRYGFGLMQPEHDVPIRYRIRP
>CYP8B1
SCAFFOLD114862:8-613, SCAFFOLD39206:3-626 no introns
BB882888.1 Macaca fascicularis = lower case
MVLWGPVLGALLVVIAGYLCLPGMLRQRRPREPPLDKGTVPWLGYAMAFRKNMFEFLKRM
RSKHGDVFTVQLGGQYFTFVMDPLSFGPILKDTQRKLDFGQYAKKLVLKVFGYRSVQGDH
EMIHSASTKHLRGDGLKDLNETMLDSLSFVMLKSKGWSLDASCWHEDSLFHFCYYILFTA
GYLSLFGYTKDKEQDLLQAGEL
40 aa gap
shsqxkegisnwlcnmlqflreqgvpsamqdkfnfmmlwasqgntgpts
FWALLFLLKHPEAIRAVRQETTQVLGEARLETKQSFAFKLSALQHTPVLDSVVEETLRLR
AAPTLLRLVHEDYTLKMASGQEYLFRRGDILALFPYLSVHVDPDIHPEPTIFKYDRFLNP
NGSRKVDFFKAGKKIHHYTMPWGSGVSICPGRFFALSEVKLFILLMVTHFDLELVDPDTP
LPHVDPQRWGFGTMQPSHDVRFRYRLRP
>CYP11A1
N-term = DQ228169.1 Macaca fascicularis = lower case (Mahrous)
mlakglpprsvlvkgcqtflsapkerlghlrvptsegagistrs
prpfneipspgdngwlnlyhfwretgthkvhlhhvqnfqkydpiy
REKLGNVESVYVIDPEDVALLFKSEGPNPERFLIPPWVAYHQYYQRPIGVLLK
(2)
KSAAWKKDRVALNQEVMAPETTKNFLPLLDAVSRDFVSVLHRRIKKAGSGNFSGDISDDLFRFAFE
(1)
SITNVIFGERQGMLEEVVNPEAQRFIDAIYQMFHTSVPMLNLPPDLFRLFRTKTWKDHVAAWDVIFSK
(1)
ADMYTENFHWELRQKGNVHHDYRGILYRLLGDSKMSFEDIKANVTEMLAGGVDT
(0)
TSMTLQWHLYEMARNLKVQDMLRAEVLAARRQAQGDMATMLQLVPLLKASIKETLR
(2)
LHPISVTLQRYLVNDLVLRGYMIPAK
(0)
TLVQVAIYALGREPTFFFDPENFDPTRWLSKDKNITYFRNLGFGWGVRQCLGRRIAELEMTIFLIN
(0)
MLENFRVEIQHLSDVGTTFNLILMPEKPISFTFWPFNQEATQ
>CYP11B1
(Lin Zhu)
MALRAKAEVCMAAPWLSLQRARALGTRATRVPRTVLPFEAMPRRPGNRWLRLLQIWREQGYEHLHLEVHQTFQELGPIFR (2)YDLGGAGMVCVMLPEDVEKLQQVDSLNPRRMSLEPWVAYRQHRGHKCGVFLL (2)NGPEWRFNRLRLNPDVLSPRAVQRFLPMVDAVARDFSQALRKKVLQNARGSLTLDVQPSIFHYTIE (1)ASNLALFGERLGLVGHSPSSASLSFLHALEVMFKSTVQLMFMPRSLSRWTSPKVWKEHFEAWDCIFQY (1)GDNCIQKIYQELALSRPQQYTSIVAELLLNAELSPDAIKANSMELTAGSVDT (0) TVFPLLMTLFELARNPNVQQALRQESLAAAASISEHPQKATTELPLLRAALKETLR (2)LYPVGLFLERVVSSDLVLQNYHIPAG (0)TLVRVFLYSLGRNPALFPRPERYNPQRWLDIRGSGRNFYHVPFGFGMRQCLGRRLAEAEMLLLLHH (0)VLKHLQVETLTQEDIKMVYSFILRPSTFPLLTFRAIN
>CYP17
AY746983.1 and AF458332.1
MWELVALLLLTLAYLFWPKRRCPGAKYPKSLLSLPLVGSLPFLP
RHGHMHNNFFKLQKKYGPIYSVRMGTKTTVIVGHHQLAKEVLIKKGKDFSGRPQVTTL
DILSNNRKGIAFADYGAHWQLHRRLAMATFALFKDGDQKLEKIICQEISTLCDMLATH
NGQTIDISFPVFVAITNVISLICFNISYKNGDPELKIVHNYNEGIIDSLGKESLVDLF
PWLKVFPNKTLEKLKRHVKTRNDLLTKIFENYKEKFHSDSITNMLDVLMQAKMNSDNG
NAGPDQDSELLSDNHILTTIGDIFGAGVETTTSVVKWIVAFLLHNPQVKKKLYEEIDQ
NVGFSRTPTISDRNRLLLLEATIREVLRIRPVAPMLIPHKANVDSSIGEFAVDKGTHV
IINLWALHHNEKEWHQPDQFMPERFLNPAGTQLISPSLSYLPFGAGPRSCIGEILARQ
ELFLIMAWLLQRFDLEVPDDGQLPSLEGNPKVVFLIDSFKVKIKVRQAWREAQAEGST
>CYP19
(Iyer)
MVLEMLNPMHYNITSMVPEAMPAATMPILLLTGLFLLVWNYEGTSSIP
(1)
GPGYCMGIGPLISHGRFLWMGIGSACNYYNQVYGEFMRVWISGEETLIISK
(2)
SSSMFHIMKHNHYSSRFGSKLGLQCIGMHEKGIIFNNNPDLWKTTRPFFMK
(1)
ALSGPGLVRMVTVCAESLKTHLDRLEEVTNESGYVDVLTLLRRVMLDTSNMLFLRIPLD
(1)
ESAIVVKIQGYFDAWQALLIKPDIFFKISWLYKKYEKSV
(2)
KDLKDAIEVLIAEKRRRISTEEKLEECMDFATELILAE
(0)
KRGDLTRENVNQCILEMLIAAPDTMSVSLFFMLFLIAKHPNVEEAIMKEIQTVV
(1)
GERDVKIDDMQKLKVMENFIYESMRYQPVVDLVMRKALEDDVIDGYPVKKG
TNIILNIGRMHRLEFFPKPNEFTLENFAKN
(0)
VPYRYFQPFGFGPRGCAGKYIAMVMMKAILVTLLRRFHVKTLQGQCVERIQ
KIHDLSSHPDETKNMLEMIFTPRNSDRCLEH
>CYP21
(Blackwell) partial
LVSKNYPDLSLGDYSLLWKAHKKLTRSALLLGMRDSMEPVVEQLTQEFCERMRAQAGTPV
AIEEEFSLLTCSIICHLTFGDKIKDNLVPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFFP
NPGLRRLKQAIEKRDHIVEKQLRQHKESLVAGQWRDMMDYMLQVVAQPSMEEGSGQLLEG
HVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPSASSSRVPYKDR
ARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDEMVWE
RPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPPGDALP
SLQPLPHCSVILKMQPFQVWLQPRGLGVHSLGQSQ
>CYP24
(S.Jain) partial
MSSPISKSRSLAAFLQQLRSPRQPPRPVTSTAYTSPQPREVPVCPLTAGGETQNAAALPGPTSWPLLGSLLQILWKGGLKKQHDTL(0)VEYHKKYGKIFRMKLGSFESVHLGSPCLLEALYRTESAYPQRLEIKPWKAYRDYRKEGYGLLIL
>CYP26A1
(Liao, Iyer)
MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQM
VLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGEHRLVSVHWPASVRTIL
GSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPE
VKRLMFRIAMRILLGCEPQLAGDGDAEQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKAR
NLIHARIEQNIRAKICGLRASEAGRGCKDALQLLIEHSWERGERLDMQALKQSSTELLFG
GHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVI
KETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFM
LPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTV
YPVDNLPARFTHFHGEI
>CYP26B1 (S.Jain) partial
VFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIPEEDLGHLFEVYQQFVENVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDYSDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLREELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNKILPETEAMLSATV
>CYP26C1
(C.Blackwell) missing an exon
MFPWGLSCLSVLGAAGTAVLCAGLLLSLAQHLWTLRWMLSRDRASTLPLPKGSMGWPFFGETLHWLV(0)
QGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTILLGEHRLVRSQWPQSAHILLGSHTLLGAV
GEPHRRRRK (0)
VLARVFSRAALERYVPRLQGALRHEVRSWCAAGGPVSVYDAAKALTFRMAARILLGLRLDEAQCATL
ARTFEQLVENLFSLPLDVPFSGLRK
(0)
SAVELLFAAFFTTASASTSLVLLLLQHPAAIAKIREELVAQGLGRACGCAPGAAGGCVGPPPDCG
CEPDLSLAALGSLRYVDCVVKEVLRLLPPVSGGYRTALRTFELD(0)
GYQIPKGWSVMYSIRDTHETAAVYRSPPEGFDPERFGAAREDSRGASSRFHYIPFGGGARSCLGQEL
AQTVLQLLAVELVRTARWELATPAFPALQTVPIVHPVDGLRLFFHPLAPLVAGDGLCL
>CYP27A1
(Q.Tran)
QVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDLHDLTYGPFTT(2)
EGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMIRLDQLRAESASGNQVSDTAQLFYYFALE
(1)
AICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWNAIFSF
(1)
GKKLIDEKLEDMEAQLQAEGPDGVQVSGYLHFLLASGQLSPREAMGSLPELLMAGVDT
(0)
TSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHLPLLKAVLKETLR
(2)
LYPVVPTNSRIIEKEIEVDGFLFPKN
(0)
TQFVFCHYVVSRDPTTFSEPESFQPHRWLRSSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLAR
(0)
LIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQC
>CYP27B1 (Xin Liu)
MTQTLKYASRVFHRVRWAPELGASLGYREYDSARRSLADIPGPSTPSFLAELFCKGGLSRLHELQVQGAARFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRRRQRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAETLDNVVRDLVRRLRCQRGRGTGPPALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPHWLRRLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNRGQPDEDLESGAHLTHFLFQEELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALGPGSSAHPPATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPAQFPEPNSFHPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILIHFEVQPEPGAAPIRPMTRTVLVPERSINLQFLDR
>CYP27C1 (F.Zhang) partial
MQTSAMALLARILRAGLRPAPERGGLLGGAAPRRPQPAGARLPAGARAEDKGAGRPGAPPAGGRAEGPRLAAMPGPRTLANLAEFFYRDGFSRIHEIQQKHTQEYGKIFKSHFGPQFVVSIADRDMVAQVLRAEGAAPQRANMESWREYRDLRGRATGLISAEGEQWLKMRSVLRQRILKPKDVAIYSGEVNQVIADLIKRIYLLRSQAEDGETVTNVNDLFFKYSMEGVATILYESRLGCLENSIPQLTVEYIEALELMFSMFKTSMYAGAIPRWLRPFIPKPWREFCRSWDGLFKF
>CYP39
(Q.Tran)
MELIFPTVIIILGCLALFLLLQRKNLRRPPCIRGWIPWIGVGFEFGKAPLEFIEKARIK
(0)
YGPIFTVFAMGNRMTFVTEEEGINVFLKSKKVDFELAVQNIVYHT
(1)
ASIPKNVFLALHEKLYIMLKGKMGTVNLHQFTGQLTEELHEQLENLGTHGTMDLNNLV
(2)
RHLLYPVTVNTLFNKSWFPTNKKKIKEFHQYFQAYDEDFEYGSQLPECLL
(2)
RNWSKSKKWFLELFEKNIPDIKACKSAKDNSM
(0)
TLLQATLDIVETETSKENSPNYGLLLLWASLSNAVP(0)
VAFWTLAYVLSHPDIHKAVMEGISSVFGTA
(1)
GKDKIKVSEDDLEKLLLIKWCVLETIRLKAPGVITRKVVKPVEIL
(0)
NYIIPSGDLLMLSPFWLHRNPKYFPEPELFKP
(0)
ERWKKANLEKHSFLDCFVAFGSGKFQCPG
(2)
RWFALLEVQMCVILILYKYDCSLLDPLPKQ
(0)
SSLHLVGVPQPEGQCRIEYKQRI
>CYP46 C-term part (Ramy.Naguib)
RLQAEVDEVIGSKRYLDFEDLGRLQYLSQVLKESLRLYPPAWGTFRLLEEETLIDGVRVPGNTPLLFSTYVMGRMDTYFEDPLTFNPDRFGPGAPKPRFTYFPFSLGHRSCIGQQFAQMEVKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLRPRGWQPTPPPPPC
>CYP51 (Y.Peng) partial
AAAAGMMLLGLLQAGGSVLGQAMEKVTGGNLLSMLLIACAFTLSLVYLFRLAAGHLVQLPAG (0)KSPPYIFSPIPFLGHAIAFGKSPVEFLENAYEK (0)VFGKGVAYDVPNP (0)VFLEQKKMLKSGLNIAHFKQHVSIIEKETKEYFQSWGESGEK (1)VFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSF (2)RRRDRAHREIKNIFYKAIQKRRQSQEKIDDILQTLLDATYK (0)DGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCYLEQKT VCGENLPPLTYDQ (0)TVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLQDNPASGEKFAYVPFGA (1)GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPENPVIRYKRRSK