Bos taurus Cytochrome P450s
Newly updated by David
Drane (Ridgeway High School, Memphis)
Last revised July 21, 2005
All cytochrome P450 genes
that could be assembled from EST data,
mRNA sequences in the nr
section of Genbank, and WGS (whole genome
shotgun sequences) at NCBI,
were assembled and compared to the human
P450 collection.
59 complete genes were
assembled, this number includes three pseudogenes.
6 partial genes are
lacking small regions, and there are several other small
pieces and a few shorter
pseudogenes.
There are a minimum of 62
CYP genes in the cow.
The old Bos taurus
sequence page has been preserved here.
CYP1 family
CYP1A1
AB060696.1
= NP338778, TC208445 XM_588298 CC766779.1 Bac end
MFSVFGLPIPISATELLLASAVFCL
VFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLMLGKNPHVVLSQLSQRYGDVLQIRIG
CTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLITNGQSMTFNPDSGPVWAARRRL
AQNALKSFSTASDPASSSSCYLEEHVNKEAKYLLGKFQELMSGPGRFDPYRYIVVSVA
NVICAICFGRRYDHNDQEFLSLVNLSNEFGEITASGNPSDFIPVLRYLPNTALDLFKD
LNQRFYVFVQKIVKEHYKTFEKGHIRDITDSLIEHCQDKRLDENANIQLSDEKIINVV
IDLFGAGFDTVTTALSWSLLYLVTSPRVQKKIQEELDTVIGRARRPRLSDRPQLPYLE
AFILETFRHSSFVPFTIPHSTTRDSNLNGFYIPKGRCVFVNQWQINHDQKLWEDPSEF
RPERFLTADGTINKVLSEKVIIFGLGKRKCIGETIARLEVFLFLAILLHQVEFCVTPG
VKVDMTPVYGLTMKYARCEHFQAHMRS
CYP1A2
82% to human 1A2
Database
location : AAFC01721680 742 to 1116 (-)
Genomic location : SCAFFOLD261464 3 to 377 (+)
trace file gnl|ti|669459617
MALSQLSPFSAMELLLASAIFCLVFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLTLG
KNPHVVLSQLSQRYGDVLQIRIGCTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLVT
DGQSMTFNPDSGPVWAARRRLAQNALNTFSVASD
PSSSSSCYLEDHVSKEAEALLGKFQELMSGPGRFDPYGHVVASV
ANVIGAMCFGQHFPQSSKEMLSLVESSHDFVESASSGNPVDFFPILKYLPNPALQRFK
SFNQRFLQFVRKTVQEHYQDFDKNSIQDIIGALFKHSEDNSRASSRLISQEKTVNLVN
DLFAAGFDTITTAISWSLMYLVTNPKIQRKIQEELD
RVVGRARRPRLSDRPQLPYLES
FILETFRHSSFVPFTIPHSTTRDTTLNGFFIPKERCVFINQWQVNHDPKLWGDPSVFR
PERFLTSDGTTIDKTASEKVLLFGMGKRRCIGEVMARWEVFLFLAILLQRLEFSVPPG
VKVDLTPTYGLTMKHARCEHMQARLRFPIK
CYP1B1
82% to human 1B1
MATGLSPDDHLSPTLLSVQQTMLLLLLSVLAAVHVGQWLLRQRRRQPGSAPPGPFAWPLI
GNAASMGSAPHLLFARLARRYGDVFQIHLGSCRVVVLNGERAIRQALVHQSAAFADRPPF
ASFRLVSGGRSLAFGQYSESWKAQRRAAHSTMRAFSTRQPRGRRVLEGHVVGEVRELVEL
LVRRSAGGAFLDPRPLTLVAVANVMSALCFGCRYSHDDAEFLELLSHNEEFGRTVGAGSL
VDVLPWLQRFPNPVRTAFREFEQLNRNFSNFVLDKFLRHRESLRPGAAPRDMMDAFIHSA
GADSGDGGPRLDVDYVPATVTDIFGASQDTLSTALQWLLVLFTR
(2)
YSEVQARVQAELDQVVGRHRLPTLEDQPRLPYVMAFLYEAMRFSSFVPVTIPHATTANAS
VLGYHIPKDTVVFVNQWSVNHDPVKWSNPEDFDPTRFLDKDGLINKDLTGSVMVFSVGKR
RCIGEEISKMQLFLFISILAHQCNFKANPDEPSKMDFNYGLTIKPKSFKINVTLRESMEL
LDSAVQKLQVEKECQ*
CYP1A8P ortholog (probably
a new subfamily since it is functional in opossum)
68% to
1A8P 40% to CYP1A1
BM25891 SCAFFOLD 61062 SCAFFOLD 129297 SCAFFOLD 245449
AAFC02085398.1
AAFC02099349.1
MIFGMAVTSGEVTTSRIILVMVFVFVRELGNKGRKEVFPPGPWSLPIVENLLQLG
DHLYFTFMEMRKKYGDVFLIKLGMVPVLVVNGMEMVKEVLLRNGEHFAA*PNV
LTFSFLAQ*KSLTFS
NYGENWTLHKKIASNALRTFPKAETKSSTRSCLLEKHVIEEVSELVKV
FTELTSRSGSFEPRGAITCAMANVV
CTLCFGKRYDHSDEEFLRIVKTDHDLLKASSAANPADFIPYF*YLPLRIINAPQEFYHARNQ
FIALHIRDHLTT
CPQDHIQDITDALINACHNKYAVAKITILNDDEIISTVSDLVGAG
FEIISTCIYWSFLYLIYYPEIQVKIQEEI
DGNTGMKSPRFENRKILP
YTEAFINEIFRHTSFLPFTIPHC (2)
TTADTTLNGYFIPRKTCTFINMYQVNHDE (2)
TIWDNPNLLRPERFLNENRELNKNLIEKIFIFGMGIQKCL
REEVAQNEVFVFITTVLQQLTLKKCPVVKLDLTPTYGLVMKPKPYQLPAEPRSMGSSCS*
CYP2 Family
90% to 2A13 86% to 2A7
CB434432.1 CB463229
TC193989
MLASGLLLVALLACLTIMVLMSVWRQRNLKGKLPPGPTPLPFIGNYLQLNTEQMCNSLMK
ISEHYGPVFTV
HLGTRQIVVLCGYDAVKEALVDQAEEFSGRGKQATFDWLFKGYGVAFSNGERAKQLRRFS
ITTLRDFGVGKRGIEERIQEEAGFLIEAFRGTRS
AFIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTGQ
1
LYEMFYSVMKYLPGPQQQAFKELQGLEDFIAKKVEQNQRTLDPNSPRDFIDSFLIRMQEEKENPNTEFYRK 177
178 NLVMTTLNLFFAGTETVSTTMRDGFLLLMKHPDVEAKIHEEIDRVIGKNRQPKFEDRAKM
357
358
PYTEAVIHEIQRFGDMIPMGLARRVTKDTKFRDFLLPKGTEVFPMLGSVLRDPKFFSNPR 537
538
DFNPQHFLDEKGQFKKSDAFVPFSIGKRYCFGESLARMELFLFFTTIMQNFRFKSPQS 711
712
PQDINVSPKLVGFATIPPNYTMSFLPR*
CYP2B6
76% to 2B6 CN790156.1
CN787808.1 CB222090 TC193614
40
MELSMLLLFALLTGLLVLLARGRPKAHGRLPPGPRPLPFLGNLLQMDRKGLLKSFLR
FQQKYGDVFTVYLGPRPVVIICGTEAIREALVDQAEVFSGRAKIAVVDPIFQGY
GVIFANGERWKALRRFSLATMRDFGMGKRSVEERIQDEAQCLVEELRKSQ
GALQDPVFYFHSITANIICSIVFGKRFDYRDPEFLRLLELLFQSFVLISSLSSQ
LFELYSSFLKYFPGSHRQIYKNLQEINVFIGRSVEQHRETLDPNAPRDFIDCYLLRMEKDKSNPQSQFDHQN
LIMSVLSLFFAGTETTSTTLRYGFLLMLKYPHITERIQKEIDQVIGSYR
PALDDRAQMPYTDAVIHEIQRFADLIPIGVPHMVTKDTHFRGYILPK
GTEVYPVLSSALHESCYFEKPDDFNPDHFLDANGVVKKNDAFMPFSI
GKRICLGEGIARIELFLFFTTILQNFSVASPVAPEDIDLTPQESGVGNVPPNYRIQFLPRQRG*
CYP2B pseudogene fragment
SCAFFOLD
393933 stop codon same as in human 2B7
PALDDRAQMPYTDTVIHEIQRFADLISIGVSHMDAKDAHF*GYILPK
CYP2C85 missing exons 8,9
76% to 2C18 CB422177
AAFC02018840.1 TC198839 SCAFFOLD251527 SCAFFOLD131911
SCAFFOLD45421
Last two exons
from XM_612374, see controversy below about exon 6
MDLPVVLVLCLCCLLLISLWKQSSGKGKLPPGPTPLPILGNILQLDVKDISKSVSN
LSKVYGPVFTLYFGMNPLVVLHGYEAVKEALIGLGEEFSGRGSCPVIQRASKGY
GVIFSNGKIWKETRRFSLMTLRDFGMGKRSMEDRVQQEACCLVEELRKTD
GLPCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ
LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ
EKHNQELEYTVENLAHTVLDLFVAGTETTSTTLRYGLLLLLKHPEVT
AKVQEEIDHVIGRHRSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK
GTGILTSLTSVLYDDKEFPNPEVFDPGHFLDESGNFRKSDHFMAFST
GKRICVGEGLARMELFLFLTTILQNFTLKSVVDPKDLDTTPVVNGLLSVPPFYQLCFIPV*
duplicate
sequence on same contig AAFC02018840.1
EST
CB422177.1 matches yellow region
PCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ
LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ
EKHNQELEYTVENLAHTVLDLFVAGTETTSTTLRYGLLLLLKHPE
TAKVQEEIDHVIGRHQSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK
CYP2C
fragment C-term exons 8,9
CB421823
AAFC02018839 (exons 8,9) 88% to 2C87 81% to 2C18
Matches XM_612374 predicted from NW_616319
Same
as 2C85 except yellow area in exon 6
EST
CK949679.1 matches this yellow region, this EST is from 2C87
There
might be an assembly error joining an exon from 2C87 into the 2C85 gene
Or
there might be two exon 6 sequences in one gene.
SCAFFOLD54892
matches exon 6
SCAFFOLD131911
matches exon 5
LPCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ
LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ
EKHNHQSEYTFENLTITVSDLFGAGTETTSTTLRYGLLLLLKHPEVT
AKVQEEIDHVIGRHRSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK
GTGILTSLTSVLYDDKEFPNPEVFDPGHFLDESGNFRKSDHFMAFST
GKRICVGEGLARMELFLFLTTILQNFTLKSVVDPKDLDTTPVVNGLLSVPPFYQLCFIPV*
CYP2C86
58% to 2C19 CB428210
AAFC02035479.1 complete
MERLEITTLALVICVTCLVFLFVWKKSHKGLGKLPPGPTPLPIIGNLMQLNLKDIPASLSK
LAKQYGPVYTLHLGSQTTVVLHGYEVVKEALIDQGDEFLGRAHFPIIDDTQRGY
GLIFSNGDTWKQMRRFSSLMTLRDFGMGKRSLEERIQEEAQFLVEEFRKSE
AQPFNPAVTLSCATCNIICSILFNERFHYQDKTLHSLLDLLNENFNRISSLWNQ
IYNLWPKLIKPLPGEHRAFSKRLKDVHYFVLEKVKEHQKSLNHNNPRDYIDCFLSRMEQ
EKQNPESQFHLENLATCGSNLFSAGVETTTATLSYGFLLLMKYPEVQ
AKVHEEIDRVIGRTRSPCMKDKMKLPYTEAVLHEIQRYVTLVPSNLPHAVVQDTKFRQYVIPK
GTTVLPLLSSILYDCKEFPNPEKFDPGHFLDKNGSFRKTKYFVAFSI
GKRACVGEGLAQMELFLFFTTILQNFVLKPLGETKDIETKPIVIGLINMPPPFKLCLIPR*
CYP2C87
78% TO 2C19
CN791444 CK951564 SCAFFOLD
53069 CN791418.1 CK949679.1 CB441113
MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNIFQLDVKNISKSLTS
LSKVYGPVFTVYFGMKPTVVLHGYEAVKEALIDLGEEFSRRGSFPVIERNVKGH
GIVFSNGKTWKETRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTN
GLPCDPTFILGCAPCNVICSIIFQNRFDYKDQTFLNLMKTINENIKILGSPWIQ
VLNIFPVLLDFFPWSYSYKKLYTNTAYVKNYVLEKTREHQASLDINNPRDFIDCFLIKMEQ
EKHNHQSEYTFENLTITVSDLFGAGTETTSTTLRYGLLLLLKHPEVT
AKIQEEIDRVIGRHRSPCMQDRTHMPYMDAVLHEIQRYIDLAPTSVPHAVNCDVKFRNYLIPK
GTDILTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFRKSDYFMAFSAGKRVCVGEGLA
RMELFLFLTTILQTFTLKSVVDPKDLDTTPAVTGIANVPPPYQLCFIPV*
CYP2C87-de2b fragment exon
2,
6kb
downstream of 2C87 without an intervening exon 1, same orientation
LSKVCGPVFTVYFGMKPTVVLHGYEALQEALIDLGEEFSGRYSFPVNEKTRRGH
CYP2C fragment exons 7,8,9
BZ845111 SCAFFOLD133344 XM_612392 predicted from NW_616354 81% to 2C89
AAFC02088219
Note exon
7 = same seq as 2C87 but exons 8,9 differ
Possible
alternative splice variant of 2C87.
EST CK949679.1 supports
Exon 7
joining with different exons 8 and 9 than these
So these
exons may be skipped over.
Gene order
might be 2C87 exons 1-7, then these exons 8,9
Follwed by
CYP2C87-de2b and the exons 8,9 of 2C87 above
AKIQEEIDRVIGRHRSPCMQDRTHMPYMDAVLHEIQRYIDLAPTSVPHAVNCDVKFRNYLIPK
GTTILTSLTSVLHDGKEFPNPEQFDPGHFLDKSGNFKKSDYFMVFSA
GKRVCVGEGLARMELFLLLVSILQKFTLKPLVDPKNIDTAPLLKGVGSIPHFYEVCFIPV*
CYP2C88
78% TO 2C19
CN786649 CK956312
CN787335.1 CN786468.1 CC503360
MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNILQLDVKNISKSLTN
LSKVYGPVFTVYFGMKPIVVLHGYEAVKEALIDLGEEFSGRGMFPLAERANIVN
GILFSNGKTWKEIRRFSLMTLRNFGMGKRSIEDRVQEEACCLVEELRKTN
GLPCDPTFILGCAPCNVICSIIFQNRFDYKDPVFLDLMERLNEILRILSSPWVQ
VCNNFPALFDYLPGSHNKVLKNVANLKSFVLEKAMEHKASLDINNPRDYIDCFLIRMEQ
EKQNQQLEFTLENLTTTVFDLFGAGTETMSTTLRYGLLLLLKHPEVT
AKVQEEIDRVIGRHRSPCMQDRSHMPYTDAVVHEIQRYIDLVPSSLPHMVTHDIELRNYIIPK
GTGVLVSLTSVLYDDKVFPNPEMFDPGHFLDDSGNFKKSDHFMPFSA
GKRICAGESLARMEVFLFLTVILQKFTLKSVVDPKDIDTTPIANGFASVPPPYKLCFIPL
CYP2C89
missing exon 1 and part of exon 2
69% to 2C18 TC211258
CB531366 AY265992
XXXXX
3
XXXXXGPVFTLYFGMKPTVVLHGYEAVKQVLIDQSEEFSGRGSLPVADNINQGL
GIVFSNGEIWKQTRRFSLMVLRNMGMGKRTIEHRIQEEALCLVEALKKTN
GSPCDPTLLLSCAPCNVICSIIFRNRFEYNDERLLTLIKYFNENSRLVSTPWVE
LYNTFPSLLHYFPGSHNTIFKNMTEQRKFILEEIKKHQESLDLNNPQDFIDYFLIKMEK
EKHNKHSEFTMDNLITTVWDVFSAGTETTSLTLRYGLLLLLKHPEVT
AKVQEEIDRVVGRNRSPCMQDKSCMPYTDAVLHEIQRYIDLVPSSMPHAATQDVKFREYLIPK
GTVILTSLTSVLHDDNEFSNPGQFDPGHFLDESGNFKKTDHFMAFSA
GKRVCVGEGLARMELFLLLVSILQHFTLKSVVDPKHIDTAPSFKGLISIPPFCEMCFIPV* 1292
CYP2C90
partial seq. missing exon 1
CB222086 82% to 2C18 77% to 2C87 SCAFFOLD35528 AAFC02073759
DR713168
LSNTYGPVFTVYFGLRPTVVLHGYEAVKEALIDQGEEFSGRGNIPMSQRVNKGY
GIIFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEAHCLVEELRKTN
GSPCDPTFILGCAPCNVICSIIFQNRFDYTDQNFLNLLDKFNENLQVVSSPWMQ
VCNTFPILIDYFPGSHNKLFKNFAYIRSYVLEKVKEHQATLDINNPRDFIDCFLIKMEQ
EKHNQEMEFTFENLIASVSDLFGAGTETTSTTLRYGLLMLLKHPEVT
AKVQEEIDRVIGRHRSPCMQDRSHMPYMDAVVHEIQRYIDLVPTNLPHAVTRDIKFRNYLIPK
GTTVVTSLSSVLHDEKEFPNPKVFDPAHFLDESGNFKKSDYFMAFSA
GKRSCVGEGLARMELFLFLTTILQKFTLKSVVDPKDLDTTPVSSGFGHVPPPYQLCFTPL*
CYP2C exon 1
SCAFFOLD291268
MDLFVVLVICLSILIFLFLWNQRHAKGKLPPGPTPLPIVGNILQINIKNVSKSISK
CYP2C partial pseudogene sequence
with frameshift and stop codon exons 1-3
SCAFFOLD113483
MDLAVVLVLYFSCLLLLSLWKQSSGKGKLLPGPTPLPILGNTLQLDVNDISKSLSD
LSEVCGPVFTVYFGMKPTVVLYGYEAVKETLI
DLGEEFSGRGSLPLPERISKGH
GILFSSGKRWKETRRFSLVTLRNFRMGE*SVEDRVQEEARCLVEELRKTNG
CYP2C pseudogene fragments
SCAFFOLD51208
EKHNQELEFTVENLMITVSDLFGAGTEMTSTTLRYGLLLLLKHPEVT
AKFQEEIDHMIGRDQSPCMQDRSHMPYVDAVVHEIQRYIDL
(frameshift)
IPTNLPLAVTHDVKFRNYLIPK
Gap
missing exon 8
XXXXXXXEGLTCVELFLFLTTILQ
(frameshift)
KFTLKSVVDPKDIDTTPIVN
CYP2C fragments exons 5-7,
two copies, 94% identical
BZ878104 78%
to 2C18 87% to 2C89 SCAFFOLD40567 no ESTs
AAFC02042798.1
LYNAFPSLLHYLPGSHNTLFKNMTEQRKFILEKIKEHQESLDLNNPQDFIDYFLIKMEK
EKHNKQSEFTMDNLITTVWDVFTAGIETTSISLKYGLLLLLKHPEVT
AKVQEEINRVVGRNRSPCMQDRSRMPYTDAVIHEIQRYIDIVPNNLPHAAAQDIKFREYLIPK
Second partial
gene on same contig AAFC02042798.1
Note:
exons 8,9 not found between these.
LYNAFPSLLHYLPGSHNTLFKNMTEQRKFILEKIKEHQESLDLNNPQDFIDYFLIKMEK
EKHNKQSEFTMDNLITTIWDVLTAGIETTSLTLRYGLLLLLKHPEVT
AKVQEEINHVVGRNRSPCMEDRSRMPYTDAVIHEIQRFIDLVPNNLPHAAAQDIKFREYLIPK
CYP2D14
TC205271 78% to 2D6
MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPTPLPVLGNLLQVDFEDPRPSFNQ
LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPPAVYEHLGYGPRAEG
VILARYGDAWREQRRFSLTTLRNFGLGKKSLEQWVTEEASCLCAAFADQA
GRPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIIKLLDLTEDGLKEEFNLVRKV
VEAVPVLLSIPGLAARVFPAQKAFMALIDELIAEQKMTRDPTQPPRHLTDAFLDEVKE
AKGNPESSFNDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR
RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK
GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA
GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSEHGVFAFLVTPAPYQLCAVPR*
CYP2D43
SCAFFOLD 102164
5681
MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPMPLPVLGNLLQVDFEDPRPSFNQ
LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPQALYKHLGFGPRAEG 6760
7291
VILARYGNAWREQRRFSLSTLRNFGLGKKSLEQWVTEEASCLCAAFADQA 7449
7550
GHPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIVKLLDVMEDGLKEEMKIMRQV 7714
8109
VEAVPVLLSIPGLAAKVVPGQKAFMTLVDELIAEQKMTRDPTQPPRHLTDAFLDEVKE 8288
AKGNPESSFSDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR 8591
8806
RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK 8985
9424
GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA 9603
GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSDHGVFVALVTPAPYQLCAVPR 9843
CYP2E1
AJ001715 TC189658 79% to
human 2E1
MAALGITVALLVWMATLLFISIWKHIYSSWKLPPGPFPLPIIGNLLQLDIKNIPKSFTR
LAERYGPVFTLYLGSQRAVVVHGYKPVKEVLLDYKNEFSGRGENPGFQMHKNN
GIIFNNGSTWRDTRRFSLTTLRDLGMGKQGNEQRIQREAHFLLEVLRKTQ
GQPFDPTFVVGFAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQ
LYNNFPDYLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSCPRGFLDTMLIEM
AKERHSVDPMYTLENIAVTVADLLFAGTETTSTTLRYGLLILMKYPEVE
EKLHEEIDRVIGPSRIPAVKDRLDMPYLDAVVHEIQRFIDLLPSNLLHEATQDTVFRGYVIPK
GTVVIPTLDSVLHDRQEFPEPEKFKPEHFLNENGKFKYSDHFKAFSA
GKRVCVGEGLARMELFLLLAAILQHFNLKSLVDPKDIDLSPIAIGFGKIPPRYKLCLIPRSKV*
CYP2F1
missing exon 1
AW312559.1 AAFC02178016.1
AAFC02183743.1 AJ459276 CC540963 BAC end
XXXXX
LSKEFGAVYTVYLGPRRVVVLSGYQAVKEALVDQAEEFGGRGDYPVFFNFTKGN
GIAFSNGDRWKVLRKYSVQILRNFGMGKRTIEERILEEGHFLLEELRKTQ
GKPFDPTFVVSRSVSNIICSVIFGSRFDYDDDRPLSIIHLINENFQIMSSPWGE
MYNIFPNLLDWVPGPHRRLFKNYGRIKDIIARSVREHQASLDPNSPRDFIDCFLTRWH
QEKQDPLSHFFMDTLLMTTHNLLFGGTETVGTTLRHAFRLLMKYPEVQ
VRVQEEIDRVVGHERLPTVEDRAAMPYTDAVIHEVQRFADVIPMSLPHRVTRDTNFRGFTIPR
GTDVITLLNTVHYDPSQFLKPKEFNPEHFLDANMSFKKSPAFMPFSA
GRRLCLGEALARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPYQLCVLAR
CYP2G1
83% to mouse (88% to human
pseudogene)
Database
location : AAFC01082069 3860 to 4039 (+)
Genomic
location : SCAFFOLD219575
3860 to 4039 (+)
Database
location : AAFC01082069 4854 to 5015 (+)
Genomic
location : SCAFFOLD219575
4854 to 5015 (+)
Database
location : AAFC01082069 6748 to 6897 (+)
Genomic
location : SCAFFOLD219575
6748 to 6897 (+)
Database
location : AAFC01082069 8738 to 8899 (+)
Genomic
location : SCAFFOLD219575
8738 to 8899 (+)
Database
location : AAFC01082069 9151 to 9327 (+)
Genomic
location : SCAFFOLD219575
9151 to 9327 (+)
Database
location : AAFC01202105 145 to 300 (-)
Genomic
location : SCAFFOLD199473
145 to 300 (-)
Database
location : AAFC01421374 997 to 1185 (+)
Genomic
location : SCAFFOLD281678
3294 to 3482 (-)
Database
location : AAFC01421374 1314 to 1454 (+)
Genomic
location : SCAFFOLD281678
3025 to 3165 (-)
Database
location : AAFC01603617 586 to 765 (+)
Genomic
location : SCAFFOLD281678
1315 to 1494 (-)
3860
MELGGAFTIFLALCLSCLLILIAWKRMSKGGKLPPGPTPIPFLGNVLQVRTDATFQSFMK(0) 4039
4854 LKEKYGPVFTVYMGPRPVVVLCGHEAVKEALVDRADEFSGRGELASVERNFQGH(1)5015
6748
GVALANGERWRILRRFSLTILRDFGMGKRSIEERIQEEAGFLLVELRKTR(1)6897
8738
GARIEPTFFLSRTVSNVISSVVFGSRFDYEDQQFLKLLQMINQSFIEMSTSWAQ (0) 8899
9151
LYDMYSGIMQYLPGRHNRIYYLIEELKDFIASKVKINEASLDPQNPRDFIDCFLIKMHQ(0) 9327
300
DKNNPHTEFNLKNLVLTTLNLFFAGTETVSSTLRYGLLLMMKHPEVE(1)145
997
AKIHEEIDQVIGPHRIPSVDDRAKMPYTDAVIHEIQRLTDIVPMGVPHNVIRDTHFRGYLLPK(0) 1185
1314
GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGHFKKNEAFVPFSS(1) 1454
586
GKRICLGEAMARMELFLYFTSILQNFSLRSLVPPADIDITPKVSGFGNIPPTYELCFMVR(1) 765
CYP2J26
complete 2JD
CB465835.1 SCAFFOLD215002
CB462842.1 SCAFFOLD278853
SCAFFOLD107814 CB465835.1
SCAFFOLD215002 CB462842.111497
CB532345.1
MLEALGSLVAALWTTLRPGIVLLGAFVFLLFADFLKRQHPKNYPPGPLRLPFIGNFFHLDLGKGILVPQQ
VVKKYGNIIRLDFGVIHFIVITGLPYIKEALVNQEQNFVNRPMIPLQKHIFNNK
GLVRSNGQVWKEQRRFTLTTLRNFGLGRKSLEERIQEEVTYLIQAIGEEN
GQPFDPHFIINNAVSNIICSITFGERFDYKDDQFQELLRLLDEILCIQASVCCQ
LYNAFPRIMNFLPGSHHTLFRKWEKLKMFVANVIENHRKDWNPAEARDFIDAYLQEIEK
11676
HKGNATSSFDDENLICSTLDLFLAGTETTSTTLRWGLLFMALNPEIQ
14705
EKVQAEIDRVLGQSQKVSTASRESMPYTNAVIHEVQRMGNIVPMNVPREVTVDTVLAGYH
15236
LVKGTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI
GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRESLTSSPASYRLCAIPRA*
25310
CYP2J27
SCAFFOLD215002
complete 2JC
BM258720.1 CK950217.1
SCAFFOLD388989
This gene has an extra exon
5
MLEALGSLAAALWAALRPGTVLLGAVVFLFLDDFLKRRRPKNYPPGPPPLPEVGNFFQLDFDKAHLSLQR
FVKKYGNVFSVDFGIFRSVLITGLPLIKEALVHQDQNFANRPLIPIEKRIFNNK
37352
GLIMSNGHVWKEQRRFALTTLRNFGLGKKSLEERIQEEAAYLIQEIGEEN
39667
GQPFDPHFTINNAVSNIICSITFGERFDYQDDQFQELLRLFDEMMHLRTSTCCQ
40221
LYNIFPRIMSFLPGPQHALFSKWEKLKMFIAGVVENHKRDWNPAEARDFIDAYLQEIEK
42145
HKGNATSCFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ
43949
EKVQAEIDRVLGQSQKPSMAARESMPYTNAVIHEVLRMGNILPLNVPREVTVDTVLAGYRLPK
GTMVTTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI
GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRMSMTLSPLSHRLCAIPRA*
SCAFFOLD215002 BM258720.1 CK950217.1
extra exon
5
LSNVFPRIMNFLPGPQHTLFSKWEKLKMFIAGVIENHKRDWNPAEARDFVDAY
41591
CYP2J28
complete 2JA
SCAFFOLD256921 CK945989.1
CK945989.1 CO259427.1
SCAFFOLD295402 CO259427.1
SCAFFOLD309869 CO893955 CO259427.1
SCAFFOLD213132 CO259427.1
SCAFFOLD247979 CO893955 CO259427.1
BE664093.1 SCAFFOLD65540
CO893955 CK770461.1
MLEALGSLAAALWAALRPGTVLLGAIVFLLLTDLLNRRRPKNYPPGPPRLPFVGNFFQLDFEQGHLSLQR
FVKKYGNLFSLEFGDLPSVVITGLPLIKEVLVYQDQNFVNRPISPIRERVFKKN
GLIMSNGHIWKEQRRFSLTALRNFGLGRKSLEERIQEEVAYLIQAIGEEK
GQPFNPHFKINNAVSNIICSITFGERFDYQDDQFQELLRLLDEVTYLETTVWCQ
LYNVFPRIMNFLPGPHQMLFSNWRKLKMFVARVIENHKRDWNPAEARDFIDAYLQETEK
HKGNAASSFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ
716
EKVQAEIDKVLDESQQPSMATRESMPYTNAVIHEVQRMGNILPLNVPREVTVDTVLAGYHLPK
GTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKREAFLPFSI
GKRMCLGEQLARTELFIFFTSLLQKFTFRPPEHEELSLKFRMGLTLSPVSHCLCAVPRA*
CYP2J29
complete 2JB
SCAFFOLD25620 DN825584.1
DN825609.1 CN793001.1
SCAFFOLD126970
SCAFFOLD87184 CK957631.1 SCAFFOLD166503
CK834395 CK945908.1
SCAFFOLD166503 CK960783.1 CK957063.1
SCAFFOLD245478 CK945645.1
MLSSLAAALWAALRPGTVLLGAVAFLFFADFLKRRRPKNFPPGPAGLPFVGNSFQLDPEKVHLTLQQ
FVKKYGNVFSLDFGTFPSILITGLPLIKEALVHQGENFSKRPVMPLQERIFNTK
GLIMSSGHIWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQMIREEN
GKPFDPHFIINNAVSNIICSITFGERFDYQDSQFRELLRLLDEVLNLHTSLCCQ
LYSVFPRIMNFVPGPHQTLFSNLEKLKMFVAEMIENHKRDWNPAEARDFIDAYLQEIEK
8435
HKGGDASSFREENLIYSTLDLFLAGTETTSTSLRWGLLYMALNPEIQ
5634
EKVQAEIDRVLGQSQQPSTAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK
5455
GTVVVTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI
2548
GKRMCLGEQLARAELFIFFTSLLQKFTFRPPENEKLSLKFRVSLTLAPISHRLCAVPRG*
CYP2J30
complete 2JH
DN825200.1 CK950246.1 SCAFFOLD281439
SCAFFOLD298803 CK940975.1
SCAFFOLD170755 CK945247.1
CN793929.1
MLEALSSLATALWAALRPDTVLLGTLAFLLFVDFLKRRHPKNYPPGPPGLPFVGNLFQLDPEKVPLVLHQ
FVKKYGNVFSLDFGTVPSVLITGLPLIKEVLVHQGQIFSNRPIVPLQEHIINNK
GLIMSSGQLWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQTIREEN
GQPFDPHLTINNAVSNIICSITFGERFDYQDDQFQELLRMLDEILNLQTSMCCQ
LYNVFPRIMNFLPGPHQALFSNMEKMKMFVARMIENHKRDWNPAEARDFIDAYLQEIEK
HKGDATSSFQEENLIYNTLDLFLAGTETTSTSLRWGLLFMALNPEIQ
EKVQAEIDRVLGQSQQPSMAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK
15084
GTMVMTNLTALHRDPTEWATPDTFNPEHFLENGQFKKRESFLPFSI
12265
GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEQLSLKFRVSLTLAPVSHRLCAVPRG*
CYP2J31P pseudogene
CYP2JF SCAFFOLD295104 no ESTs possible pseudogene (N-term
is short)
some frameshifts present, exons 6-8 missing
MGAAAFLFVVHLKRRRGKNYPPGPPGLPFLGNFFHLDLKQLHLSLQQ
IVKKYGNMISLEMGGFSTVFFKWIAQNQRSPCLPGPKLVNHPIQRIQENIFKKH
5343
GLIMSNGHIWKEQRRSALTTLRNFGLGRKILEECIQEEAAYLIQTVGEEN 8001
XQPFDPHFTINNAVSNIVCSIAFGELFDYQDSXXQELLRLMDEAMYLQTSVRCRV
8538
LYNFFARIMNFLPGPHQTLFIKWEKLNMFIDSVIENHRRDWNPAEPRDFTDA
15856
GMWMCPGEQLARTELFIFFTSLLQKFTFRPPGDEKLSLQFRVSLTISSVSHWLC 16020
CYP2R1
scaf.96150, 81935, 91707
CK969495.1
MWEPHSAEAFVAALGGVFFLLLFALGVRQLLKQRRPSGFPPGPSGLPFIGNIYSLAASAELPHVYMKKQSQVYGE
(0)
IFSLDLGGISAVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMG
(1)
GLLNSRYGRGWVDHRKLAVNSFRCFGYGQKSFESKILEETKFFIDAVETYNGSPFDLKQLV
TNAVSNITNLVIFGERFTYEDTDFQHMIELFSENVELAASATVFLYNAFPWIGILPFGKH
QQLFRNAAVVYDFLSRLIEKASINRKPQLPQHFVDAYLDEMERSKNDPSSTFSKENLIFS
VGELIIAGTETTTNVLRWAVLFMALYPNIQ
(1)
GQVQKEIDLIIGPSGKPSWDEKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGYSI
PKGTTVITNLYSVHFDEKYWRDPEIFYPERFLDSSGHFAKKEALIPFSL
(1)
GRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPNLKPRLGMTLQPQPYLICAERR*
CYP2S1
AAFC02159945.1
MEAAGTWALLLLLLLLVVTLVLPATWDRGHLPPGPTPLPLLGNLLQLRPGALYLGLLR
LSKKYGPVFTVYLGPWRRVVVLVGHEAVQEALGGQAEEFSGRGTVATLDGTFDSH
GVFFSNGERWRQLRKFTTLALRDLGMGKREGEELIQAEARCLVEALQGTK
GRPFDPSLLLAQATCNIICSLVFDLRLPYDNEEFQAVVRAAGGIAVGVSSPWGQ
TYEMFSRFLQRLPGPHTQLLRHLGTVAAFAAQQVWQHKGSLGTSGPVRDLVDAFLLKMAK
EKQDPNTEFTAKNLLMTVVYLLFAGTVTVSTTIRYTLLLLLKYPQVQ
ERVQEELMRELGAGQRPSLGDRARLPYTDAVLHEAQRLLALVPMGIPRALTKTTRFRGYTLPQ
GTEVFPLLGSILHDPAVFEEPKEFNPGRFLDADGKFKKHEAFLPFSL
GKRVCLGEGLARTELFLLFTAILQAFSLEGPCPLGALSLQPAISGLFNIPQAFQLQFRPR*
CYP2T2P
ortholog
AAFC02046938 65% to rat 2t4
66% to 2T2P human
MMISGIIALSLLVLLLAPARWGWGARSTQRQGALPPRATPLRLLGSLLQLRIWRPGPCTHG
LSGRCGPVFTVCLGQCPVVVLCRYAALRDALVLQADAFSGRGAMAVFKRFTRGN
GIAFSKGPRWPTLRNFALGALKEFGLGTQTIEERVLEEAACLLGDFQATGG
GAPFDPQRLLDNAVSNVICSVVLGNHYGYEDMEFLRLLDLFNDNFRIMSSRWGE
XXXXXSLLDWLPGLHH*IFRNFAXLRVFISQQIQLHQQTR*SGKPHDFIDXXXXXXX
GTENPESHFQAETLAMTMHNLFFGXXETTSTTLRYGLILLKYSFVA
AKVQAELDDMVGRMCAPTLEDREHLPYTNTVLHEIQCFISVVPFGLPSALTCDTHLRGYFLPK
GTFVIPLLVSTHWVPTQFKNPECFNPTNFLNDQGEFQSNAFTPFAL
GTCLGAGLAPTDIFLFLTSILLRFFLLPVGSHSDTDLTPQCTGLGNVPPAFQLRLVAR*
CYP2U1
XM_615255 AAFC02193700
gnl|ti|665258407
MASPGLPQPPTEDAAWPLRLLHAPPGLLRLDPTGGALLLLVLAALLGWSW
LWRLPERGIPPGPAPWPVVGNFGFVLLPRFLRRKSWPYRRARNGGMNASGQGVQLLLADL
GRVYGNIFSFFIGHYLVVVLNDFHSVREALVQQAEVFSDRPRVPLTSIMTKGKGIVFAHY
GPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFRYVKEEMQKHGDAPFNPFPIVNNAVSN
IICSLCFGRRFDYTNSEFKQMLTFMSRALEVCLNTQLLLVNICSWLYNLPFGPFKELRQI
EKDLTLFLKKIIKDHRESLDVENPQDFIDMYLLHVEEEKKNNSNSGFDEDYLFYIIGDLF
IAGTDTTTNSLLWCLLYMSLHPNIQEKIHEEIARVIGADRAPSLTDKAQMPYTEATIMEV
QRLSTVVPLSIPHMTSEKT
VLQGFTIPKGTIILPNLWSVHRDPAIWE
KPNDFYPDRFLDDQGQLIKKETFIPFGI
GKRVCMGEQLAKMELFLMFVSLMQSFTFVLPKDSKPILTGKYGLTLAPHPFNIIISKR
CYP2W1
partial sequence
scaf.279079, 426471 CB440236.1 AAFC02007621.1
XXXXX
LGKQYGPVFTVHLGHQKTVVLTGYEAVKEALVGTGQELAGRPPIAIFQLINGGG
(1)
GVFFSSGPRWRAARQLTVRALHGLGVGRAPVANKVLQELRCLTAQLDSYE
(1)
GRPFPLALLRWAPSNITFTLLFGQRFDYRDPVFLSLLGLVDEVMVLLGKPSVQ
(0)
LFNLYPRLVALLQLHRPVLRKIEEVRAILRALLEARRHRTPPRGPQQSYLDALIQQGQ
(0)
XXXXX
XXXXXXXXXXXXXXXXPRPEDVHALPYTNAVLHEVQRFITLLPHAPRCTVANTQLGPYLLPK
GTPVLALLNSVLLDETQWKTPRQFNPGHFLDANGRFVKRPAFLPFSA
XXXXX
CYP2AB1P
pseudogene
scaf.15709 BI535102
MCPLLIWLGLLAASFLLLKFSIIYWERNHLPPDPFPFPILGNPWQLSFQLHPATLLQ
LAQTHGHVFTVWVGPTPVVVLCSFQA
KEALVSHSEQLSGWPLTPLFQDLAGERG
GVICSSGRTRRQ*RRFCLAALQGLG*GPLALELRLQEEAAGLVEAFHWEQ
GGPFDPQAPIVRSTARVTGALVFGRHFLSEDPFFQELI*ATNFGLAFXXXXXX
QLNDLFPWAFRCLPGPYREMFRYQKAVRGYIHREIMRHKLRTSEAPKDFISCYLAQIIK
ATDDPVSTFNEENLIQVVVGLFLGGTDTTGTTLYWVLIYMIQYGAIQS
ERVQQELVTVLGTSGAICYKDHEQLPHICTLLHEAQRLSSVA*V
AVCQCVTSTHVHGHPVPK
GTIILPNLAAVLCDPECWRTSRQFNPGHFLDKDGNFVVRDIFPPFSA
GHQMCLGD*LAQMKLLLMFATLLGTFSFQLPGRSPGLRLEYNFGGTRKPLPQKIYAVSRLNCPHPGPREEVL*
CYP2AC1P
pseudogene
67% to rat 2AC1
SCAFFOLD151603,
SCAFFOLD251118 AAFC02071354 pseudogene
BZ940825 CC910297
MSGFESSFILPILSLILIFILNIKIVMTKASKQHFPPVPRPLPIIGNLHILNLKRPYQTMLE
(0)
LSQKYGSIYSIQIGPRKVAVLxGYETVKDVLVNHTDQFGEWFHVPISERLFEGK
GIFFSHSDTSKIIRFTLTTSQNFGMGKKALEDTIIGESQHLIRNFETDKG
GKPFEVKTLTNASVANINVSVLLGKGFDYQNTPFLRLLTLIDQSVKLIVSPPTA
LFNMFPVLRFLLKTYKNILRNKDELFSFIRMTFLHHHHKLDKNDPRSLTDAFLVRQQE
DTSTDYFNDDTLVVLVNNLFAAGTESMVSTLCWGILFMSRYPEIQS
KVHDEIAKVMGSTQP*MAH*TQMPYTDAVILEVQRFADILPTGLPRATTTNTIFKNNYIPK
GTEVIFLLTSVL*DQTQWENPATFNPEHFLDSIEKFIKKEAFISFSV
(1)
SPL*CAGESLAKMELLLFFMSLLQKFTFQPPPGVSHLDLDPTRDTGVVIQPMPHKIRALPRA
CYP3
Family
CYP3A28
TC211564
Y10214 73% to 3A5 SCAFFOLD259874
MELIPSFSMETWVLLATSLVLLYI
(2)
YGTYSYGLFKKLGIPGPRPVPYFGSTMAYHK
(0)
GIPEFDNQCFKKYGKMWG
(2)
FYEGRQPMLAITDPDIIKTVLVKECYSVFTNRR
(0)
IFGPMGIMKYAISLAWDEQWKRIRTLLSPAFTSGKLKE
(0)
MFPIIGQYGDMLVRNLRKEAEKGNPVNMKD
(2)
MFGAYSMDVITGTAFGVNIDSLNNPHDPFVEHSKNLLRFRPFDPFILSI
(1)
ILFPFLNPVFEILNITLFPKSTVDFFTKSVKKIKESRLTDKQM
(0)
NRVDLLQLMINSQNSKEIDNHK
(1)
ALSDIELVAQSTIFIFGGYETTSSTLSFIIYELTTHPHVQQKLQEEIDATFPNK
(0)
APPTYDALVQMEYLDMVVNETLRMFPIAGRLERVCKKDVEIHGVTIPKGTTVLVPLFVLHNNPELWPEPEEFRPER
(2)
FSKNNKDSINPYVYLPFGTGPRNCLGMRFAIMNIKLALVRILQNFSFKPCKETQ
(0)
IPLKLYTQGLTQPEQPVILKVVPRGLGPQVEPDFL*
CYP3A74
TC219554
76% to 3A4 Length = 563 N-term CK957598.1
MELILSFSTETWVLLATGLVLLYL
(2)
YGTYSYGLFKKLGVPGPRPLPYFGNILSYRK
(0)
GVCEFNEECFKKYGKIWG
(2)
IFEGKQPLLVITDPDMIKTVLVKECYSVFTNRR
(0)
VFGPSGVMKNAISVAEDEQWKRIRTLLSPTFTSGKLKE
(0)
MFPIIGKYGDVLVRNLRKEAEKGTSVDIKD
(2)
IFGAYSMDVITSTSFGVNIDSLGNPQDPFVENVKKLLRFSILDPFLLAV(1)
VLFPFLVPILDVLNITIFPKSAVNFFTKSVKRIKESRLKDNQK(0)
PRVDFLQLMINSQNSKETDNHK (1)
ALSDQELIAQSIIFIFAGYETTSSTLSFLLYILATHPDVQQKLQEEIDATFPNK
(0)
APPTYDVLAQMEYLDMVVNETLRMFPIAIRLERLCKKDVEIHGVSIPKGTTVMVPISVLHKDPQLWPEPEEFRPER
(2)
FSKKNKDSINPYVYLPFGTGPRNCIGMRFAIMNMKLAIVRVLQNFSFKPCKETQ
(0)
IPLKISSQGVLRPEKPVVLKVVLRDGTISGA*
CYP3A75 missing exons 3-6
SCAFFOLD20102
DN530970.1 CK954349.1 DN532344.1 AAFC02132381
SCAFFOLD237847
MELIPSFSMETWVLLSISLVLLYL
YGTYSHGLFKKLGVPGPRPLPYFGNVLSYRK
VFGAYSMDVITSTSFGVNIDSLGNPQDPFVENAKKLLRFDILNPFLLSV
VLFPFLVPIFEVLNITMFPKSAVNFLAKSVKRIKESRLKDNQK
PRVDFLQLMINSQNSKETDNHK
ALSDQELMAQSVIFIFAGYETTSNTLSFLLYILATHPDVQQKLQEEIDVTFPNK
APPTYDVLAQMEYLDMVVNETLRMFPITVRLDRLCKKDVKIHGVSIPKGTTVTVPISVLHRDPQLWPEPEEFRPER
FSKKNKDTISPYVYLPFGTGPRNCIGMRFAIMNMKLAVVRVLQNFSFKSCKETQ
IPLKINSQGLIRPEKPIFLKVVLRDETISGA*
CYP3A76
TC192141
75% to 3A4 SCAFFOLD300381 AM015519.1 AAFC02005660.1 DR712863.1
MELIPNFSVETWVLLAISLVLLYL(2)
YGTYSHGLFKKLGVPGPRPLPLFGNVLSYRK
(0)
GVCEFDEECFKKYGKMWG
(2)
IFEGKHPLLVITDPDMIKTVLVKECYSVFTNRR
(0)
VFGPMGVMKNAVSVAEDEQWKRIRTLLSPTITSGKLKE
(0)
MFPIIGKYGDVLVRNLRKEAEKGTSVDMKE
(2)
VFGAYSMDVITSTSFGVNIDSLGNPQDPFVENAKKLLRFDILDPFLLSV
(1)
VLFPFLIPIFEVLNISIFPKSAVNFLTTSVKKIKESRLKDTQK
(0)
PRVDFLQLMINSQNSKETDNHK
(1)
ALSDQELMAQSIIFIFGGYETTSTSLSFIIYELATHPDVQQKLQEEIDATFPNK
(0)
APPTYDVLAQMEYLDMVVNETLRMFPIAVRLERFCKKDVEIHGVSIPKGTTVTVPISVLHRDPQLWPELEEFRLER
(2)
FSKKNKDSISPYVYLPFGTGPRNCIGMRFAIMNMKLAVVRVLQNFSFKPCKETQ
(0)
IPLKIKSQGLLRPEKPIFLKVVLRDETISGA*
CYP3A76-ie6b
extra
exon 6 SCAFFOLD300381
other exon 6 at 7654
9898
MFPIIGKYGDVLVRNLRKEAEKGKSVNMKE
Scaffold 211417
AAFC02026180 AAFC02091356.1 CB417720.1 BE750809.1 CYP4Ab
MSVSALSPSRALGGVSGLLQVVSLLGLVLLLIKAAQLYLRRQWLLKALHHFPSPPSHWFCGHKWE
FQEEGELPHLLKRVEKYPRACVRWMWGTRALLLVYDPDYMKMVLGRS
DPKAQIIHRFVKPWI
GTGLLLLEGQTWFQHRRMLTPAFHYDILKPYVGIMADSVRVML
DKWEELVSQDSHLEIFGHVSLMTLDTIMKCAFSQQGSVQTDR
369
SSQSYIQAIKDVSHLIISRLRNAFHQNDLIYRLTPEGHWNHRACQLAHQHT
895
DAVIKERKVRLQKEGELEKVRSRRHLDFLDILLFAR
1445
MENGSSLSDEDLRAEVDTFMFEGHDTTASGISWILYALASHPEHQQRCREEIQS
1690
LLADGASITW
DHLDQMPYTTMCIKEAMRLYPPVPVISRELSKPITFPDGHSLPA
1952
GILVSLSIYGLHHNPKVWPNPE
VFDPTRFAPGSTRHSHAFLPFSGGSR
3927
NCIGKQFAMNELKVAVALTLLRFELSPDSSRVPVPMPVIVLRSKNGIHLQLRKLSDPGGDK
CYP4A41
77% to 4A11 XM_594222
MSVSVLSPTRALGGVSGLLQVVSLLGLVLLLLKAAQLYLRRQWLLKALHQFPSPPSHWFYGHKKE
FQKESELPPLLERVEKYPKACVRWLWGTKALVLVYDPDYMKVFLGRS
DPKPHRTYKYLAPWI
GTGLLLLEGQKWFQHRRMLTPAFRYDILKAYVGIMADSVRVML
DKWEELVSQDSHLEIFGHVSLMTLDTIMKCAFSHQGSVQMDR
SSQSYIQAIRDLSHLIVSRLRNAFHQNDLIYRLTPEGRWNHRACQLTHQHT
DAVIKERKAHLQKEGELEKVRSRRHLDFLDILLLAR
MENGSSLSDEDLRAEVDTFMFEGHDTTASGISWILYALASHPEHQQRCREEIQSLLGDGASITW
DHLDQMRYTTMCIKEAMRL
YPPVPFIGRELRKPITFPDGRSLPA
GILVSLSFYGLHHNPNVWPNPE
VFDPTRFSPGSTQHSYAFLPFSGGSR
NCIGKQFAMNELKVAVALTLLRFELSPDPSRVPVPTPIMVLRSKNGIHLQLRKLSDPGGDKDKL
CYP4A42P pseudogene N-terminal
XM_606400 SCAFFOLD236059 72% to 4A40
KEKELEELLKRVEEFPYAYPCWMWG
DNVHLVVYDPDYMKVVLRRS
NPKSEDIYRFLAPWI
GLLLLEEQTWFQHRQMLTPAFHYDILKPYLGLMADSVQGML
DKWEELVSQDSHLEILGDISLMTLDTIMKCVFSH*GSIRNDR
XXXXXXXXXXXXXXXXXXXLRNVFYQNDLIYRLTPEGQWSY
XXXXXXXXX
TDAVIEERKTCLRKEGELEKVRSRRHLDFLDILLFSRV
NGSSLSDEDLRAEVDTFIFGGHDTTASGISWILYALASHPEHQQRCREEIQSLLGDGASITW
CYP4A pseudogene
C-terminal
Scaf.
42211 pseudogene
GNGTFLLEGQMWFQYWGMLTSAFHHDILRPSMGHMAYSVQLM
TDSIIKGRKSHLHKEGELEKERKWRPLDFLDILLFAR
MKTGSSLSDKDLHAEVDKVMFKGQETTTSGISWILYSLVSHPVHQQR
EEIQRLLGDGASITW
DHLDQMPYTTXCI
QEALRFYPPVPVIGRELSKPITFPVGHSLPA
XXXXXXXXXXXXXXXXXXXXXX
VFDSS*SAPGPAGHNHAFLPFSGGS
CYP4A pseudogene
C-terminal
Scaf.
268824 c-term 4A pseudogene
GIEVALSFYGLHHNPKV*LNPE
VFEPSRFALGSNQHSHAFLPFSGGSR
KCIGKQFVMNEMKVAMALTLLHFELSPDPSRVPCLTP
EIVLRSKNGIHLHLRSL
CYP4A
C-terminal possible pseudogene
Scaf.
305588 83% to CYP4A41 differs from 4A40 and 4A41
no ESTs
LSFYGLHHNPKVWPNPE
VFDPSWFTLGSTRHSHAFLPFSGGSR
NCIRKQSAMNDAKVAVALTLLRFELSSDPFRVLVPAPVMVQRSKNGIHFQLRKLSDPGGDKDK
CYP4B1
AW347853 89%
to 4B2 3 aa diffs to mouse 4b1, 5 aa diffs to 4B2 cow
This seq
is 100% match to pig CJ022616.1
DGRSLPA
GSLISLHIYALHRNSAVWPDPE
VFDPLRFSLENMAGRHPFAFLPFSAGPRNC
Pig CYP4B1
CJ022616.1 BP435822.1 CF179484.1 AW480803.1 CJ026219.1
BP440482.1
80% to human 4B1, 81% to cow 4B2
MVPALLSLSLSHLSLWALGLILALGVLKLTSLLLRRQMLARAMDK
FPGPPTHWIFGHALEIQQTGSLDKMVSWAHQFPHAHPLWLGPFLGFLNIYEPEYAKAVYS
RGDPKASDLYDFFLQWIGKGLLVLHGPKWFQHRKLLTPGFHHDVLKPYVAVMANSARVML
DKWEEKAREDKSFDIYHDVGHMALELLMKC
TFGKGTSGLNYSDNAYHLAVRDLTLLMQQRLSSFQYHNDFIYWLTPHGRRFLRACRVAHD
HTDQVIRERKAALQDKEEQERIQSRRHLDFLDILLGAQDEDRVKLPDADLRAEVDTFMFG
GHDTTTRGISWFLYCMSLNPEHQHRCRDEIREILGDRDSIQGEDLGKMTYLTMCIRESFRLYPP
GPQVYRQLSEPVSFVDGRSLPAGSLISLHIYALHRNSAVWPDPEVFDPLRFSLENMAGRH
PFAFLPFSAGPRNCIGQQFAMNEMKVVAALSLLRFEFAPDPSRPPIPMPQLTLCSKTGIH
LRLTSLGQGSGK*
CYP4B2
TC209863 BF602740
AAFC02041217.1 94% to goat 4B2 only 82% to human 4B1
MVPVLLSLSLSHLSLWAFALVSVLVFLKLTRLLLRRQMLARAMDRFSGPPTHWLFGHALE
IQQTGSLDKVVSWTHEFPYAHQLWVGQFLGFLNIYDPDYAKAVYSR
GDPKAPDFYDFFLQWI
GKGLLVLQGPKWFQHRKLLTPGFHYDVLKPYVALFAESTRVML
DKWEKKAREQKSFDIYSDVGHMALDSLMKCTFGKGTSGLNDR
DNNYYLSVKELTLLMQQRIDSFQYHNDFIYFLTPHGRRFLRACQVAHDHT
DQVIRERKAALQDEKERERIQSKRHLDFLDILLGAW
DEEGIKLSDEDLRAEVDTFMFEGHDTTTSAISWVLYCMSLYPEHQRRCREEIQEILGDRDTLKW
DDLAEMTYLTMCIKESFRL
YPPVPQVYRQLSQPVNFVDGRSLPE
GSLISLHIYALHRNSTVWPDPE
VFDPLRFSPENVAGRHSFAFIPFSAGPR
NCIGQQFAMAEVKVVTALCLLRFEFSPDPSRLPIKMPQLVLRSKNGIHLHLKPLGPGSEK*
CYP4X1
CF763940.1 CF766034.1
CF765861.1 CF768878.1
24776
MESSWLETHWARPFYLTLVFCLALGLLQAIKLYLQRQRLLRDLRLFPSPPTHWLYGHSK 24600
4386
LFKDGKMEILEKIVEKYPCAFPRWIGPFQAFLYIYDPDYAKTFLSRT (1)
DPKSNFLYKFMTASV
1563
GKGLVNLSGPKWSQHRRLLTPGFHFNTLKSFVEVMAQSVNIML 1435
NKWEKICGSQNTLLDIYEHINLMTLDVLMKCIFSWETNCQTDSSHDNYVKATSEGSNILMQR
LFNYFYHYDLIFKLSPLGHRLREVNKILHHHTDKIVKNRRNSLKDENKQTNTQKRKYRDF
LDIVLSAQAENDSFTDADLWSEVNTFMVAGHD
SVSAGISWLLYHLALYPEHQEKCREEIRAILGDGSSITWDQLSEM
SYTTMCIKESLRLAPPAVSISRQLSKPITFPDGRSLPAGMTVVLSIWGLHHNPAVWENPE
VFDPLRFSQECNKRHSHAYLPFSSGPRNCIGQNFAIAEIKVIVALI
LLRFQMRVEPTKPV
VFVPYIFIKPKKGIYLHLKKLP*
CYP4F22
ortholog
MLPITEQLLHLLGLEKTAFRLYAVSGLLLTLLFFLFRLLLQFLRLCWGFYITCRRLRCFPQPPRRNWLLGHLGM
(0)
YLPNERGLQDEKKVLDDMHHVILVWLGPVLPLVVLVHPDYIKPLVGAS
(1)
AAVAPKDDLFYGFLKPWL (1)
GDGLLLSRGDKWSRHRRLLTPAFHFDILKPYMKIFNQCADTMH
(0)
AKWRGLAEGSEVSLDMFEHVSLMTLDCLQKCVFSYNSNCQE
(2)
KMSDYISAIIELSALVVRRQYRLHHYLDFIYYRTADGRRFRQACDTVHGFTTEVIQERRR
ALRQQGAEAWFKAKQGKTLDFIDVLLLAK (0)
DEDGKELSDEDIRAEADTFMFE (1)
GHDTTSSGLSWVLFNLAKYPEYQEKCREEIQEVMKGRELEELEWSGQ
(2)
DDLTQLPFTTMCIKESLRQFPPVTLVSRRCTEDIKLPDGRIIPK
(1)
GIICLVSIYGTHYNPAVWPDSK (0)
VYNPYRFDPDNPQQRSPLAYVPFSAGPR
(2)
NCIGQSFAMAEMRVAVALTLLRFRLFLDRTRKVRRKPELILRTENGIWLKVEPLPPRA*
CYP4F46
SCAFFOLD265009 AAFC02033475
XM_591523 74% to human4F11
MLELSMSRLGLGLVAASPWLLLLVVGASWLLAHILAWTYTFFNNCRLLQCFPQPPKLNWFFAHLYL
(0)
VPMTEQGLRKLTQLAANYSHGYMIWFGPITLMIVFCHPDMLRSITNAS
AAVAPKDMDFYNFLKPWLG
DGLLLSAGDKWSRHRCMLTPTFHFNILKPYMKIFTKSTDIMH
TNWECLVTQGHTRLDMFEHVSLLTLDSLQKCVFSFDSNCQE
KPSEYITTIFELCELAAKRNKQICLHTDFLYFLTLDGWHFRRACRLVHNFTDSIIQEQRCTLPSENI
DDFFKAKVKTKTLDFIDVLLLSK
DEDGKGLSDEE
IRAEADTFMFE
GHDTTASGLSWILYNLAKHPEYQERCGQEVKELLRDCESKEIEW
DDLAHLPFLTMCIKESPRLHPPVTFIFRYCTQDIVLPDGRVIPK
GVVCLIDIFGTHHNPSVWPDPE
VYDPFRFDPENIKGRSPLAFIPFSAGSR
NCIGQTFAMTEMNVVLALTLLRFRFRPDKEEPRRKPELILRAEGGLWLQVELLSAGPQ*
CYP4F47
XM_586481 CN792379.1 80% to
4F11 bottom identical to 615008
MLELSLSWLGLRPVAASPWLLLVLVGASWILAHVLAWTYTFYNNSRHLQCFPQPPKRNWFLGHMGQYPP
TEKGLRDFTQLVANYPQGFRICLGPAIPIIIFCHPDMIRIVANAS
AAAAPKDVIFYDFLKPWLG
DGLLLSAGDKWSRHRRMLTPAFHFNILKPYMKIFTKSADIMH
AKWQRLITEGHTHLDMFEHISLMTLDSLQKCVFSYDSNCQE
KPNEYIASILELSALVAKRNQQIFLHMDFLYFLTPDWRRFRRACRLVHDFTDAVIQERRHTLPSEGT
DDFLKAKVKTKTLDFIDVLLLTK
DEDGKGLSDEDIRAEADTFMFE
GHDTTASGLSWVLYNLAKHPEYQERCRQEVQDLLRDRESKEIEW
DDLAQLPFLTMCIKESLRLHPPVSVISRRYAQDTLLPDGRVIPK
GVICLINIIGTHHNPSVWPDPE
VYDPFRFEPENIKGRSPLAFIPFSVGPR
NCIGQTFAMTEMKVVLALTLLRFRVLPGEEPRRKPELILRAEGGLWLRVEPLSADQ
CYP4F48
XM_584482 CN793339 CK836265
AAFC02027122 CK946303 AAFC02217855.1
MLELSLSRLGLGPLAASPWLLPLLAGVSWILARVLAWTYTFYNNSRRLRCFLQPPKPNWFLGHMNL
VPSTEQGLIYFTQMAANYPRGYLIWFGPIIPMVIFCHPDMLRSITNAS
AAIAPKDMQFYGTLKPWLG
DGLLLSAGDKWSSHRRMLTPAFHFNILKPYIKIFTKSAEIMH
AKWEHLITEGHTHLDMFEHISLLTLDNLQKCVFSFDSNCQE
KPSEYIAAILELSALVSKRNQQLFLYMDFLYYLTSDGQRFRNACRLVHDFTDAVIQERRRTLPKENI
DDFLKAKAKTKTLDFIDVLLLTK
DEDGKGLSDEDIRAEADTFMFE
GHDTTASGLSWILYNLAKHPEYQERCRQEVQDLLRDRESKEIEW
DDLAQLPFLTMCIKESLRLHPPVTSISRRCTQDIVLPDGRVIPK
GVVCLIDIFGTHHNPSVWQDPE
VYDPFRFDPENIKGRSPLAFIPFSAGPR
NCIGQTFAMTEMKVILALTLLRFRVLPDKEPCRKPELILRTEGGLWLRVEPLSAGQQ*
CYP4F49
XM_614912 CK959664 CN793709
AAFC02090500.1 80% to 4F11 XM_583556
MLELSLSWLGLGPVAASPWLLLLFTGASWLLARILAWTYTFYNNSRRLQCFPQPPKRNWFLGHLGL
VPPTEQGMSKLTELVAKYSQGFRIWMGPITPIIVFCHPDLIRIVANAS
AAVAPKDVIFYEVLKPWLG
DGLLLSAGDKWSRHRRMLTPAFHFNILKPYMKIFTKSTDIMH
AKWQHLIKEGHTHLDMFEHISLMTLDSLQKCIFSYDSNCQE
KPSEYIAAILELSALVAKRHQEIFLHMDFLYYLTPDGRRFRRACRLVHDFTDAVIQERHRTLPSESI
DDFLKAKAKTKTLDFIDVLLLTK
DEDGKGLSDEDIRAEADTFMFE
GHDTTASGLSWILYNLAKHKEYQERCRQEVQELLKDREPKNIEW
DDLAQLPFLTMCIKESLRLHPPVSVISRRCTQDTVLPDGRVIPK
GVICLISIFGTHHNPSVWPDPE
VFDPFRFDPENIKGRSPVAFVPFSVGPR
NCIGQTFAMTEMKVVLALTLLRFRVLPDKEEPRRKPELILRAEGGLWLRVEPLSTGQQ*
CYP4F50
XM_583211 XM_583616
AAFC02021075.1 AAFC02239800.1 CK772605.1 CK960170.1
MLELSLSWLGLGPVAASPWLLLLLVGASWILARILAWIYAFYDNCCRLRCFPQPPKRSWFWGHLGL
AQSNEESMRLVEELGHYFRDVHLWWMGPFFPILRLVHPNFVAPLLQAS
ATIIPKDMFFYSFLKPWLG
DGLLLSAGDKWSSHRRLLTPAFHFEILKPYMKIFNKSADIMH
AKWQRLALEGSTRLDMFEHISLMTLDSLQKCVFSYDSNCQE
KPSEYIAAILELSALVMKRIKHIFLHVDFLYYLTRDGQRFYRACRLVHDFTDAIIQKRRRTLISQGS
QEFLKTKTKAKTLDFIDVLLLAK
DEDGKGLPDEDIRAEADTFMFE
GHDTTASGLSWILYNLAKHPEYQERCRQEVQELLRDREPKEIEW
EDLAQLPFLTMCIKESLRLHPPVAVISRLCTHDVVLPDGRVIPK
GNICVISIFGIHHNPSVWPDPE
VFNPFRFDPEAPKRSPLAFIPFSAGPR
NCIGQTFAMNEMKVALALTLLRFRILPDEEEPRRKPELILRAEGGLWLQVEPLN
TGQ*
CYP4V2
ortholog
TC207398 TC207399 TC207400
TC189439 CO895969.1 EST
MLAPWLLSVGPKLLLWSGLC
AVSLAGATLTLNLLKMVASYARKWRQMRPVPTIGDPYPLVGHALMMKSDARDFFQQIIDFTEECR
HLPLLKLWLGPVPLVALYNAETVEVILSSSKHIEKSYMYKFLEPWLGLGLLTSTGNKW
RSRRKMLTPTFHFTILEDFLDV
MNEQANILVTKLEKHVNQEAFNCFFYVTLCTLDIICETAMGKNIGAQRNDDSEYVRAVYR
MSDSIHQRMKMPWLWLDLIFYMFKNGREHRRSLKIVHDFTNNVITERANEMKRHEEGTSN
DKEKDFPPRKTKCRAFLDLLLNVT
DDQGNKLSHEDIREEVDTFMFEGHDTTAAAINWSLYLLGWYPEVQQKVDTELEEVFGKS
DRPVTLEDLKKLKYLDCVIKESLRLFPSVPFFARNLTEDCEVAGHKMVQGCQVIIVPYAL
GSSRDPKYFPDPE
EFKPERVFPENLKGRHTYAYVPFSAGPRNCIGQKFAIMEEKTILSCILRHFWVESNQKRE
ELGLAGELILRPSNGIWIKLKRRNTDES*
CYP5A1
TC191301 76% to CYP5A1
Length = 1926
94
MEVLGLFRLEVSGPMVTVALSVVFLALLKWYSTSAFSRLEKLGIRHPKPSPFIGNLAFFRQ 276
277 GFWESHMELRKQYGPLSGYYLGRLMFIVISEPDMIEQVLVEKFSNFTNRMAT 432
433 GLEPKPVADSVLFLRDKRWEEVRSVLTVAFSPEKLSEMTPLISRACDVLLAHLERH
600
601 AQSGEAFDIQKTYCCYTTDVVASVAFGTEVNSQEAPEHPF 720
VEHCRRFFASSIPKPLLVLLLSFPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQ
AAEERRRDFLQMVQDVRHSAATVGVENFDIVRQVFSATKCPANPPRRHSPRPLSKP
1069 LSVDEVVGQAFIFLIAGYEIVTNTLSFATYLLATNPECQEKLLEEVDCFSKEHLAPEYCS
1248
1249
LQEGLPYLDMVIKETLRMYPPAFRFTRVAAQDCEVLGQRIPAGAVLETAVGALHYDPE 1422
1423
HWPNPENFNPERFTAEAQQRRRPYTYLPFGAGPRSCLGVRLGLLELKLTLLHILRKFR 1596
1597
FEACPETQVPLQLESKSALGPKNGVYIRIVSR*
CYP7A1
just one gene CB421103 92%
to 7A1 Length = 707
MMSLSLIWGIVIAVCCCLYLLGMRRR
(2) 5259
QMGEPPLENGLIPYLGCALQFGANPLEFLRANQRKHGHVFTCRLMGNYVHFITNPLSYH
3625
KVLCHGKYFDWKKFHFTASAK (0)
AFGHRSIDPSDGNTTDTISKTIIKTLQGDALSSLTEAMMGNLQLVLRPQGPPQPPTPTW
2193
VTEGMYSFCYRVMFEAGYLTLFGRDLAGQDAQKALILNSLDNFKQFDKIFPALVAGFPIH
2013
VFKTGHYAREKLTEGLRLQKFRERDHISELVRFLNDTFATLDDTERAKSLLAVLWAS
1842
QANTIPATFWSLFQMIR
RNPEAMKAATEEVNKTLENAGQKVSFEDSPIHLNQTQLDNMPVL
(1)
60
DSIIKESLRLSSASLNIRTAKEDFTLHLQDGSYNIRKDDIIALYPQLMHLDPEIYP 239
240 DPLTFKYDRYLDENGKTKTTFYSNGLKLKYYYMPFGSGVTICPGRLFAVQEIKQFLILML
419
420
SYFELELVESCVKCPPLDQSRAGLGILPPLYDTEFRYKFKHS* 548
CYP7B
just one gene
MNAGMGGFWPEPWLTRSLGYPGLALAAALLLLVLCLNARRTR
(2)
RPGEPPLIKGWLPYFGQALKLQKDPLGFMTSLQKQYGDIFTLLLG
(1)
GKYITFILDPFHYTSVAKNQKLSFQIFTNKFLKRVFSIKKMLTDSDLIDEIHSTYQ
56
3
FLQGKHLDILMESTMQNLKQVFEPQLLKTTSWSTEYLLPFCNSVIFEMTFTTIYGNILAD 182
183
DKKTFITELKDDFLKFDEKFTRLASGIPIELLGNIKSVRTKLIKDLTIESLAKLQGMSEV 362
363 VQRRNDILEKYYTPKDTEIG
(1)
AHHLGLLWASVTNTVPTMFWAMYYLLRNPKAMAVLRDEIDHLLQSTGQKKGPGFSIYLTR
EQLDSLVYL (1)
ESTILEVLRLCSFSGIFRFVQEDLTLHLESQDCCLRKGDFVVIFPPILHHDPEIFEAPD
EFRFDRFTENGKKKTTFFKRGKKLKYYHLPFGLGVSKCP
GRFLAMVEIKQLLVVLLTYFDLEIIDNKPLELNYSRFLFGIPYPDSDVLFRYKIKS
CYP8A1
TC208596 = D30718 89% to
8A1 Length = 2495
30
MSWAVVFGLLAALLLLLLLTRRRTRRPGEPPLDLGSIPWLGHALEFGKDAAGFLTRMKEK 209
210
HGDIFTVLVGGRHVTVLLDPHSYDAVVWEPRSRLDFHAYAVFLMERIFDVQLPHYNPGDE 389
390
KSKMKPTLLHKELQALTDAMYTNLRTVLLGDTVEAGSGWHEMGLLEFSYGFLLRAGYLTQ 569
570
YGVEAPPHTQESQAQDRVHSADVFHTFRQLDLLLPKLARGSLSAGDKDRVGKVKGRLWKL 749
750
LSPTRLASRAHRSRWLESYLLHLEEMGVSEEMQARALVLQLWATQGNMGPAAFWLLLFLL 929
930
KNPEALAAVRGELETVLLGAEQPISQMTTLPQKVLDSMPVLDSVLSESLRLTAAPFITRE 1109
1110
VVADLALPMADGREFSLRRGDRLLLFPFLSPQKDPEIYTDPEVFKYNRFLNPDGSEKKDF 1289
1290
YKDGKRLKNYSLPWGAGHNQCLGKGYAVNSIKQFVFLVLTQFDLELITPDVDIPEFDLSR 1469
1470 YGFGLMQPEHDVPVRYRIRP*
1532
CYP8B1
just one gene
MVLWGPALGALLVAIVGYLCLRGLLRQRRPKEPPLDKGPVPWLGHAMAFRKNMFEFLRHM
QAKHGDIFTVQLGGQYFTFVMDPLSFGPILKDAQRKLDFVEYAQKLVLKVFGYRSVQGDY
RMIHSASTKHLMGEGLEELNKVMLDTLSLVMLGPIGPSLGTHHWREDGLFHFCYNILFKA
GYLSLFGYTKDKEQDLLQAEELFLEFRKYDRLFPRFVYSLLGPREWLEVGRLQRLFHKEL
SVEHNLEKYGVSNWITYMLQFLREQGVSPAMQDKFNFMMLWASQGNTGPTSFWALLFLLK
HPEAMRAVREEATRLLGEVRLEAKQSFKFGALHHMPVLDSVMEETLRLRASPTLLRVV
NDDQILQMASGQEYMLRNGDILALFPYLSVHMDPDIXXXXXXXXXXXXXXXXXLL
KAGKKIRHYTMPWGSGVSICPGRFLALNEMKLFILLMVMNFDLELVDPATPVPPVDPQRW
GFGTTQPSHEVRFRYRLRPAE
CYP11A1
TC189227 78% to 11A1 Length
= 1959
MLARGLPLRSALVKACPPI
LSTVGEGWGHHRVGTGEGAGISTKTPRPYSEIPSPGDNGWLNLYHFWREKGSQRIHFRHI
ENFQKYGPIYREKLGNLESVYIIHPEDVAHLFKFEGSYPERYDIPPWLAYHRYYQKPIGV
LFKKSGTWKKDRVVLNTEVMAPEAIKNFIPLLNPVSQDFVSLLHKRIKQQGSGKFVGDIK
EDLFHFAFESITNVMFGERLGMLEETVNPEAQKFIDAVYKMFHTSVPLLNVPPELYRLFR
TKTWRDHVAAWDTIFNKAEKYTEIFYQDLRRKTEFRNYPGILYCLLKSEKMLLEDVKANI
TEMLAGGVNTTSMTLQWHLYEMARSLNVQEMLREEVLNARRQAEGDISKMLQMVPLLKAS
IKETLRLHPISVTLQRYPESDLVLQDYLIPAKTLVQVAIYAMGRDPAFFSSPDKFDPTRW
LSKDKDLIHFRNLGFGWGVRQCVGRRIAELEMTLFLIHILENFKVEMQHIGDVDTIFNLI
LTPDKPIFLVFRPFNQDPPQA*
CYP11B1
NM_174638
MALWAKARVRMAGPWLSLHEARLLGTRGAAAPKAVLPFEAMPRC
PGNKWMRMLQIWKEQSSENMHLDMHQTFQELGPIFRYDVGGRHMVFVMLPEDVERLQQ
ADSHHPQRMILEPWLAYRQARGHKCGVFLLNGPQWRLDRLRLNPDVLSLPALQKYTPL
VDGVARDFSQTLKARVLQNARGSLTLHIAPSVFRYTIEASTLVLYGERLGLLTQQPNP
DSLNFIHALEAMLKSTVQLMFVPRRLSRWMSTNMWREHFEAWDYIFQYANRAIQRIYQ
ELALGHPWHYSGIVAELLMRADMTLDTIKANTIDLTAGSVDTTAFPLLMTLFELARNP
EVQQAVRQESLVAEARISENPQRAITELPLLRAALKETLRLYPVGITLEREVSSDLVL
QNYHIPAGTLVKVLLYSLGRNPAVFARPESYHPQRWLDRQGSGSRFPHLAFGFGVRQC
LGRRVAEVEMLLLLHHVLKNFLVETLEQEDIKMVYRFILMPSTLPLFTFRAIQ
CYP11B2
MALWAKARVRMAGPWLSLHEARLLGTRGAAAPKAVLPFEAMPRCPGNKWMRMLQIWKE
QSSENMHLDMHQTFQELGPIFRYDVGGRHMVFVMLPEDVERLQQADSHHPQRMILEPWLA
YRQARGHKCGVFLLNGPQWRLDRLRLNPDVLSLPALQKYTPLVDGVARDFSQTLKARVLQ
NARGSLTLDIAPSVFRYTIEASTLVLYGERLGLLTQQPNPDSLNFIHALEAMLKSTVQLM
FVPRRLSRWMSTNMWREHFEAWDYIFQYANRAIQRIYQELALGHPWHYSGIVAELLMRAD
MTLDTIKANTIDLTAGSVDTTAFPLLMTLFELARNPEVQQAVRQESLVAEARISENPQRA
ITELPLLRAALKETLRLYPVGITLEREVSSDLVLQNYHIPAGTLVKVLLYSLGRNPAVFA
RPESYHPQRWLDRQGSGSRFPHLAFGFGVRQCLGRRVAEVEMLLLLHHVLKNFLVETLEQ
EDIKMVYRFILMPSTLPLFTFRAIQ*
CYP17A1
TC190136 71% to Steroid
17alphahydroxylase/17,20 lyase Length = 1725
MWLLLAVFLLTLAYLFWPKTKHS
116
GAKYPRSLPSLPLVGSLPFLPRRGQQHKNFFKLQEKYGPIYSFRLGSKTTVMIGHHQLAR 295
296 EVLLKKGKEFSGRPKVATLDILSDNQKGIAFADHGAHWQLHRKLALNAFALFKDG
460
461 NLKLEKIINQEANVLCDFLATQHGEAIDLSEPLSLAVTNIISFICFNFSFKNEDP
625
626 ALKAIQNVNDGILEVLSKEVLLDIFPVLKIFPSKAMEKMKGCVQTRNELLNEILEKCQEN
805
806
FSSDSITNLLHILIQAKVNADNNNAGPDQDSKLLSNRHMLATIGDIFGAGVETTTSVIK 982
983
WIVAYLLHHPSLKKRIQDDIDQIIGFNRTPTISDRNRLVLLEATIREVLRIRPVAPTLIP 1162
1163
HKAVIDSSIGDLTIDKGTDVVVNLWALHHSEKEWQHPDLFMPERFLDPTGTQLISPSL 1336
1337
SYLPFGAGPRSCVGEMLARQELFLFMSRLLQRFNLEIP 1450
DDGKLPSLEGHASLVLQIKPFKVKIEVRQAWKEAQAEGSTP*
CYP19A1
TC194849 88% to 19A1 Length
= 2057
MLLEVLNPRHYNVTSMVSEVVPIASIAILLLTGFLLLVWNYEDTSSIPGPSYFLGIGPLI
SHCRFLWMGIGSACNYYNKMYGEFMRVWVCGEETLIISKSSSMFHVMKHSHYISRFGSKL
GLQFIGMHEKGIIFNNNPALWKAVRPFFTKALSGPGLVRMVTICADSITKHLDRLEEVCN
DLGYVDVLTLMRRIMLDTSNMLFLGIPLDESAIVVKIQGYFDAWQALLLKPDIFFKISWLCRKYEKSV
927 KDLKDAMEILIEEKRHRISTAEKLEDSIDFATELIFAEKRGELTRENVNQC 1079
1080
ILEMLIAAPDTMSVSVFFMLFLIAKHPQVEEAIIREIQTVVGERDIRIDDMQKLKVVEN 1256
1257
FINESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNLGRMHRLEFFPKPNEFTLE 1430
1431
NFAKNVPYRYFQPFGFGPRACAGKYITMVMMKVVLVTLLRR 1553
FHVQTLQGRCVEKMQKKNDLSLHPDETRDRLEMIFTPRNSDKCLER*
CYP20A1
NM_001015644.1
MLDFAIFAVTFLLALVGAVLYLYPASRQAAGIPGITPTEEKDGN
LPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTLDPFET
MLKSLLRYQSDSGNVSENHMRKKLYENGVTNCLRSNFALLIKLSEELLDKWLSYPESQ
HVPLCQHMLGFAMKSVTQMVMGSTFEDEQEVIRFQKNHGTVWSEIGKGFLDGSLDKST
TRKKQYEDALMQLESILKKIIKERKGRNFSQHIFIDSLVQGNLNDQQILEDTMIFSLA
SCMITAKLCTWAVCFLTTYEEIQKKLYEEIDQVLGKGPITSEKIEELRYCRQVLCETV
RTAKLTPVSARLQDIEGKIDKFIIPRETLVLYALGVVLQDPGTWSSPYKFDPERFDDE
SVMKTFSLLGFSGTRECPELRFAYMVTAVLLSVLLRRLHLLSVEGQVIETKYELVTSS
KEEAWITVSKRY
CYP21A1
TC209496 79% to CYP21A2
Length = 2158
MVLAGLLLLLTLLSGAHLLWGRWKLRNLHLP
234
PLVPGFLHLLQPNLPIHLLSLTQKLGPVYRLRLGLQEVVVLNSKRTIEEAMIRKWVDFAG 413
414 RPQIPSYKLVSQRCQDISLGDYSLLWKAHKKLTRSALLLGTRSSMEPWVD 563
564 QLTQEFCERMRVQAGAPVTIQKEFSLLTCSIICYLTFGNKEDTLVHAFHDCVQDLMKT
737
738 WDHWSIQILDMVPFLRFFPNPGLWRLKQAIENRDHMVEKQLTRHKESMVAGQWRD
902
903
MTDYMLQGVGRQRVEEGPGQLLEGHVHMSVVDLFIGGTETTASTLSWAVAFLLHHPEIQ 1079
1080
RRLQEELDRELGPGASCSRVTYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSIF 1259
1260
GYDIPEGMVVIPNLQGAHLDETVWEQPHEFRPDRFLEPGANPSALAFGCGARVC 1421
1422
LGESLARLELFVVLLRLLQAFTLLPPPVGALPSLQPDPYCGVNLKVQPFQVRLQ 1583
PRGVEAGAWESASAQ*
CYP24A1
86% to human
6035
MSSPISKSRSLAAFLQQLRSLGQPPRPVTSTACSPRRPREVPLCPVMEPGEAQDAATLP 6211
6212
GPTKWPLLGSLLEILWKGGLKKQHDTL (0)
6511 (0)
VEYHKKYGKIFRMKLGSFDSVHLGSPCLLEALYRTESAHPQRLEIKPWKAYRDYREEGYGLLIL(2) 6702
7484
(2)EGEDWQRVRSAFPKKLMKPVEIMKLDDKINE(0) 7573
4559 (0)
VLADFMGRIDELCDERGRIEDLYTELNKWSFE (1)
2100 (1)
SICLVLYEKRFGLLQKNAGEEALNFIMAVKT (0)
1035 (0)
MMSMFGKMMVTPVELHRSLNTRVWQAHTQAWDTIFRS (1) 925
1303(1)
VKSCVDNRLEKYSEQPSMDFLCDIYHHNQLSKKELYAAVTELQLAAVET (0) 1449
TANSLMWILYNLSRNPHVQQKLFKEIQSVLPENQLPRAEDLRNMPYLKACLKESMR
LNPTVPFTTRTLDKAMVLGEYALPKG
5393
TVLVLNTHVLGSSEENFEESSQFRPERWLQDKKNLNPFAHLPFGVGKRMCVGRRLAELQL 5572
5573 HLALCW (0) 5602
IVRKYDVVATDLEPVETLHLGNLVPGRQLPVAFCQR
CYP26A1
TC196927 98% to 26A1 Length
= 654 C-term AAFC02074175.1
MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSSRDRSCALPL
PPGTMGFPFFGETLQMVLQREKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILL
GEHRLVSVHWPASVRTILGSGCLSNLHDSSHKQRKK
(0)
VIMQAFSREALQCYVPVIAEEVGNYLEQWLSCGERGLLVYPQVKRLMFRIAMRILLGCE
SRLASGGEDEQQLVEAFEEMTRNLFSLPIDVPFSGLYR
GLKARDLIHARIEENIRAKIRRLPAAEAGGGCKDALQLLIEHSWERGERLDMQ
ALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREELKNK
GLLCKGNQDNKLDVEI
LEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVADIFT
182
183
NKEEFNPDRFLLPHPEDASRFSFIPFGG
GLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDDLPARFTRFQGEI*
449
CYP26B1
TC200808 95% to 26B1 Length
= 762
MLFEGLELVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQ
GSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRML
LGPNTVSNSIGDIHRNKR
VFSKIFSHEALESYMPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLG
FSLPVDLPFSGYRRGIQARQTLQKG
77 LEKAIREKLQCTQGKDYSDALDIFIESSKEHGKEMTMQELKDGTLELIFAAYAT
238
239
TASASTSLIMQLLKHPAVLEKLREELRAKGLLHSGGCPCEGTLRLDTLSGLHYLDCVIKE 418
419
VMRLFTPVSGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFG 592
593 QARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVL 700
AVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNKILPETEAMLSATV*
CYP26C1
BE749195 CC773722 95% to
26C1 Length = 465 N-term
67
MLPWGLSCLSALGAVGTALLGAGLLLSLAQHLWTLRWTLSRDRASALPLPKGSMGWPFFGETLHWLVQ (0)
GSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTVLLGEHRLVRSQWPQSAHILLGSHTLLGAVGESHRQRRK
(0)
5955
ILARAFSRAALECYVPRLQRALRREVRSWCAARGPVAVYEAAKALTFRMAARILLGLR 6134
6135
LDEAQCSELARTFEQFVENLFSLPLDVPFSGLRK (0) 6251
GIRARDQLHRHLEEAIAQKLLEDKTAVEPGDALDGIIHSTRELGHELSVQELK
(0)
46
ESAVELLFAAFSTTASASTSLVLLLLQHPAAIAKIQQELAAQGLGRACGCAPAASGGG 225
226
AGPPPDCGCEPDLSLAALGHLRYVGCVVKEVLRLLPPVSGGYRTALRTFELD (0) 381
GYQIPKGWNVMYSIRDTHETAAVYRSPPEGF
DPERFGTAGDDALGAAGRFHYIPFGGGARSCLGQELAQTVLQLLAVELVRTARWELATPA
2322
2321
FPAMQTVPIVHPVDGLRLFFHPLASPAAQDGQRS*
CYP27A1
TC206580 82% to 27A1 Length
= 1614 missing about 58aa at N-term
MNRMGALGSARLRWALLGRRAALPGLGSFGARAKAAIPSALPAAQAAEAPGTGPGDRRLRSLD
ELSGPGQLRLLFQLLVQGYVLHLHQLQVLNKAKYGPIWINRVGPQMHVHLASAPLLEQVM
RQEGKYPVRDDMKLWKEHRDQQGLSYGPFTTMGEQWYRLRQTLNQRMLKPAEAALYTDAL
NEVINDFMDQLKQLRAESASGDHVPDIAHQFYFFALEAISYILFEKRIGCLERSIPKDTE
TFVRSVGLMFHNSLFVTFLPTWTRPLLPFWKRYLDGWNTIFSFGKKLIDQKLEEIEAQLK
TENPEKTQISGYLHFLLTSG
QLSPREAEGSLPELLLAGVDTTSNTLTWALYHLSKNPEIQAALHKEVVGVVPAGQVPQHK
961
DLARMPLLKAVLKETLRLYPVVPVNSRVVVDKEIEVGGFLFPKNTQFVLCHYVISRDPDI
1141
YPEPDSFQPQRWLRKNQPDALKTQHPFGSVPFGYGVRACLGRRIAELEMQLLLTRLIQHYE
1324
VVLAPETGEVTSVARIVLVPNKKVGLRFLQRQS*
CYP27B1
MTQTLKFASRVFHRVRCPELGASLGSRGSESAPRVLADIPGPSTPGFLAELFCKGGLSRLHELQ
(0)
VQGAARFGPVWLASFGTVRTVYLAAPTLVEQLLRQEGPRPERCSFSPWTEHRRRRQRACGLLTA
(2)
EGEEWQRLRSLLAPLLLRPQAAARYAGTLHGVVRDLVRRLRRQRGLGAGPPSLVRDVAGEFYKFGLE
(1)
GIAAVLLGSRLGCLEAEVPPDTETFIRAVGSVFVSTLLTMAMPSWLHRVVPGPWDRLCRDWDQMFAF
(1)
AQQHVEQREAEVAMRNQSEKSEEDMGPGAHLTYFLLQKELPAASILGNVTELLLAGVDT
(0)
VSNTLSWALYELSRHPEIQTALHAEITAALGPGSSTQPSATALSQLPLLKAVVKEVLR
(2)
LYPVVPGNSRVPDRDICVGEYIIPKN
(0)
TLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPAPHPFASLPFGFGKRSCVGRRLAELELQMALAQ
(0)
ILIHFEVQPEPGSAPVRPMTRTVLVPERSINLQFVDR*
CYP27C1
BE723057 86% to 27C1 Length
= 519 DN538841.1 AAFC02145184.1
MALLARILKARPSPAAGRTGLLGGWDAGQPRLASSGLSARARAEDKGAGRPGASQAGGR
AEGPRSLAAMPGPRTLANLVEFFGKDGFSRIHEIQ
297
QKHTREYGKIFKPHFGPQFVVSVADRDLVAQVLRAEGASPQRANMGSWQEYRDLRXRS 476
477 TGLISA 494
EGEQWLKMRSVLRQRILKPKDVAIFAGEINQVIADLIKRIYFLKSQAEDGDTVTNINDLFFKYSME
GVATILYESRLGCLGNSIPQLTADYIEALALMFCTFKTSMYAGAIPRWLRLLIPKPWQEFCRSWDGLFEFS
QIHVDNKLRDIRCQMERGERVRGGLLTYLFLSQELTLEEIYANMTEMLLAGVDT
TSFTLSWAVYLLARHPEVQQALYREIVRNLGERHVPTAADVPKVPLVRALLKETLR
LFPVLPGNGRVTQEDLIVGGYLIPRG
TQLALCHYATSYEDENFPRAKEFRPERWLRQGNLRRVDNFGSIPFGYGARSCIGRRIAELEIHLLVIQ
LLQHFEIKTSPWTKTVHAKTHGLLMPGEPIHVRFVNRK
CYP39A1
TC197782 CB448362 79% to
39A1 Length = 911
192
MEFISPTVIIILSCVAVLLFLQWKNLRRPPCIRGWIPWIGAGFEFGKTPLEFIEKARIK
YGPVFTVIVMGTRMTFVTEEEGINVFLKSKEINFELAVQNPVYHT
ASIAKNIFLKLHEKLYITVKGKMGIFNLYKFTGQLTEELQEQLQNLGTHGTTDLNKFMR
HLLYPVTVNILFKKGLFPTDERKIREFHQHFQAYDEGFEYGSQLPECLLR
NWSKSKKWLLALFEKNIPDIKTHKSAKENYP
TVMQAVLDLLEMEANEQKSPNYGLLLLWASLSHTVP
VAFWTFAFVLSHPNIHRTIMEGISSVFGTAG
KDKIKVSEDDLKKLPLIKWCILETIRLRAPGVIARKVLKPVKIL
DYTVPSGDLLMLSPFWLHRNPKYFPEPDLFKPERW
KEANLEKHAFLDCFMAFGSGKYQCPGRW
LALLEIQICIILIFYYYDCSLLDPLPKQ
SSLHLVGVQQPEGRCRIQFKQRK*
CYP46A1
AV663682 94% to CYP46
Length = 506 AV663681 CC523468 Bac end C-term
MSPGLLMLLGSAVLVVFGLCCTFVHRARSRYEHIPGPPRPS
(2)
FLLGHLPYFWKKDEVCGRVLQDVFLDW
(2)
AKKYGPVVRVNVFHKTSVIVTSPESVK
(0)
KFLMSTKYNKDSKMYHAIQTVFGER
(1)
LFGQGLVSECXYERWHKQRRIMDLAFSRSSLVGLMGTFNEKAEQLVEILEAQ
198
199
ADGQTPVSMQDMLTCATMDILAKAAFGMETSMLLGAQKPLSRKVKLILEGISASRNTLAK 378
379
FMPGKWKQLXETRESVRFLRQVGKEWVXRXRXALQRGEDVPADILTQILKAEEGAQD
367
DEILLDNFVTFFIAGHETSANHLAFTVMELSRQPEILARLQAEVDEVIGSKRHLDCEDLG 188
187 RLQYLSQVLKESLR 146
LYPPAWGTFRLLEEETLIDGVRVPGNTPLL
FSTYVMGRMDTYFEDPLTNPDRFGPGAPK
PKFTYFPFSLGPRSCIGQQFAQ
MEVKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLQPRGWQPAPPPPPC
CYP51A1
AC108175.2 CYP51 93% to
human
22188
MLDLLQAGGSVLGQAMEQVTGGNLASMLLIACAFTLSLVYLFRLAVGHLAPPLPTGA (0) 22018
20194
KSPPYIVSPIPFLGHAIAFGKSPIEFLEDAYEK (0) 20096
17795
YGPVFSFTMVGKTFTYLLGSEAAALLFNSKNEDLNAEEVYSRLTTPVFGKGVAYDVPNT 17619
16395
VFLEQKKMLKSGLNIAHFRQHVSIIEKETKEYFKSWGESGEK 16270
14644
NLFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFR 14468
13586
RRDRAHREIKNIFYKAIQKRRESGEKIDDILQTLLESTYK 13467
13076
DGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCFLEQKTVCGENLPPLTYDQ (0) 12882
11489 LKDLNLLDRCIKETLRLRPPIMTMMRLAKTP
11397
10597
QTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLEDSPASGEKFAYVPFGA (1?) 10427
7020
GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPEKPIIRYKRRSK* 6841