Bos taurus Cytochrome P450s

 

Newly updated by David Drane (Ridgeway High School, Memphis)

Last revised July 21, 2005

 

All cytochrome P450 genes that could be assembled from EST data,

mRNA sequences in the nr section of GenBank, and WGS (whole genome

shotgun sequences) at NCBI were assembled and compared to the human

P450 collection.

 

59 complete genes were assembled; this number includes three pseudogenes.

6 partial genes are lacking small regions, and there are several other small

pieces and a few shorter pseudogenes.

 

There are a minimum of 62 CYP genes in the cow. 

 

The old Bos taurus sequence page has been preserved here.

 

 

CYP1 Family

 

CYP1A1

AB060696.1 = NP338778, TC208445 XM_588298 CC766779.1 Bac end

MFSVFGLPIPISATELLLASAVFCL

VFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLMLGKNPHVVLSQLSQRYGDVLQIRIG

CTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLITNGQSMTFNPDSGPVWAARRRL

AQNALKSFSTASDPASSSSCYLEEHVNKEAKYLLGKFQELMSGPGRFDPYRYIVVSVA

NVICAICFGRRYDHNDQEFLSLVNLSNEFGEITASGNPSDFIPVLRYLPNTALDLFKD

LNQRFYVFVQKIVKEHYKTFEKGHIRDITDSLIEHCQDKRLDENANIQLSDEKIINVV

IDLFGAGFDTVTTALSWSLLYLVTSPRVQKKIQEELDTVIGRARRPRLSDRPQLPYLE

AFILETFRHSSFVPFTIPHSTTRDSNLNGFYIPKGRCVFVNQWQINHDQKLWEDPSEF

RPERFLTADGTINKVLSEKVIIFGLGKRKCIGETIARLEVFLFLAILLHQVEFCVTPG

VKVDMTPVYGLTMKYARCEHFQAHMRS

 

CYP1A2

82% to human 1A2

Database location  : AAFC01721680   742 to 1116 (-)

Genomic location   : SCAFFOLD261464   3 to  377 (+)

trace file gnl|ti|669459617

MALSQLSPFSAMELLLASAIFCLVFWVVRTWRPRVPQGLKSPPEPWGWPLLGHMLTLG

KNPHVVLSQLSQRYGDVLQIRIGCTPVLVLSGLDTVRQALVRQGDDFKGRPDLYSFTLVT

DGQSMTFNPDSGPVWAARRRLAQNALNTFSVASD

PSSSSSCYLEDHVSKEAEALLGKFQELMSGPGRFDPYGHVVASV

ANVIGAMCFGQHFPQSSKEMLSLVESSHDFVESASSGNPVDFFPILKYLPNPALQRFK

SFNQRFLQFVRKTVQEHYQDFDKNSIQDIIGALFKHSEDNSRASSRLISQEKTVNLVN

DLFAAGFDTITTAISWSLMYLVTNPKIQRKIQEELD

RVVGRARRPRLSDRPQLPYLES

FILETFRHSSFVPFTIPHSTTRDTTLNGFFIPKERCVFINQWQVNHDPKLWGDPSVFR

PERFLTSDGTTIDKTASEKVLLFGMGKRRCIGEVMARWEVFLFLAILLQRLEFSVPPG

VKVDLTPTYGLTMKHARCEHMQARLRFPIK

 

CYP1B1

82% to human 1B1

MATGLSPDDHLSPTLLSVQQTMLLLLLSVLAAVHVGQWLLRQRRRQPGSAPPGPFAWPLI

GNAASMGSAPHLLFARLARRYGDVFQIHLGSCRVVVLNGERAIRQALVHQSAAFADRPPF

ASFRLVSGGRSLAFGQYSESWKAQRRAAHSTMRAFSTRQPRGRRVLEGHVVGEVRELVEL

LVRRSAGGAFLDPRPLTLVAVANVMSALCFGCRYSHDDAEFLELLSHNEEFGRTVGAGSL

VDVLPWLQRFPNPVRTAFREFEQLNRNFSNFVLDKFLRHRESLRPGAAPRDMMDAFIHSA

GADSGDGGPRLDVDYVPATVTDIFGASQDTLSTALQWLLVLFTR (2)

YSEVQARVQAELDQVVGRHRLPTLEDQPRLPYVMAFLYEAMRFSSFVPVTIPHATTANAS

VLGYHIPKDTVVFVNQWSVNHDPVKWSNPEDFDPTRFLDKDGLINKDLTGSVMVFSVGKR

RCIGEEISKMQLFLFISILAHQCNFKANPDEPSKMDFNYGLTIKPKSFKINVTLRESMEL

LDSAVQKLQVEKECQ*

 

CYP1A8P ortholog (probably a new subfamily since it is functional in opossum)

68% to 1A8P 40% to CYP1A1

BM25891 SCAFFOLD 61062 SCAFFOLD 129297 SCAFFOLD 245449

AAFC02085398.1

AAFC02099349.1

MIFGMAVTSGEVTTSRIILVMVFVFVRELGNKGRKEVFPPGPWSLPIVENLLQLG

DHLYFTFMEMRKKYGDVFLIKLGMVPVLVVNGMEMVKEVLLRNGEHFAA*PNV

LTFSFLAQ*KSLTFS

NYGENWTLHKKIASNALRTFPKAETKSSTRSCLLEKHVIEEVSELVKV

FTELTSRSGSFEPRGAITCAMANVV

CTLCFGKRYDHSDEEFLRIVKTDHDLLKASSAANPADFIPYF*YLPLRIINAPQEFYHARNQ

FIALHIRDHLTT

CPQDHIQDITDALINACHNKYAVAKITILNDDEIISTVSDLVGAG

FEIISTCIYWSFLYLIYYPEIQVKIQEEI

DGNTGMKSPRFENRKILP

YTEAFINEIFRHTSFLPFTIPHC (2)

TTADTTLNGYFIPRKTCTFINMYQVNHDE (2)

TIWDNPNLLRPERFLNENRELNKNLIEKIFIFGMGIQKCL

REEVAQNEVFVFITTVLQQLTLKKCPVVKLDLTPTYGLVMKPKPYQLPAEPRSMGSSCS*

 

CYP2 Family

 

CYP2A13

90% to 2A13 86% to 2A7

CB434432.1 CB463229 TC193989

MLASGLLLVALLACLTIMVLMSVWRQRNLKGKLPPGPTPLPFIGNYLQLNTEQMCNSLMK

ISEHYGPVFTV

HLGTRQIVVLCGYDAVKEALVDQAEEFSGRGKQATFDWLFKGYGVAFSNGERAKQLRRFS

ITTLRDFGVGKRGIEERIQEEAGFLIEAFRGTRS

AFIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLRMMLGSFQFTATSTGQ

1 LYEMFYSVMKYLPGPQQQAFKELQGLEDFIAKKVEQNQRTLDPNSPRDFIDSFLIRMQEEKENPNTEFYRK 177

178 NLVMTTLNLFFAGTETVSTTMRDGFLLLMKHPDVEAKIHEEIDRVIGKNRQPKFEDRAKM 357

358 PYTEAVIHEIQRFGDMIPMGLARRVTKDTKFRDFLLPKGTEVFPMLGSVLRDPKFFSNPR 537

538 DFNPQHFLDEKGQFKKSDAFVPFSIGKRYCFGESLARMELFLFFTTIMQNFRFKSPQS 711

712 PQDINVSPKLVGFATIPPNYTMSFLPR*

 

CYP2B6

76% to 2B6 CN790156.1 CN787808.1 CB222090 TC193614

40 MELSMLLLFALLTGLLVLLARGRPKAHGRLPPGPRPLPFLGNLLQMDRKGLLKSFLR

FQQKYGDVFTVYLGPRPVVIICGTEAIREALVDQAEVFSGRAKIAVVDPIFQGY

GVIFANGERWKALRRFSLATMRDFGMGKRSVEERIQDEAQCLVEELRKSQ

GALQDPVFYFHSITANIICSIVFGKRFDYRDPEFLRLLELLFQSFVLISSLSSQ

LFELYSSFLKYFPGSHRQIYKNLQEINVFIGRSVEQHRETLDPNAPRDFIDCYLLRMEKDKSNPQSQFDHQN

LIMSVLSLFFAGTETTSTTLRYGFLLMLKYPHITERIQKEIDQVIGSYR

PALDDRAQMPYTDAVIHEIQRFADLIPIGVPHMVTKDTHFRGYILPK

GTEVYPVLSSALHESCYFEKPDDFNPDHFLDANGVVKKNDAFMPFSI

GKRICLGEGIARIELFLFFTTILQNFSVASPVAPEDIDLTPQESGVGNVPPNYRIQFLPRQRG*

 

CYP2B pseudogene fragment

SCAFFOLD 393933 stop codon same as in human 2B7

PALDDRAQMPYTDTVIHEIQRFADLISIGVSHMDAKDAHF*GYILPK
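
The in-frame stop in the fragment above is marked with "*". A minimal sketch of how such internal stops could be located in any translated sequence on this page follows. The function name internal_stops is hypothetical, and the sketch assumes that a "*" in the final position is a normal termination codon rather than a disabling mutation.

```python
def internal_stops(peptide: str) -> list[int]:
    """Return 1-based positions of '*' characters that are not the last residue.

    Assumption: a trailing '*' marks the normal end of the protein,
    so only earlier '*' characters are reported.
    """
    seq = "".join(peptide.split())  # drop any whitespace inside the string
    return [i + 1 for i, aa in enumerate(seq[:-1]) if aa == "*"]


# CYP2B pseudogene fragment from SCAFFOLD 393933, copied from the entry above
fragment = "PALDDRAQMPYTDTVIHEIQRFADLISIGVSHMDAKDAHF*GYILPK"
print(internal_stops(fragment))
```

For this fragment the single stop falls at residue 41 of the fragment as shown; the position is relative to the fragment, not to a full-length CYP2B protein.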

 

CYP2C85 missing exons 8,9

76% to 2C18 CB422177 AAFC02018840.1 TC198839 SCAFFOLD251527 SCAFFOLD131911

SCAFFOLD45421

Last two exons from XM_612374; see controversy below about exon 6

MDLPVVLVLCLCCLLLISLWKQSSGKGKLPPGPTPLPILGNILQLDVKDISKSVSN

LSKVYGPVFTLYFGMNPLVVLHGYEAVKEALIGLGEEFSGRGSCPVIQRASKGY

GVIFSNGKIWKETRRFSLMTLRDFGMGKRSMEDRVQQEACCLVEELRKTD

GLPCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ

LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ

EKHNQELEYTVENLAHTVLDLFVAGTETTSTTLRYGLLLLLKHPEVT

AKVQEEIDHVIGRHRSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK

GTGILTSLTSVLYDDKEFPNPEVFDPGHFLDESGNFRKSDHFMAFST

GKRICVGEGLARMELFLFLTTILQNFTLKSVVDPKDLDTTPVVNGLLSVPPFYQLCFIPV*

 

duplicate sequence on same contig AAFC02018840.1

EST CB422177.1 matches yellow region

PCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ

LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ

EKHNQELEYTVENLAHTVLDLFVAGTETTSTTLRYGLLLLLKHPE

TAKVQEEIDHVIGRHQSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK

 

CYP2C fragment C-term exons 8,9

CB421823 AAFC02018839 (exons 8,9) 88% to 2C87 81% to 2C18

Matches XM_612374 predicted from NW_616319

Same as 2C85 except yellow area in exon 6

EST CK949679.1 matches this yellow region; this EST is from 2C87.

There might be an assembly error joining an exon from 2C87 into the 2C85 gene,

or there might be two exon 6 sequences in one gene.

SCAFFOLD54892 matches exon 6

SCAFFOLD131911 matches exon 5

LPCDPTFILGCAPCNVICSIIFQNHFDYKDQIFLDLMERLNENARILGSPWIQ

LCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPRDFIDCFLTKMEQ

EKHNHQSEYTFENLTITVSDLFGAGTETTSTTLRYGLLLLLKHPEVT

AKVQEEIDHVIGRHRSPCMQDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKFRNYLIPK

GTGILTSLTSVLYDDKEFPNPEVFDPGHFLDESGNFRKSDHFMAFST

GKRICVGEGLARMELFLFLTTILQNFTLKSVVDPKDLDTTPVVNGLLSVPPFYQLCFIPV*

 

CYP2C86

58% to 2C19 CB428210 AAFC02035479.1 complete

MERLEITTLALVICVTCLVFLFVWKKSHKGLGKLPPGPTPLPIIGNLMQLNLKDIPASLSK

LAKQYGPVYTLHLGSQTTVVLHGYEVVKEALIDQGDEFLGRAHFPIIDDTQRGY

GLIFSNGDTWKQMRRFSSLMTLRDFGMGKRSLEERIQEEAQFLVEEFRKSE

AQPFNPAVTLSCATCNIICSILFNERFHYQDKTLHSLLDLLNENFNRISSLWNQ

IYNLWPKLIKPLPGEHRAFSKRLKDVHYFVLEKVKEHQKSLNHNNPRDYIDCFLSRMEQ

EKQNPESQFHLENLATCGSNLFSAGVETTTATLSYGFLLLMKYPEVQ

AKVHEEIDRVIGRTRSPCMKDKMKLPYTEAVLHEIQRYVTLVPSNLPHAVVQDTKFRQYVIPK

GTTVLPLLSSILYDCKEFPNPEKFDPGHFLDKNGSFRKTKYFVAFSI

GKRACVGEGLAQMELFLFFTTILQNFVLKPLGETKDIETKPIVIGLINMPPPFKLCLIPR*

 

CYP2C87

78% to 2C19

CN791444 CK951564 SCAFFOLD 53069 CN791418.1 CK949679.1 CB441113

MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNIFQLDVKNISKSLTS

LSKVYGPVFTVYFGMKPTVVLHGYEAVKEALIDLGEEFSRRGSFPVIERNVKGH

GIVFSNGKTWKETRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTN

GLPCDPTFILGCAPCNVICSIIFQNRFDYKDQTFLNLMKTINENIKILGSPWIQ

VLNIFPVLLDFFPWSYSYKKLYTNTAYVKNYVLEKTREHQASLDINNPRDFIDCFLIKMEQ

EKHNHQSEYTFENLTITVSDLFGAGTETTSTTLRYGLLLLLKHPEVT

AKIQEEIDRVIGRHRSPCMQDRTHMPYMDAVLHEIQRYIDLAPTSVPHAVNCDVKFRNYLIPK

GTDILTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFRKSDYFMAFSAGKRVCVGEGLA

RMELFLFLTTILQTFTLKSVVDPKDLDTTPAVTGIANVPPPYQLCFIPV*

 

CYP2C87-de2b fragment exon 2

6kb downstream of 2C87 without an intervening exon 1, same orientation

LSKVCGPVFTVYFGMKPTVVLHGYEALQEALIDLGEEFSGRYSFPVNEKTRRGH

 

CYP2C fragment exons 7,8,9

BZ845111 SCAFFOLD133344 XM_612392 predicted from NW_616354 81% to 2C89

AAFC02088219

Note exon 7 = same seq as 2C87 but exons 8,9 differ

Possible alternative splice variant of 2C87. EST CK949679.1 supports

exon 7 joining with exons 8 and 9 different from these,

so these exons may be skipped over.

Gene order might be 2C87 exons 1-7, then these exons 8,9,

followed by CYP2C87-de2b and the exons 8,9 of 2C87 above.

AKIQEEIDRVIGRHRSPCMQDRTHMPYMDAVLHEIQRYIDLAPTSVPHAVNCDVKFRNYLIPK

GTTILTSLTSVLHDGKEFPNPEQFDPGHFLDKSGNFKKSDYFMVFSA

GKRVCVGEGLARMELFLLLVSILQKFTLKPLVDPKNIDTAPLLKGVGSIPHFYEVCFIPV*

 

CYP2C88

78% to 2C19

CN786649 CK956312 CN787335.1 CN786468.1 CC503360

MDLAVVLVLCLSCLLLLSLWKQSSGKGKLPPGPTPLPILGNILQLDVKNISKSLTN

LSKVYGPVFTVYFGMKPIVVLHGYEAVKEALIDLGEEFSGRGMFPLAERANIVN

GILFSNGKTWKEIRRFSLMTLRNFGMGKRSIEDRVQEEACCLVEELRKTN

GLPCDPTFILGCAPCNVICSIIFQNRFDYKDPVFLDLMERLNEILRILSSPWVQ

VCNNFPALFDYLPGSHNKVLKNVANLKSFVLEKAMEHKASLDINNPRDYIDCFLIRMEQ

EKQNQQLEFTLENLTTTVFDLFGAGTETMSTTLRYGLLLLLKHPEVT

AKVQEEIDRVIGRHRSPCMQDRSHMPYTDAVVHEIQRYIDLVPSSLPHMVTHDIELRNYIIPK

GTGVLVSLTSVLYDDKVFPNPEMFDPGHFLDDSGNFKKSDHFMPFSA

GKRICAGESLARMEVFLFLTVILQKFTLKSVVDPKDIDTTPIANGFASVPPPYKLCFIPL

 

CYP2C89 missing exon 1 and part of exon 2

69% to 2C18 TC211258 CB531366 AY265992

     XXXXX

   3 XXXXXGPVFTLYFGMKPTVVLHGYEAVKQVLIDQSEEFSGRGSLPVADNINQGL

     GIVFSNGEIWKQTRRFSLMVLRNMGMGKRTIEHRIQEEALCLVEALKKTN

     GSPCDPTLLLSCAPCNVICSIIFRNRFEYNDERLLTLIKYFNENSRLVSTPWVE

     LYNTFPSLLHYFPGSHNTIFKNMTEQRKFILEEIKKHQESLDLNNPQDFIDYFLIKMEK

     EKHNKHSEFTMDNLITTVWDVFSAGTETTSLTLRYGLLLLLKHPEVT

     AKVQEEIDRVVGRNRSPCMQDKSCMPYTDAVLHEIQRYIDLVPSSMPHAATQDVKFREYLIPK

     GTVILTSLTSVLHDDNEFSNPGQFDPGHFLDESGNFKKTDHFMAFSA

     GKRVCVGEGLARMELFLLLVSILQHFTLKSVVDPKHIDTAPSFKGLISIPPFCEMCFIPV* 1292

 

CYP2C90 partial seq.  missing exon 1

CB222086 82% to 2C18 77% to 2C87 SCAFFOLD35528 AAFC02073759

DR713168

LSNTYGPVFTVYFGLRPTVVLHGYEAVKEALIDQGEEFSGRGNIPMSQRVNKGY

GIIFSNGKRWKEIRRFSLMTLRNFGMGKRSIEDRVQEEAHCLVEELRKTN

GSPCDPTFILGCAPCNVICSIIFQNRFDYTDQNFLNLLDKFNENLQVVSSPWMQ

VCNTFPILIDYFPGSHNKLFKNFAYIRSYVLEKVKEHQATLDINNPRDFIDCFLIKMEQ

EKHNQEMEFTFENLIASVSDLFGAGTETTSTTLRYGLLMLLKHPEVT

AKVQEEIDRVIGRHRSPCMQDRSHMPYMDAVVHEIQRYIDLVPTNLPHAVTRDIKFRNYLIPK

GTTVVTSLSSVLHDEKEFPNPKVFDPAHFLDESGNFKKSDYFMAFSA

GKRSCVGEGLARMELFLFLTTILQKFTLKSVVDPKDLDTTPVSSGFGHVPPPYQLCFTPL*

 

CYP2C exon 1

SCAFFOLD291268

MDLFVVLVICLSILIFLFLWNQRHAKGKLPPGPTPLPIVGNILQINIKNVSKSISK

 

CYP2C partial pseudogene sequence with frameshift and stop codon exons 1-3

SCAFFOLD113483

MDLAVVLVLYFSCLLLLSLWKQSSGKGKLLPGPTPLPILGNTLQLDVNDISKSLSD

LSEVCGPVFTVYFGMKPTVVLYGYEAVKETLI

DLGEEFSGRGSLPLPERISKGH

GILFSSGKRWKETRRFSLVTLRNFRMGE*SVEDRVQEEARCLVEELRKTNG

 

CYP2C pseudogene fragments

SCAFFOLD51208

EKHNQELEFTVENLMITVSDLFGAGTEMTSTTLRYGLLLLLKHPEVT

AKFQEEIDHMIGRDQSPCMQDRSHMPYVDAVVHEIQRYIDL (frameshift)

IPTNLPLAVTHDVKFRNYLIPK

Gap missing exon 8

XXXXXXXEGLTCVELFLFLTTILQ (frameshift)

KFTLKSVVDPKDIDTTPIVN

 

CYP2C fragments exons 5-7, two copies, 94% identical

BZ878104 78% to 2C18 87% to 2C89 SCAFFOLD40567 no ESTs

AAFC02042798.1

LYNAFPSLLHYLPGSHNTLFKNMTEQRKFILEKIKEHQESLDLNNPQDFIDYFLIKMEK

EKHNKQSEFTMDNLITTVWDVFTAGIETTSISLKYGLLLLLKHPEVT

AKVQEEINRVVGRNRSPCMQDRSRMPYTDAVIHEIQRYIDIVPNNLPHAAAQDIKFREYLIPK

 

Second partial gene on same contig AAFC02042798.1

Note: exons 8,9 not found between these.

LYNAFPSLLHYLPGSHNTLFKNMTEQRKFILEKIKEHQESLDLNNPQDFIDYFLIKMEK

EKHNKQSEFTMDNLITTIWDVLTAGIETTSLTLRYGLLLLLKHPEVT

AKVQEEINHVVGRNRSPCMEDRSRMPYTDAVIHEIQRFIDLVPNNLPHAAAQDIKFREYLIPK
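
Identity figures such as "94% identical" can be checked by comparing aligned residues position by position. The sketch below is only an illustration: percent_identity is a hypothetical helper, it assumes the two strings are already aligned and of equal length (true for the two middle lines of the copies above, which contain no gaps), and it is not necessarily the method used to obtain the percentages on this page.

```python
def percent_identity(a: str, b: str) -> tuple[int, int, float]:
    """Count identical positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and aligned to the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return matches, len(a), 100.0 * matches / len(a)


# Middle line (EKHNKQ...) of each of the two copies on contig AAFC02042798.1
copy1 = "EKHNKQSEFTMDNLITTVWDVFTAGIETTSISLKYGLLLLLKHPEVT"
copy2 = "EKHNKQSEFTMDNLITTIWDVLTAGIETTSLTLRYGLLLLLKHPEVT"

matches, length, pct = percent_identity(copy1, copy2)
print(f"{matches}/{length} identical ({pct:.1f}%)")
```

Over this one segment, 5 of 47 residues differ. A value for a single segment need not equal the figure quoted for the whole pair, which may cover all three segments or have been computed at the nucleotide level.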

 

CYP2D14

TC205271 78% to 2D6

MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPTPLPVLGNLLQVDFEDPRPSFNQ

LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPPAVYEHLGYGPRAEG

VILARYGDAWREQRRFSLTTLRNFGLGKKSLEQWVTEEASCLCAAFADQA

GRPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIIKLLDLTEDGLKEEFNLVRKV

VEAVPVLLSIPGLAARVFPAQKAFMALIDELIAEQKMTRDPTQPPRHLTDAFLDEVKE

AKGNPESSFNDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR

RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK

GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA

GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSEHGVFAFLVTPAPYQLCAVPR*

 

CYP2D43

SCAFFOLD 102164

5681 MGLLSGDTLGPLAVALLIFLLLLDLMHRRSRWAPRYPPGPMPLPVLGNLLQVDFEDPRPSFNQ

     LRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPQALYKHLGFGPRAEG 6760

7291 VILARYGNAWREQRRFSLSTLRNFGLGKKSLEQWVTEEASCLCAAFADQA 7449

7550 GHPFSPMDLLNKAVSNVIASLTFGCRFEYNDPRIVKLLDVMEDGLKEEMKIMRQV 7714

8109 VEAVPVLLSIPGLAAKVVPGQKAFMTLVDELIAEQKMTRDPTQPPRHLTDAFLDEVKE 8288

     AKGNPESSFSDENLRLVVADLFSAGMVTTSTTLAWALLLMILHPDVQR 8591

8806 RVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPK 8985

9424 GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA 9603

     GRRACLGEPLARMELFLFFTSLLQHFSFSVPAGQPRPSDHGVFVALVTPAPYQLCAVPR 9843

 

CYP2E1

AJ001715 TC189658 79% to human 2E1

MAALGITVALLVWMATLLFISIWKHIYSSWKLPPGPFPLPIIGNLLQLDIKNIPKSFTR

LAERYGPVFTLYLGSQRAVVVHGYKPVKEVLLDYKNEFSGRGENPGFQMHKNN

GIIFNNGSTWRDTRRFSLTTLRDLGMGKQGNEQRIQREAHFLLEVLRKTQ

GQPFDPTFVVGFAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQ

LYNNFPDYLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSCPRGFLDTMLIEM

AKERHSVDPMYTLENIAVTVADLLFAGTETTSTTLRYGLLILMKYPEVE

EKLHEEIDRVIGPSRIPAVKDRLDMPYLDAVVHEIQRFIDLLPSNLLHEATQDTVFRGYVIPK

GTVVIPTLDSVLHDRQEFPEPEKFKPEHFLNENGKFKYSDHFKAFSA

GKRVCVGEGLARMELFLLLAAILQHFNLKSLVDPKDIDLSPIAIGFGKIPPRYKLCLIPRSKV*

 

CYP2F1 missing exon 1

AW312559.1 AAFC02178016.1 AAFC02183743.1 AJ459276 CC540963 BAC end

XXXXX

LSKEFGAVYTVYLGPRRVVVLSGYQAVKEALVDQAEEFGGRGDYPVFFNFTKGN

GIAFSNGDRWKVLRKYSVQILRNFGMGKRTIEERILEEGHFLLEELRKTQ

GKPFDPTFVVSRSVSNIICSVIFGSRFDYDDDRPLSIIHLINENFQIMSSPWGE

MYNIFPNLLDWVPGPHRRLFKNYGRIKDIIARSVREHQASLDPNSPRDFIDCFLTRWH

QEKQDPLSHFFMDTLLMTTHNLLFGGTETVGTTLRHAFRLLMKYPEVQ

VRVQEEIDRVVGHERLPTVEDRAAMPYTDAVIHEVQRFADVIPMSLPHRVTRDTNFRGFTIPR

GTDVITLLNTVHYDPSQFLKPKEFNPEHFLDANMSFKKSPAFMPFSA

GRRLCLGEALARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNLPRPYQLCVLAR

 

CYP2G1

83% to mouse (88% to human pseudogene)

Database location  : AAFC01082069   3860 to 4039 (+)

Genomic location   : SCAFFOLD219575 3860 to 4039 (+)

Database location  : AAFC01082069   4854 to 5015 (+)

Genomic location   : SCAFFOLD219575 4854 to 5015 (+)

Database location  : AAFC01082069   6748 to 6897 (+)

Genomic location   : SCAFFOLD219575 6748 to 6897 (+)

Database location  : AAFC01082069   8738 to 8899 (+)

Genomic location   : SCAFFOLD219575 8738 to 8899 (+)

Database location  : AAFC01082069   9151 to 9327 (+)

Genomic location   : SCAFFOLD219575 9151 to 9327 (+)

Database location  : AAFC01202105   145 to 300 (-)

Genomic location   : SCAFFOLD199473 145 to 300 (-)

Database location  : AAFC01421374    997 to 1185 (+)

Genomic location   : SCAFFOLD281678 3294 to 3482 (-)

Database location  : AAFC01421374   1314 to 1454 (+)

Genomic location   : SCAFFOLD281678 3025 to 3165 (-)

Database location  : AAFC01603617    586 to  765 (+)

Genomic location   : SCAFFOLD281678 1315 to 1494 (-)

3860 MELGGAFTIFLALCLSCLLILIAWKRMSKGGKLPPGPTPIPFLGNVLQVRTDATFQSFMK(0) 4039

4854 LKEKYGPVFTVYMGPRPVVVLCGHEAVKEALVDRADEFSGRGELASVERNFQGH(1)5015

6748 GVALANGERWRILRRFSLTILRDFGMGKRSIEERIQEEAGFLLVELRKTR(1)6897

8738 GARIEPTFFLSRTVSNVISSVVFGSRFDYEDQQFLKLLQMINQSFIEMSTSWAQ (0) 8899

9151 LYDMYSGIMQYLPGRHNRIYYLIEELKDFIASKVKINEASLDPQNPRDFIDCFLIKMHQ(0) 9327

300  DKNNPHTEFNLKNLVLTTLNLFFAGTETVSSTLRYGLLLMMKHPEVE(1)145

997  AKIHEEIDQVIGPHRIPSVDDRAKMPYTDAVIHEIQRLTDIVPMGVPHNVIRDTHFRGYLLPK(0) 1185

1314 GTDVFPLLGSVLKDPKYFRYPDAFYPQHFLDEQGHFKKNEAFVPFSS(1) 1454

586  GKRICLGEAMARMELFLYFTSILQNFSLRSLVPPADIDITPKVSGFGNIPPTYELCFMVR(1) 765
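
The "Database location" and "Genomic location" lines above each give a contig or scaffold name, a coordinate range and a strand. A minimal sketch of reading those lines and converting each range to a length follows. The regular expression, the parse_location name and the dictionary keys are all hypothetical, and the sketch assumes the ranges are inclusive nucleotide coordinates.

```python
import re

# Pattern for lines such as:
#   Database location  : AAFC01082069   3860 to 4039 (+)
LOCATION = re.compile(
    r"^(Database|Genomic) location\s*:\s*(\S+)\s+(\d+)\s+to\s+(\d+)\s+\(([+-])\)"
)


def parse_location(line: str):
    """Parse one location line; return None for lines in any other format."""
    m = LOCATION.match(line.strip())
    if m is None:
        return None
    kind, accession, start, end, strand = m.groups()
    span = int(end) - int(start) + 1  # assumes inclusive coordinates
    return {
        "kind": kind,
        "accession": accession,
        "start": int(start),
        "end": int(end),
        "strand": strand,
        "nt": span,
        "codons": span // 3,  # whole codons only; split codons are ignored
    }


first = parse_location("Database location  : AAFC01082069   3860 to 4039 (+)")
print(first["nt"], first["codons"])
```

Under the inclusive-coordinate assumption the first range is 180 nt, or 60 codons, which agrees with the 60 residues on the line labeled 3860 to 4039. Ranges whose exons begin or end inside a codon will not divide as cleanly.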

 

CYP2J26

complete 2JD

CB465835.1 SCAFFOLD215002 CB462842.1 SCAFFOLD278853

SCAFFOLD107814 CB465835.1 SCAFFOLD215002 CB462842.1 11497

CB532345.1

MLEALGSLVAALWTTLRPGIVLLGAFVFLLFADFLKRQHPKNYPPGPLRLPFIGNFFHLDLGKGILVPQQ

VVKKYGNIIRLDFGVIHFIVITGLPYIKEALVNQEQNFVNRPMIPLQKHIFNNK

GLVRSNGQVWKEQRRFTLTTLRNFGLGRKSLEERIQEEVTYLIQAIGEEN

GQPFDPHFIINNAVSNIICSITFGERFDYKDDQFQELLRLLDEILCIQASVCCQ

LYNAFPRIMNFLPGSHHTLFRKWEKLKMFVANVIENHRKDWNPAEARDFIDAYLQEIEK 11676

HKGNATSSFDDENLICSTLDLFLAGTETTSTTLRWGLLFMALNPEIQ 14705

EKVQAEIDRVLGQSQKVSTASRESMPYTNAVIHEVQRMGNIVPMNVPREVTVDTVLAGYH 15236

LVKGTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI

GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRESLTSSPASYRLCAIPRA* 25310

 

CYP2J27

SCAFFOLD215002 complete  2JC

BM258720.1 CK950217.1

SCAFFOLD388989

This gene has an extra exon 5

MLEALGSLAAALWAALRPGTVLLGAVVFLFLDDFLKRRRPKNYPPGPPPLPEVGNFFQLDFDKAHLSLQR

FVKKYGNVFSVDFGIFRSVLITGLPLIKEALVHQDQNFANRPLIPIEKRIFNNK 37352

GLIMSNGHVWKEQRRFALTTLRNFGLGKKSLEERIQEEAAYLIQEIGEEN 39667

GQPFDPHFTINNAVSNIICSITFGERFDYQDDQFQELLRLFDEMMHLRTSTCCQ 40221

LYNIFPRIMSFLPGPQHALFSKWEKLKMFIAGVVENHKRDWNPAEARDFIDAYLQEIEK 42145

HKGNATSCFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ 43949

EKVQAEIDRVLGQSQKPSMAARESMPYTNAVIHEVLRMGNILPLNVPREVTVDTVLAGYRLPK

GTMVTTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI

GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEKLSLKFRMSMTLSPLSHRLCAIPRA*

 

CYP2J27-ie5b

SCAFFOLD215002  BM258720.1 CK950217.1

extra exon 5

LSNVFPRIMNFLPGPQHTLFSKWEKLKMFIAGVIENHKRDWNPAEARDFVDAY 41591

 

CYP2J28

complete  2JA

SCAFFOLD256921 CK945989.1 CK945989.1 CO259427.1

SCAFFOLD295402 CO259427.1 SCAFFOLD309869 CO893955 CO259427.1

SCAFFOLD213132 CO259427.1 SCAFFOLD247979 CO893955 CO259427.1

BE664093.1 SCAFFOLD65540 CO893955 CK770461.1

MLEALGSLAAALWAALRPGTVLLGAIVFLLLTDLLNRRRPKNYPPGPPRLPFVGNFFQLDFEQGHLSLQR

FVKKYGNLFSLEFGDLPSVVITGLPLIKEVLVYQDQNFVNRPISPIRERVFKKN

GLIMSNGHIWKEQRRFSLTALRNFGLGRKSLEERIQEEVAYLIQAIGEEK

GQPFNPHFKINNAVSNIICSITFGERFDYQDDQFQELLRLLDEVTYLETTVWCQ

LYNVFPRIMNFLPGPHQMLFSNWRKLKMFVARVIENHKRDWNPAEARDFIDAYLQETEK

HKGNAASSFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQ 716

EKVQAEIDKVLDESQQPSMATRESMPYTNAVIHEVQRMGNILPLNVPREVTVDTVLAGYHLPK

GTMVLTNLTALHRDPAEWATPDTFNPEHFLENGQFKKREAFLPFSI

GKRMCLGEQLARTELFIFFTSLLQKFTFRPPEHEELSLKFRMGLTLSPVSHCLCAVPRA*

 

CYP2J29

complete 2JB

SCAFFOLD25620 DN825584.1 DN825609.1 CN793001.1

SCAFFOLD126970 SCAFFOLD87184 CK957631.1 SCAFFOLD166503

CK834395 CK945908.1 SCAFFOLD166503 CK960783.1 CK957063.1

SCAFFOLD245478 CK945645.1

MLSSLAAALWAALRPGTVLLGAVAFLFFADFLKRRRPKNFPPGPAGLPFVGNSFQLDPEKVHLTLQQ

FVKKYGNVFSLDFGTFPSILITGLPLIKEALVHQGENFSKRPVMPLQERIFNTK

GLIMSSGHIWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQMIREEN

GKPFDPHFIINNAVSNIICSITFGERFDYQDSQFRELLRLLDEVLNLHTSLCCQ

LYSVFPRIMNFVPGPHQTLFSNLEKLKMFVAEMIENHKRDWNPAEARDFIDAYLQEIEK 8435

HKGGDASSFREENLIYSTLDLFLAGTETTSTSLRWGLLYMALNPEIQ 5634    

EKVQAEIDRVLGQSQQPSTAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK 5455

GTVVVTNLTALHRDPAEWATPDTFNPEHFLENGQFKKRESFLPFSI 2548

GKRMCLGEQLARAELFIFFTSLLQKFTFRPPENEKLSLKFRVSLTLAPISHRLCAVPRG*

 

CYP2J30

complete 2JH

DN825200.1 CK950246.1 SCAFFOLD281439

SCAFFOLD298803 CK940975.1

SCAFFOLD170755 CK945247.1 CN793929.1

MLEALSSLATALWAALRPDTVLLGTLAFLLFVDFLKRRHPKNYPPGPPGLPFVGNLFQLDPEKVPLVLHQ

FVKKYGNVFSLDFGTVPSVLITGLPLIKEVLVHQGQIFSNRPIVPLQEHIINNK

GLIMSSGQLWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQTIREEN

GQPFDPHLTINNAVSNIICSITFGERFDYQDDQFQELLRMLDEILNLQTSMCCQ

LYNVFPRIMNFLPGPHQALFSNMEKMKMFVARMIENHKRDWNPAEARDFIDAYLQEIEK

HKGDATSSFQEENLIYNTLDLFLAGTETTSTSLRWGLLFMALNPEIQ

EKVQAEIDRVLGQSQQPSMAARESMPYTNAVIHEVLRMGNIIPLNVPREVAVDTTLAGYHLPK 15084

GTMVMTNLTALHRDPTEWATPDTFNPEHFLENGQFKKRESFLPFSI 12265

GKRMCLGEQLARTELFIFFTSLLQKFTFRPPENEQLSLKFRVSLTLAPVSHRLCAVPRG*

 

CYP2J31P pseudogene

CYP2JF SCAFFOLD295104 no ESTs possible pseudogene (N-term is short)

some frameshifts present,  exons 6-8 missing

MGAAAFLFVVHLKRRRGKNYPPGPPGLPFLGNFFHLDLKQLHLSLQQ

IVKKYGNMISLEMGGFSTVFFKWIAQNQRSPCLPGPKLVNHPIQRIQENIFKKH 5343

GLIMSNGHIWKEQRRSALTTLRNFGLGRKILEECIQEEAAYLIQTVGEEN 8001

XQPFDPHFTINNAVSNIVCSIAFGELFDYQDSXXQELLRLMDEAMYLQTSVRCRV 8538

LYNFFARIMNFLPGPHQTLFIKWEKLNMFIDSVIENHRRDWNPAEPRDFTDA

15856 GMWMCPGEQLARTELFIFFTSLLQKFTFRPPGDEKLSLQFRVSLTISSVSHWLC 16020

 

CYP2R1

scaf.96150, 81935, 91707 CK969495.1

MWEPHSAEAFVAALGGVFFLLLFALGVRQLLKQRRPSGFPPGPSGLPFIGNIYSLAASAELPHVYMKKQSQVYGE (0)

IFSLDLGGISAVVLNGYDVVKECLVHQSEIFADRPCLPLFMKMTKMG (1)

GLLNSRYGRGWVDHRKLAVNSFRCFGYGQKSFESKILEETKFFIDAVETYNGSPFDLKQLV

TNAVSNITNLVIFGERFTYEDTDFQHMIELFSENVELAASATVFLYNAFPWIGILPFGKH

QQLFRNAAVVYDFLSRLIEKASINRKPQLPQHFVDAYLDEMERSKNDPSSTFSKENLIFS

VGELIIAGTETTTNVLRWAVLFMALYPNIQ (1)

GQVQKEIDLIIGPSGKPSWDEKCKMPYTEAVLHEVLRFCNIVPLGIFHATSEDAVVRGYSI

PKGTTVITNLYSVHFDEKYWRDPEIFYPERFLDSSGHFAKKEALIPFSL (1)

GRRHCLGEQLARMEMFLFFTALLQRFHLHFPHELVPNLKPRLGMTLQPQPYLICAERR*

 

CYP2S1

AAFC02159945.1

MEAAGTWALLLLLLLLVVTLVLPATWDRGHLPPGPTPLPLLGNLLQLRPGALYLGLLR

LSKKYGPVFTVYLGPWRRVVVLVGHEAVQEALGGQAEEFSGRGTVATLDGTFDSH

GVFFSNGERWRQLRKFTTLALRDLGMGKREGEELIQAEARCLVEALQGTK

GRPFDPSLLLAQATCNIICSLVFDLRLPYDNEEFQAVVRAAGGIAVGVSSPWGQ

TYEMFSRFLQRLPGPHTQLLRHLGTVAAFAAQQVWQHKGSLGTSGPVRDLVDAFLLKMAK

EKQDPNTEFTAKNLLMTVVYLLFAGTVTVSTTIRYTLLLLLKYPQVQ

ERVQEELMRELGAGQRPSLGDRARLPYTDAVLHEAQRLLALVPMGIPRALTKTTRFRGYTLPQ

GTEVFPLLGSILHDPAVFEEPKEFNPGRFLDADGKFKKHEAFLPFSL

GKRVCLGEGLARTELFLLFTAILQAFSLEGPCPLGALSLQPAISGLFNIPQAFQLQFRPR*

 

CYP2T2P ortholog

AAFC02046938 65% to rat 2T4 66% to 2T2P human

MMISGIIALSLLVLLLAPARWGWGARSTQRQGALPPRATPLRLLGSLLQLRIWRPGPCTHG

LSGRCGPVFTVCLGQCPVVVLCRYAALRDALVLQADAFSGRGAMAVFKRFTRGN

GIAFSKGPRWPTLRNFALGALKEFGLGTQTIEERVLEEAACLLGDFQATGG

GAPFDPQRLLDNAVSNVICSVVLGNHYGYEDMEFLRLLDLFNDNFRIMSSRWGE

XXXXXSLLDWLPGLHH*IFRNFAXLRVFISQQIQLHQQTR*SGKPHDFIDXXXXXXX

GTENPESHFQAETLAMTMHNLFFGXXETTSTTLRYGLILLKYSFVA

AKVQAELDDMVGRMCAPTLEDREHLPYTNTVLHEIQCFISVVPFGLPSALTCDTHLRGYFLPK

GTFVIPLLVSTHWVPTQFKNPECFNPTNFLNDQGEFQSNAFTPFAL

GTCLGAGLAPTDIFLFLTSILLRFFLLPVGSHSDTDLTPQCTGLGNVPPAFQLRLVAR*

 

CYP2U1

XM_615255 AAFC02193700 gnl|ti|665258407

MASPGLPQPPTEDAAWPLRLLHAPPGLLRLDPTGGALLLLVLAALLGWSW

LWRLPERGIPPGPAPWPVVGNFGFVLLPRFLRRKSWPYRRARNGGMNASGQGVQLLLADL

GRVYGNIFSFFIGHYLVVVLNDFHSVREALVQQAEVFSDRPRVPLTSIMTKGKGIVFAHY

GPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFRYVKEEMQKHGDAPFNPFPIVNNAVSN

IICSLCFGRRFDYTNSEFKQMLTFMSRALEVCLNTQLLLVNICSWLYNLPFGPFKELRQI

EKDLTLFLKKIIKDHRESLDVENPQDFIDMYLLHVEEEKKNNSNSGFDEDYLFYIIGDLF

IAGTDTTTNSLLWCLLYMSLHPNIQEKIHEEIARVIGADRAPSLTDKAQMPYTEATIMEV

QRLSTVVPLSIPHMTSEKT

VLQGFTIPKGTIILPNLWSVHRDPAIWE

KPNDFYPDRFLDDQGQLIKKETFIPFGI

GKRVCMGEQLAKMELFLMFVSLMQSFTFVLPKDSKPILTGKYGLTLAPHPFNIIISKR

 

CYP2W1 partial sequence

scaf.279079,  426471 CB440236.1 AAFC02007621.1

XXXXX

LGKQYGPVFTVHLGHQKTVVLTGYEAVKEALVGTGQELAGRPPIAIFQLINGGG (1)

GVFFSSGPRWRAARQLTVRALHGLGVGRAPVANKVLQELRCLTAQLDSYE (1)

GRPFPLALLRWAPSNITFTLLFGQRFDYRDPVFLSLLGLVDEVMVLLGKPSVQ (0)

LFNLYPRLVALLQLHRPVLRKIEEVRAILRALLEARRHRTPPRGPQQSYLDALIQQGQ (0)

XXXXX

XXXXXXXXXXXXXXXXPRPEDVHALPYTNAVLHEVQRFITLLPHAPRCTVANTQLGPYLLPK

GTPVLALLNSVLLDETQWKTPRQFNPGHFLDANGRFVKRPAFLPFSA

XXXXX

CYP2AB1P pseudogene

scaf.15709 BI535102

MCPLLIWLGLLAASFLLLKFSIIYWERNHLPPDPFPFPILGNPWQLSFQLHPATLLQ

LAQTHGHVFTVWVGPTPVVVLCSFQA

KEALVSHSEQLSGWPLTPLFQDLAGERG

GVICSSGRTRRQ*RRFCLAALQGLG*GPLALELRLQEEAAGLVEAFHWEQ

GGPFDPQAPIVRSTARVTGALVFGRHFLSEDPFFQELI*ATNFGLAFXXXXXX

QLNDLFPWAFRCLPGPYREMFRYQKAVRGYIHREIMRHKLRTSEAPKDFISCYLAQIIK

ATDDPVSTFNEENLIQVVVGLFLGGTDTTGTTLYWVLIYMIQYGAIQS

ERVQQELVTVLGTSGAICYKDHEQLPHICTLLHEAQRLSSVA*V

AVCQCVTSTHVHGHPVPK

GTIILPNLAAVLCDPECWRTSRQFNPGHFLDKDGNFVVRDIFPPFSA

GHQMCLGD*LAQMKLLLMFATLLGTFSFQLPGRSPGLRLEYNFGGTRKPLPQKIYAVSRLNCPHPGPREEVL*

 

CYP2AC1P pseudogene

67% to rat 2AC1

SCAFFOLD151603, SCAFFOLD251118 AAFC02071354 pseudogene

BZ940825 CC910297

MSGFESSFILPILSLILIFILNIKIVMTKASKQHFPPVPRPLPIIGNLHILNLKRPYQTMLE (0)

LSQKYGSIYSIQIGPRKVAVLxGYETVKDVLVNHTDQFGEWFHVPISERLFEGK

GIFFSHSDTSKIIRFTLTTSQNFGMGKKALEDTIIGESQHLIRNFETDKG

GKPFEVKTLTNASVANINVSVLLGKGFDYQNTPFLRLLTLIDQSVKLIVSPPTA

LFNMFPVLRFLLKTYKNILRNKDELFSFIRMTFLHHHHKLDKNDPRSLTDAFLVRQQE

DTSTDYFNDDTLVVLVNNLFAAGTESMVSTLCWGILFMSRYPEIQS

KVHDEIAKVMGSTQP*MAH*TQMPYTDAVILEVQRFADILPTGLPRATTTNTIFKNNYIPK

GTEVIFLLTSVL*DQTQWENPATFNPEHFLDSIEKFIKKEAFISFSV (1)

SPL*CAGESLAKMELLLFFMSLLQKFTFQPPPGVSHLDLDPTRDTGVVIQPMPHKIRALPRA

 

CYP3 Family

 

CYP3A28

TC211564 Y10214 73% to 3A5 SCAFFOLD259874

MELIPSFSMETWVLLATSLVLLYI (2)

YGTYSYGLFKKLGIPGPRPVPYFGSTMAYHK (0)

GIPEFDNQCFKKYGKMWG (2)

FYEGRQPMLAITDPDIIKTVLVKECYSVFTNRR (0)

IFGPMGIMKYAISLAWDEQWKRIRTLLSPAFTSGKLKE (0)

MFPIIGQYGDMLVRNLRKEAEKGNPVNMKD (2)

MFGAYSMDVITGTAFGVNIDSLNNPHDPFVEHSKNLLRFRPFDPFILSI (1)

ILFPFLNPVFEILNITLFPKSTVDFFTKSVKKIKESRLTDKQM (0)

NRVDLLQLMINSQNSKEIDNHK (1)

ALSDIELVAQSTIFIFGGYETTSSTLSFIIYELTTHPHVQQKLQEEIDATFPNK (0)

APPTYDALVQMEYLDMVVNETLRMFPIAGRLERVCKKDVEIHGVTIPKGTTVLVPLFVLHNNPELWPEPEEFRPER (2)

FSKNNKDSINPYVYLPFGTGPRNCLGMRFAIMNIKLALVRILQNFSFKPCKETQ (0)

IPLKLYTQGLTQPEQPVILKVVPRGLGPQVEPDFL*
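
Several entries, including the one above, are written one exon per line with a number in parentheses at the end of the line. The sketch below shows one way such lines could be split into a peptide and that number, and the exon peptides joined into a single sequence. It assumes the parenthesized value is the intron phase at the exon boundary, which this page does not state explicitly; parse_exons and the pattern are hypothetical.

```python
import re

EXON_LINE = re.compile(
    r"^\s*(?:\d+\s+)?"       # optional leading coordinate
    r"([A-Z*]+)"             # exon peptide (uppercase residues, '*' for stop)
    r"\s*(?:\(([012])\))?"   # optional boundary annotation (0), (1) or (2)
    r"\s*\d*\s*$"            # optional trailing coordinate
)


def parse_exons(lines):
    """Return (peptide, phase) pairs; phase is None when no annotation is present."""
    exons = []
    for line in lines:
        m = EXON_LINE.match(line)
        if m:
            phase = int(m.group(2)) if m.group(2) is not None else None
            exons.append((m.group(1), phase))
    return exons


# First three exon lines of CYP3A28, copied from the entry above
cyp3a28_start = [
    "MELIPSFSMETWVLLATSLVLLYI (2)",
    "YGTYSYGLFKKLGIPGPRPVPYFGSTMAYHK (0)",
    "GIPEFDNQCFKKYGKMWG (2)",
]
exons = parse_exons(cyp3a28_start)
protein = "".join(peptide for peptide, _ in exons)
phases = [phase for _, phase in exons]
print(len(exons), phases, len(protein))
```

Lines that carry coordinates before or after the peptide, as in the CYP2G1 entry, are accepted by the same pattern; header and accession lines do not match and are skipped.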

 

CYP3A74

TC219554 76% to 3A4 Length = 563 N-term CK957598.1

MELILSFSTETWVLLATGLVLLYL (2)

YGTYSYGLFKKLGVPGPRPLPYFGNILSYRK (0)

GVCEFNEECFKKYGKIWG (2)

IFEGKQPLLVITDPDMIKTVLVKECYSVFTNRR (0)

VFGPSGVMKNAISVAEDEQWKRIRTLLSPTFTSGKLKE (0)

MFPIIGKYGDVLVRNLRKEAEKGTSVDIKD (2)

IFGAYSMDVITSTSFGVNIDSLGNPQDPFVENVKKLLRFSILDPFLLAV(1)

VLFPFLVPILDVLNITIFPKSAVNFFTKSVKRIKESRLKDNQK(0)

PRVDFLQLMINSQNSKETDNHK  (1)

ALSDQELIAQSIIFIFAGYETTSSTLSFLLYILATHPDVQQKLQEEIDATFPNK (0)

APPTYDVLAQMEYLDMVVNETLRMFPIAIRLERLCKKDVEIHGVSIPKGTTVMVPISVLHKDPQLWPEPEEFRPER (2)

FSKKNKDSINPYVYLPFGTGPRNCIGMRFAIMNMKLAIVRVLQNFSFKPCKETQ (0)

IPLKISSQGVLRPEKPVVLKVVLRDGTISGA*

 

CYP3A75 missing exons 3-6

SCAFFOLD20102 DN530970.1 CK954349.1 DN532344.1 AAFC02132381 SCAFFOLD237847

MELIPSFSMETWVLLSISLVLLYL

YGTYSHGLFKKLGVPGPRPLPYFGNVLSYRK

 

VFGAYSMDVITSTSFGVNIDSLGNPQDPFVENAKKLLRFDILNPFLLSV

VLFPFLVPIFEVLNITMFPKSAVNFLAKSVKRIKESRLKDNQK

PRVDFLQLMINSQNSKETDNHK

ALSDQELMAQSVIFIFAGYETTSNTLSFLLYILATHPDVQQKLQEEIDVTFPNK

APPTYDVLAQMEYLDMVVNETLRMFPITVRLDRLCKKDVKIHGVSIPKGTTVTVPISVLHRDPQLWPEPEEFRPER

FSKKNKDTISPYVYLPFGTGPRNCIGMRFAIMNMKLAVVRVLQNFSFKSCKETQ

IPLKINSQGLIRPEKPIFLKVVLRDETISGA*

CYP3A76

TC192141 75% to 3A4 SCAFFOLD300381 AM015519.1 AAFC02005660.1 DR712863.1

MELIPNFSVETWVLLAISLVLLYL(2)

YGTYSHGLFKKLGVPGPRPLPLFGNVLSYRK (0)

GVCEFDEECFKKYGKMWG (2)

IFEGKHPLLVITDPDMIKTVLVKECYSVFTNRR (0)

VFGPMGVMKNAVSVAEDEQWKRIRTLLSPTITSGKLKE (0)

MFPIIGKYGDVLVRNLRKEAEKGTSVDMKE (2)

VFGAYSMDVITSTSFGVNIDSLGNPQDPFVENAKKLLRFDILDPFLLSV (1)

VLFPFLIPIFEVLNISIFPKSAVNFLTTSVKKIKESRLKDTQK (0)

PRVDFLQLMINSQNSKETDNHK (1)

ALSDQELMAQSIIFIFGGYETTSTSLSFIIYELATHPDVQQKLQEEIDATFPNK (0)

APPTYDVLAQMEYLDMVVNETLRMFPIAVRLERFCKKDVEIHGVSIPKGTTVTVPISVLHRDPQLWPELEEFRLER (2)

FSKKNKDSISPYVYLPFGTGPRNCIGMRFAIMNMKLAVVRVLQNFSFKPCKETQ (0)

IPLKIKSQGLLRPEKPIFLKVVLRDETISGA*

 

CYP3A76-ie6b

extra exon 6 SCAFFOLD300381 other exon 6 at 7654

9898 MFPIIGKYGDVLVRNLRKEAEKGKSVNMKE

 

CYP4 Family

 

CYP4A40

Scaffold 211417 AAFC02026180 AAFC02091356.1 CB417720.1 BE750809.1 CYP4Ab

MSVSALSPSRALGGVSGLLQVVSLLGLVLLLIKAAQLYLRRQWLLKALHHFPSPPSHWFCGHKWE

FQEEGELPHLLKRVEKYPRACVRWMWGTRALLLVYDPDYMKMVLGRS

DPKAQIIHRFVKPWI

GTGLLLLEGQTWFQHRRMLTPAFHYDILKPYVGIMADSVRVML

DKWEELVSQDSHLEIFGHVSLMTLDTIMKCAFSQQGSVQTDR 369

SSQSYIQAIKDVSHLIISRLRNAFHQNDLIYRLTPEGHWNHRACQLAHQHT 895

DAVIKERKVRLQKEGELEKVRSRRHLDFLDILLFAR 1445

MENGSSLSDEDLRAEVDTFMFEGHDTTASGISWILYALASHPEHQQRCREEIQS 1690

LLADGASITW

DHLDQMPYTTMCIKEAMRLYPPVPVISRELSKPITFPDGHSLPA 1952

GILVSLSIYGLHHNPKVWPNPE

VFDPTRFAPGSTRHSHAFLPFSGGSR 3927

NCIGKQFAMNELKVAVALTLLRFELSPDSSRVPVPMPVIVLRSKNGIHLQLRKLSDPGGDK

 

CYP4A41

77% to 4A11 XM_594222

MSVSVLSPTRALGGVSGLLQVVSLLGLVLLLLKAAQLYLRRQWLLKALHQFPSPPSHWFYGHKKE

FQKESELPPLLERVEKYPKACVRWLWGTKALVLVYDPDYMKVFLGRS

DPKPHRTYKYLAPWI

GTGLLLLEGQKWFQHRRMLTPAFRYDILKAYVGIMADSVRVML

DKWEELVSQDSHLEIFGHVSLMTLDTIMKCAFSHQGSVQMDR

SSQSYIQAIRDLSHLIVSRLRNAFHQNDLIYRLTPEGRWNHRACQLTHQHT

DAVIKERKAHLQKEGELEKVRSRRHLDFLDILLLAR

MENGSSLSDEDLRAEVDTFMFEGHDTTASGISWILYALASHPEHQQRCREEIQSLLGDGASITW

DHLDQMRYTTMCIKEAMRL

YPPVPFIGRELRKPITFPDGRSLPA

GILVSLSFYGLHHNPNVWPNPE

VFDPTRFSPGSTQHSYAFLPFSGGSR

NCIGKQFAMNELKVAVALTLLRFELSPDPSRVPVPTPIMVLRSKNGIHLQLRKLSDPGGDKDKL

 

CYP4A42P pseudogene N-terminal

XM_606400 SCAFFOLD236059 72% to 4A40

KEKELEELLKRVEEFPYAYPCWMWG

DNVHLVVYDPDYMKVVLRRS

NPKSEDIYRFLAPWI

GLLLLEEQTWFQHRQMLTPAFHYDILKPYLGLMADSVQGML

DKWEELVSQDSHLEILGDISLMTLDTIMKCVFSH*GSIRNDR

XXXXXXXXXXXXXXXXXXXLRNVFYQNDLIYRLTPEGQWSY

XXXXXXXXX

TDAVIEERKTCLRKEGELEKVRSRRHLDFLDILLFSRV

NGSSLSDEDLRAEVDTFIFGGHDTTASGISWILYALASHPEHQQRCREEIQSLLGDGASITW

 

CYP4A pseudogene C-terminal

Scaf. 42211 pseudogene  

GNGTFLLEGQMWFQYWGMLTSAFHHDILRPSMGHMAYSVQLM

TDSIIKGRKSHLHKEGELEKERKWRPLDFLDILLFAR

MKTGSSLSDKDLHAEVDKVMFKGQETTTSGISWILYSLVSHPVHQQR

EEIQRLLGDGASITW

DHLDQMPYTTXCI

QEALRFYPPVPVIGRELSKPITFPVGHSLPA

XXXXXXXXXXXXXXXXXXXXXX

VFDSS*SAPGPAGHNHAFLPFSGGS

 

CYP4A pseudogene C-terminal

Scaf. 268824 c-term 4A pseudogene

GIEVALSFYGLHHNPKV*LNPE