Bovine P450s
There are 319,805 ESTs for Bos taurus in dbEST.
There are 291,106 BAC-ends in dbGSS.
It is interesting that there is an EST for a P450 in Bos that is not yet found among human ESTs (CYP26C1).
This is so even when there are over 5 million human ESTs.
D. Nelson
Sept. 25, 2003
TC, NP numbers represent TIGR gene index numbers
There are 98 sequences shown here. There are at least 53 genes and probably more
(including four more that are currently missing CYP24, 2U, 2R, and 27B).
For an alpabetical list of accessions see ALPHA LIST
CYP24 is not found yet. This is the only mammalian family missing.
Among subfamilies no CYP2G, 2R, 2U, 4Z, 27B are found yet.
Family CYP1
Subfamily 1A probably 2 genes and a pseudogene
>CYP1A1 AB060696.1 = NP338778, TC208445 Probable 1A1 CC766779.1 Bac end
(missing 17 aa at N-term)
VFCLVFWV
2 VRTWRPRVPQGLKSPPEPWGWPLLGHMLMLGKNPHVVLSQLSQRYGDVLQIRIGCTPVLV 181
182 LSGLDTVRQALVRQGDDFKGRPDLYSFTLITNGQSMTFNPDSGPVWAARRRLAQNALKSF 361
362 STASDPASSSSCYLEEHVNKEAKYLLGKFQELMSGPGRFDPYRYIVVSVANVICAICFGR 541
542 RYD 550
(gap of 85aa)
3 LDENANIQLSDEKIINVVIDLFGA 74
163 GFDTVTTALSWSLLYLVTSPRVQKKIQEELDTVIGRARRPRLSDRPQLPYLEAFILETFR 342
343 HSSFVPFTIPHSTTRDSNLNGFYIPKGRCVFVNQWQINHDQKLWEDPSEFRPERFLTADG 522
523 TINKVLSEKVIIFGLGKRKCIGETIARLEVFLFLAILLHQVEFCVTPGVKVDMTPVYGLT 702
703 MKHARCEHFQAHMRS 747
>cattle|TC201439 probable 1A2 Length = 946 PLLGHM to J-helix Like 81% to 1A2 67% to 1A1
3 LKSPPEPWGWPLLGHMLTLGKNPHVVLSQLSQRYGDVLQIRIGCTPVLVLSGLDTVRQAL 182
183 VRQGDDFKGRPDLYSFTLVTDGQSMTFNPDSGPVWAARRRLAQNALNTFSVASDPSSSSS 362
363 CYLEDHVSKEAEALLGKFQELMSGPGRFDPYGHVVASVANVIGAICFGQHFPQSSKEMLS 542
543 LVESSHDFVESASSGNPVDFFPILKYLPNPALQRFKSFNQRFLQFVRKTVQEHYQDFDKN 722
723 SIQDIIGALFKHSEDNSRASSRLISQEKTVNLVNDLFAAGFDTITTAISWSLMYLVT 893
894 NPKIQRKIQEELDTVIG 944
>cattle|TC212268 79% to 1A2 Length = 828 some frameshifts, probably a pseudogene
2 IPHSTTRDTTLNGFFIPKERCVFINQWQVNHDPK
105 LWGDPSVFRPERFLTSDGTTIDKTAM
CEKVLLF
GMGKLRCIGEFMV 241
RWEVFLFLAILL 278
279 QRLEFSVPPGVKVDLTPTYGLTMKHARCEHMQARLRFPIK* 401
>cattle|BM258918 71% to 1A8P Length = 532
408 VLHEVRTIWDNPNLLRPERFLNENRELNKNLIEKIFIFGMG 530
Subfamily 1B probably 1 gene
>cattle|TC196849 87% to 1B1 Length = 882
VTDIFGCSASLTPCALLVLFTRYSEVQARVQAELDK
VVGKHRLPTL
137 GGQPRLPYVMAFLYEAMRFSTXVPVTIPHATTANASVLGYHIPKDTVVFVNQWSVNHDPVKWSNP 331
332 EDFDPTRFLDKDGLINKDLTGSVMVFSVGKRRCIGEEISKMQLFLFISILAHQCNFKANP 511
512 DEPSKMDFNYGLTIKPKSFKINVTLRESMELLDSAVQKLQVEKECQ*
Family CYP2
Subfamily 2A (there may only be one CYP2A gene in cattle)
>cattle|CB434432 84% to 2A13 Length = 584
15 ILA*GLLLVA*LACLTIMVLMSVWRQRNLKGKLPPGPTPLPFIGNYLQLNTEXMCDGGMKISEHY 209
210 GPVFTVHLGTRQIVVLCGYDAVKEALVDQAEEFSGRGKQATFDWLFKGYGVAFSNGER 383
384 AKQLRRFSITTLRDFGVGKRGIEERIQEEAGFLIEAFRGTRSAFIDPTFFL 536
537 SRTGSNVINSIVFRDR 584
only 41 aa missing between these two sequences
>cattle|TC193989 91% to 2A13 Length = 850
1 AGPQQQAFKELQGLEDFIAKKVEQNQRTLDPNSPRDFIDSFLIRMQEEKENPNTEFYRK 177
178 NLVMTTLNLFFAGTETVSTTMRDGFLLLMKHPDVEAKIHEEIDRVIGKNRQPKFEDRAKM 357
358 PYTEAVIHEIQRFGDMIPMGLARRVTKDTKFRDFLLPKGTEVFPMLGSVLRDPKFFSNPR 537
538 DFNPQHFLDEKGQFKKSDAFVPFSIGKRYCFGESLARMELFLFFTTIMQNFRFKSPQS 711
712 PQDINVSPKLVGFATIPPNYTMSFLPR*
>CB463229 92% to 2A13 one aa diff with TC193989, probably identical
2 FGEGLARMELFLFFTTIMQNFRFKSPQSPQDINVSPKLVGFATIPPNYTMSFLPR 166
Subfamily 2B (there may only be one CYP2B gene in cattle)
>cattle|TC193614 79% to 2B6 Length = 872 N-term and C-term
40 MELSMLLLFALLTGLLVLLARGRPKAHGRLPPGPRPLPFLGNLLQMDRKGLLKSFLRFQQK
ILPEVPTEVYPVLSSAL
249 HESCYFEKPDDFNPDHFLDANGVVKKNDAFMPFSIGKRICLGEGIARIELFLFFTTIL 422
423 QNFSVASPVAPEDIDLTP 476
QESGVGNVPPNYRIQFLPRQRG*
>cattle|CB222090 76% to 2B6 Length = 154
3 LIMSVLSLFFAGTETTSTTLRYGFLLMLKYPHITERIQKEIDQVIGSYRP 152
Subfamily 2C There are at least 6 different genes see alignment
>cattle|TC211258 69% to 2C18 Length = 1763 CB531366
3 GPVFTLYFGMKPTVVLHGYEAVKQVLIDQSEEFSGRGSLPVADNINQGLGIVFSNGEI 176
177 WKQTRRFSLMVLRNMGMGKRTIEHRIQEEALCLVEALKKTNGSPCDPTLLL 329
330 SCAPCNVICSIIFRNRFEYNDERLLTLIKYFNENSRLVSTPWVELYNTFPSLLHYFPGSH 509
510 NTIFKNMTEQRKFILEEIKKHQESLDLNNPQDFIDYFLIKMEKEKHNKHSEFTMDN 677
678 LITTVWDVFSAGTETTSLTLRYGLLLLLKHPEVTAKVQEEIDRVVGRNRSPCMQDKSCMP 857
858 YTDAVLHEIQRYIDLVPSSMPHAATQDVKFREYLIPKGTVILTSLTSVLHDDNEFSNPGQ 1037
1038 FDPGHFLDESGNFKKTDHFMAFSAGKRVCVGEGLARMELFLLLVSILQHFTLKSVVDP 1211
1212 KHIDTAPSFKGLISIPPFCEMCFIPV* 1292
>cattle|BM253908 83% to 2C19 Length = 557
28 LVLCLSCLLLLSLWKQ 75 (gap)
81 ERNVKGHGIVFSNGKTWKETRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTNGLP 260
261 CDPTFILGCAPCNVICSIIFQNRFDYKDQTFLNLMKTINENIKILGSPWIQVLNIFPVLL 440
441 DFFPWSYSYKKLYTNTAYVKNYVLEKTREHQASLDINNP 557
>cattle|CB421823 81% to 2C18 Length = 669
668 KGTGILTSLTSVLYDDKEFPNPEVFDPGHFLDESGNFRKSDHFMAFSTGKRICVGEGL 495
494 ARMELFLFLTTILQNFTLKSVVDPKDLDTTPVVNGLLSVPPFYQLCFIPV* 342
>cattle|TC198839 76% to 2C18 Length = 836 CB422177
3 FSGRGSCPVIQRASKGYGGIFSNGKIWKETRRFSLMTLRDFGMGKRSMEDR 155
156 VQQKACCLVEELRKTDGLPCDPTFILGCAPCNVICSIIFQNHFDYKDQTFLDLMERLN 329
330 ENARILGSPWIQLCSSFPALIDYVPGKHKKFFENYACMKSYVLEKTREHQASLDMNNPR 506
507 DFIDCFLTKMEQEKHNQELEYTVENLAHTVLDLFVAGTETTSTTLRYGLLLLLKHPEV 680
681 TAKVQEEIDHVIGRHQSPCMXDKSHMPYTDAVVHEIQRYIDLVPTNLPHAVTCDIKF
>cattle|CB222086 82% to 2C18 Length = 455
2 PTVVLHGYEAVKEALIDQGEEFSGRGNIPMSQRVNKGYGIIFSNGKRWKEIRRFSLMT 175
176 LRNFGMGKRSIEDRVQEEAHCLVEELRKTNGSPCDPTFILXCAPCNVICSI 328
329 IFQNRFDYTDQNFLNLLDKFNENLQVVSSPWMQVCNTFPILI 454
>cattle|TC213492 83% to 2C19 Length = 687
18 VSSVLHDEKEFPNPKVFDPAHFLDESGNFKKSDYFMAFSAGKRSCVGEGLARMELFLFLTTIL 206
207 QKFTLKSVVDPKDLDTTPV 263
SSGFGHVPPPYQLCFTPL*
>cattle|TC203642 82% to 2C18 Length = 698
22 PQGTDILTSLTSVLHDDKEFPNPEVFDPGHFLDENGNFRKSDYFMAFSAGKRVCVGEG 195
196 LARMELFLFLTTILQTFTLKSVVDPKDLDTTP 291
AVTGIANVPPPYQLCFIPV*
>BZ878104 genomic clone CH240_266K11 Length = 664 78% to 2C18
528 GRNRSPCMQDRSRMPYTDAVIHEIQRYIDIVPNNLPHAAAQDIKFREYLIPK
>CC503360 genomic clone CH240_342N5 Length = 423 81% to 2C18
212 GTGVLVSLTSVLYDDKVFPNPEMFDPGHFLDDSGNFKKSDHFMPFSAG 355
>cattle|CB428210 47% to 2C19 Length = 542
2 IDQGDEFLGRAHFPIIDDTQRGYGLIFSNGDTWKQMRRFSSLMTLR 136
138 DFGMGKRSLEERIQEEAQFLVEEFRKSEAQPFNPAVTLSCATCNIICSILFNERFHYQDKTLHSLL 335
336 DLLNENFNRISSLWNQIYNLWPKLIKPLPGEHRAFSKRLKDVHYFVLEKVKEHQKSLNH 512
513 NNPRDYIDC 539
>cattle|TC210467 68% to 2C8 Length = 1031
3 QAKVHEEIDRVIGRTRSPCMKDKMKLPYTEAVLHEIQRYVTLVPSNLPHAVVQDTKFRQY 182
183 VIPKGTTVLPLLSSILYDCKEFPNPEKFDPGHFLDKNGSFRKTKYFVAFSIGKRACVG 356
357 EGLAQMELFLFFTTILQNFVLKPLGETKDI 446
ETKPIVIGLINMPPPFKLCLIPR*
Subfamily 2D There are at least two gene sequences
>CYP2D14 TC205271 Length = 1772 78% to 2D6
MGLLSGDTLGPLA
231 VALLIFLLLLDLMHRRSRWAPRYPPGPTPLPVLGNLLQVDFEDPRPSFNQLRRRFGN 401
402 VFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRPPPAVYEHLGYGPRAEGVILARYGDA 581
582 WREQRRFSLTTLRNFGLGKKSLEQWVTEEASCLCAAFADQAGRPFSPMDLL 734
735 NKAVSNVIASLTFGCRFEYNDPRIIKLLDLTEDGLKEEFNLVRKVVEAVPVLLSIPGLAA 914
915 RVFPAQKAFMALIDELIAEQKMTRDPTQPPRHLTDAFLDEVKEAKGNPESSFNDENL 1085
1086 RLVVADLFSAGMVTTSTTLAWALLLMILHPDVQRRVQQEIDEVIGQVRRPEMGDQALMPF 1265
1266 TVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPKGTTLITNLSSVLKDETVWEKPFRF 1445
1446 HPEHFLDAQGRFVKQEAFIPFSAGRRACLGEPLARMELFLFFTSLLQHFSFSVPAG 1613
QPRPSEHGVFAFLVTPAPYQLCAVPR*
>cattle|TC205272 90% to 2D6 Length = 1045 contains intron seq at PKG motif
same seq as 2D14
3 MILHPDVQRRVQQEIDEVIGQVRRPEMGDQALMPFTVAVVHEVQRFADIVPLGLPHMTSR 182
183 DIEVQGFHIPK 215
GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSAGRRACLGEPL 821
822 ARMELFLFFTSLLQHFSFSVPAG 890
QPRPSEHGVFAFLVTPAPYQLCAVPR*
>cattle|BI849982 71% to 2D6 Length = 399 Same seq as 2D14
62 MGLLSGDTLGPLAVALLIFLLLL 130 (gap)
128 LGNLLQVDFEDPRPSFNQLRRRFGNVFSLQQVWTPVVVLNGLAAVREALVYRSQDTADRP 307
308 PPAVYEHLGYGPRAEGVILARYGDAWREQRRFSLTTLRNFGL 433
>cattle|TC205273 Length = 491 seq order jumbled same seq as 2D14
IAEQKMTRDPTQPPRHLTDAFLDEVK (gap) EPLARMELFLFFTSLLQHFSFSVPAGQPR (gap)
168 PFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPKGTTLITNLSSVLKDETVWEKPF 347
348 RFHPEHFL (gap) VTPAPYQLCAVPR*
>cattle|TC205274 Length = 639 4 diffs to 2D14
292 GTTLITNLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSASRRACLGEPL 462
463 ARMELFLFFTSLLQHFSFSVPAG 531
QPRPSDHGVFVALVTPAPYQLCAVPR*
>cattle|BE756001 Length = 494 1 diff to 2D14
1 VIGQVRRPEMGDRALMPFTVAVVHEVQRFADIVPLGLPHMTSRDIEVQGFHIPKGTTLIT 180
181 NLSSVLKDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA 303
>TC205275_1 91% to 2D6 PERF to HEME with frameshift same seq as 2D14
GTTLITNLSSVL
KDETVWEKPFRFHPEHFLDAQGRFVKQEAFIPFSA
>TC205276_1 CYP2D6like N-term 74% and C-helix 85% fragments 3 diffs to 2D14
155 MGLLSGDTLGPLAVALLIFLLLLDLRHRRSRWAPRYPPGPTPLPGLGNLLHVDFEDPRPSFNQ 343
(gap)
343 GVILARYGDAWREQRRFSLTTLRNFGLGKKSLEQWVTEEASCLCAAF 483
>cattle|CB434719 71% to 2D6 Length = 462 aa 107-256 86% to 2D14 this is a different seq
23 LYKHLGFGPRAEGVILARYGNAWREQRRFSLSTLRNFGLGKKSLEQWVTEEASCLCAAFADQAGHPFSPM 218
219 DLLNKAVSNVIASLTFGCRFEYNDPRIVKLLDVMEDGLKEEMKIMRQVVEAVPVLLSIP 395
RLAAKVVPGQKAFMTLVDELI 472
>AW485272 78% to 2D6 same seq as 2D14
192 PTQPPRHLTDAFLDEVKEAKGNPESSFNDENLR 290
Subfamily 2E probably just one gene
>cattle|TC219673 74% to 2E1 Length = 625 same as 2E1
MAALGITVALLVWMATLLFISIWKHIYSSW
106 KLPPGPFPLPIIGNLLQLDIKNIPKSFTRLAERYGPVFTLYLGSQRAVVVHGYKPVKEVL 285
286 LDYKNEFSGRGE 321
>cattle|TC189660 73% to 2E1 Length = 779 same as 2E1
PGFQMHKNNGIIF
40 NNGSTWRDTRRFSLTTLRDLGMGKQGKEQRIQREAHFLLEVLKKTQGQPFD 192
193 PTFVVGFAPYNVISDILFHKRFDYKDQTSLRLMSLFNENFYLLSSPWIQLYNNFPDYLQY 372
373 LPGSHRKLLKNVSEVKSYALERVKDHQKSLEPSC
>BZ845111 CH240_277C23 Length = 808 78% to 2E1 may be a different gene or a pseudogene
681 KRVCVGEGLARMELFLLLVSILQKFTLKPLVDPKTLIL 794
>CYP2E1 TC189658 = AJ001715 79% to human 2E1 Length = 1900
MAALGITVALLVWMATLLFISIWKHIYSSW
111 KLPPGPFPLPIIGNLLQLDIKNIPKSFTRLAERYGPVFTLYLGSQRAVVVHGYKPVKEVL 290
291 LDYKNEFSGRGENPGFQMHKNNGIIFNNGSTWRDTRRFSLTTLRDLGMGKQGN 449
450 EQRIQREAHFLLEVLRKTQGQPFDPTFVVGFAPYNVISDILFHKRFDYKDQTSLR 614
615 LMSLFNENFYLLSSPWIQLYNNFPDYLQYLPGSHRKLLKNVSEVKSYALERVKDHQKSL 791
792 EPSCPRGFLDTMLIEMAKERHSVDPMYTLENIAVTVADLLFAGTETTSTTLRYGLLI 962
963 LMKYPEVEEKLHEEIDRVIGPSRIPAVKDRLDMPYLDAVVHEIQRFIDLLPSNLLHEATQ 1142
1143 DTVFRGYVIPKGTVVIPTLDSVLHDRQEFPEPEKFKPEHFLNENGKFKYSDHFKAFSA 1316
1317 GKRVCVGEGLARMELFLLLAAILQHFNLKSLVDPKDIDLSPI 1442
AIGFGKIPPRYKLCLIPRSKV*
Subfamily 2F probably just one gene
>AJ459276 65% to 2F1 aa 224-274
307 MYNIFPNLLDWVPGPHRRLFKNYGRIKDIIARSVREHQASLDPNSPRDFIDCFLTKMAQ 131
>CC540963 Bac end 81% to 2F1 aa 324-383
3 VQEEIDRVVGHERLPTVEDRAAMPYTDAVIHEVQRFADVIPMSLPHRVTRDTNFRGFTIPR 185
>cattle|TC206020 94% to 2F1 Length = 3062 aa 384 to end
GTDVITLLNT
2561 VHYDPSQFLKPKEFNPEHFLDANMSFKKSPAFMPFSAGRRLCLGEALARMELFLYLTA 2388
2387 ILQSFSLQPLGAPEDIDLTPL 2325
SSGLGNLPRPYQLCVLAR*
Subfamily 2J There are at least 4 genes see alignment
>cattle|BM258720 85% to 2J2 Length = 435
30 HKGNATSCFHEENLIYNTLDLFFAGTETTSTTLRWGLLYMALYPEIQEKVQAEIDRVLGQSQK 218
219 PSMAARESMPYTNAVIHEVLRMGNILPLNVPREVTVDTVLAGYRLPKGTMVTTNLTALHR 398
399 DPAEWATPDTFN 434
>cattle|TC206739 TC206740 80% to 2J2 Length = 754 mid to PERF
3 IICSITFGERFDYQDDQFQELMRLLDDVTYLETTVWCQLYNVFPRIMNFLPGPHQMLFSN 182
183 WRKLKMFVARVIENHKRDWNPAEARDFIDAYLQETEKHKGNAASSFHEENLIYNTLD 353
354 LFFAGTETTSTTLRWGLLYMALYPEIQEKVQAEIDKVLDESQQPSMATRESMPYTNAVIH 533
534 EVQRMGNILPLNVPREVTVDTVLAGYHLPKGTMVLTNLTALHRDPAEWATPDTFNPEHFL 713
ENGQFKKREAFLP 752
(gap)
GKRMCLGEQLARTELFIFFTSLLQKFTFRPPEHEELSLKFRMGLTLS
>cattle|TC206738 79% to 2J2 Length = 498
2 RLLDEVLNLHTSLCCQLYSV
62 FPRIMNFVPGPHQTLFSNSEKLKMFVAEMIENHKRDWNPAEARDFIDAYLQEIEKHKG 235
236 GDASSFHEENLIYSTLNLFLAGTETTSTSLRWGLLYMALNPEIQEKVQAEIDRVLGQSQ 412
413 QPSTAARESMPYTNAVIHEVLRMGNIIP 496
>cattle|TC206737 84% to 2J2 Length = 1031 C-term half
2 FIDAYLQEIEKHKG
44 NATSSFDDENLICSTLDLFLAGTETTSTTLRWGLLFMALNPEIQEKVQAEIDRVLGQSQK 223
224 VSTASRESMPYTNAVIHEVQRMGNIVPMNVPREVTVDTVLAGYHLVKGTMVLTNLTALHR 403
404 DPAEWATPDTFNPEHFLENGQFKKRESFLPFSIGKRMCLGEQLARTELFIFFTSLLQ 574
575 KFTFRPPENEKLSL 616
KFRESLTSSPASYRLCAIPRA*
>cattle|TC206741 CB465183 66% to 2J2 Length = 727
26 MLEALGSLVAALWTTLRPGIV
89 LLGAFVFLLFADFLKRQHPKNYPPGPLRLPFIGNFFHLDLGKGILVPQQVVKK 247
248 YGNIIKLDFGVIHFIVITGLPYIKEALVNQEQNFVNRPMIPLQKHIFNNKGLVRSNGH 421
422 VWKEQRKFTLTTLKNFGLGKKSLKERIQEEVTYLIQAIREENGQPFDPHFI 574
575 INNAVSNIICSITFRERFDYNDDQFQELLRLLDEILCIQASVCCQLYNAFP 727
RIMNXLPGSHHTLFRKWEKLKMFVANVIENHRKDWNPAEARDFIDAY
>cattle|BE480225 70% to 2J2 Length = 517 N-term
MLEALSSLAAALWAALRPGTV
77 LLGAVAFLFFADFLKRRRPKNFPPGPAGLPFVGNSFQLDPEKVHLTLQQFVKKY 238
239 GNVFSLDFGTFPSILITGLPLIKEALVHQGENFSKRPVMPLQERIFNTKGLIMSSGHI 412
413 WKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQ 517
>cattle|BI541543 71% to 2J2 Length = 573 N-term missing about 5 aa
SSLATGLWAALRPDTV
50 LLGTLAFLLFVDFLKRRHPKNYPPGPPGLPFVGNLFXLDPEKVPLVLHQFVKK 208
209 YGNVFSLDFGTVPSVLITGLPLIKEVLVHQGQIFSNRPIVPLQEHIINNKGLIMSSGQ 382
383 LWKEQRRFALTTLRNFGLGKKSLEERIQEEASYLIQTIREENGQPFDPHLT 535
536 INNAVSNIICSI 571
>cattle|BE484578 73% to 2J2 Length = 481 N-term
MLEALGSLAAALWAALRPGTV
73 LLGAIVFLLLTDLLNRRRPKNYPPGPPRLPFVGNFFQLDFEQGHLSLQRFVKKY 234
235 GNLFSLEFGDLPSVVITGLPLIKEVLVYQDQNFVNRPISPIRERVFKKNGLIMSNG 402
403 HIWKEQRRFSLTALRNFGLGRKSLEE 470
>cattle|TC198485 85% to 2J2 Length = 748
GTMVMTNLTALHRDPTEWATPDTFNPEHFLENGQFKKRESFLPFSIGKRMC 197
198 LGEQLARTELFIFFTSLLQKFTFRPPENEQLSLKFRVSLTLAPVSHRLCAVPRG* 362
Subfamily 2S probably just one gene
>cattle|TC215382 72% to 2S1 Length = 250
4 PCPLGALSLQPAISGLFNIPQAFQLQFRP 90
Subfamily 2T probably just one pseudogene
>cattle|BM256543 BE749249 61% to 2T2P
GAPFDPQRLLDNAVSNVICSVVLGNHYGYEDMEFLRLLDLFNDNFRIMSSRWGE
51 GTENPESHFQAETLAMTMHNLFFG 122
121 VVETTSTTLRYGLILLKYSFVA 186
280 AKVQAELDDMVGRMCAPTLEDREHLPYTKTVLHEIQCFISVVPFGLPSALTCDTHL 447
448 RGYFLPKXXXXVPLSLGILHR 498
>cattle|BM285833 59% to 2T2P Length = 410 C-term
3 ECFNPTNFL 29
142 IPMSPGKGTCLGAGLAPTDIFLFLTSILLRFFLLPVGSHSDTDLTPQCTGLGNVPPAFQL 321
322 RLVAR 336
Subfamily 2U probably just one gene
>cattle|BE590124 88% to 2U1 Length = 151 I-helix
3 NSGFDEDYLFYIIGDLFIAGTDTTTNSLLWCLLYMSLHPNIQGNKQMFS 149
Subfamily 2W probably just one gene
>cattle|TC188265 83% to 2W1 Length = 564 aa179-209
147 TFTLLFGQRFDYRDPVFLSLLGLVDDVMVLL 52
Subfamily 2AB probably just one gene/pseudogene
>cattle|BI535102 73% to 2AB1P Length = 539
174 SYTSSSLQGTIILPNLAAVLCDPECWRTSRQFNPGHFLDKDGNFVVRDIFPPFSAGHQMC 353
354 LGD*LAQMKLLLMFATLLGTFSFQLPGRSPGLRLEYNFGGTRKPLPQKIYAVSR 515
Subfamily 2AC probably just one pseudogene
>BZ940825 genomic clone CH240_90L20 Length = 538 70% to 2AC1P K-helix
303 YPEKVHDEIAKVMGSTQP*MAH*TQMPYTDAVILEVQRFADILPTGLPRATTTNTIFKNNYIPK 494
>CC910297 t060j19ba.f1 genomic clone t060j19ba Length = 772 58% to 2AC1P
97 GIFFSHSDTSKIIRFTLTTSRNFGMGKNALEDTIIGESQHLTRNFETDKG
Family CYP3 There are at least 3 genes in this family
>cattle|TC219554 73% to 3A4 Length = 563 N-term
MELILSFSTETWVLLATGLVLLYLYGTYSYGLFKKLGV
201 PGPRPLPYFGNILSYRKGVCEFNEECFKKYGKIWGIFEGKQPLLVITDPDMIKTVLVKE 377
378 CYSVFTNRRVFGPSGVMKNAISVAEDEQWKRIR 476
TLLSPTFTSGKLKEMFPIIGKYGDVLVRN
>cattle|TC192142 77% to 3A4 Length = 1019
FVENVKKLLRFSILDPFLLAVVLFPFL
83 VPILDVLNITIFPKSAVNFFTKSVKRIKESRLKDNQKPRVDFLQLMINSQNSKETD 250
251 NHKALSDQELIAQSIIFIFAGYETTSSTLSFLLYILATHPDVQQKLQEEIDATFPNK 421
422 APPTYDVLAQMEYLDMVVNETLRMFPIAIRLERLCKKDVEIHGVSIPKGTTVMVPISVL 598
599 HKDPQLWPEPEEFRPERFSKKNKDSINPYVYLPFGTGPRNCIGMRFAIMNMKLAIVRV 772
773 LQNFSFKPCKETQIPLKISSQGVLRPEKPVVLKVVLRDGTISGA*
>CYP3A28 TC211564 Y10214 73% to 3A5 Length = 1576
MELIPSFSMETWVLLATSLVLLYIYGTYSYGLFKKLGIPGPRPV
PYFGSTMAYHKGIPEFDNQCFKKYGKMWGFYEGRQPMLAITDPDIIKTVLVKECYSVF
TNRRIFGPMGIMKYAISLAWDEQWKRIR
TLLSPAFTSGKLKEMFPIIGQYGDMLVRNLRKEAEKGNPVNMKDMFGAYSMDVITGTAFG
VNIDSLNNPHDPFVEHSKNLLRFRPFDPFILSIILFPFLNPVFEIL
320 NITLFPKSTVDFFTKSVKKIKESRLTDKQMNRVDLLQLMINSQNSKEIDNHKALSDIE 493
494 LVAQSTIFIFGGYETTSSTLSFIIYELTTHPHVQQKLQEEIDATFPNKAPPTYDALVQM 670
671 EYLDMVVNETLRMFPIAGRLERVCKKDVEIHGVTIPKGTTVLVPLFVLHNNPELWPEPE 847
848 EFRPERFSKNNKDSINPYVYLPFGTGPRNCLGMRFAIMNIKLALVRILQNFSF 1006
KPCKETQIPLKLYTQGLTQPEQPVILKVVPRGLGPQVEPDFL*
>cattle|TC192141 77% to 3A4 Length = 1131
TSVDMKEVFGAYSMDVITSTSFGVNIDSLGNPQDPFVENAKKLLRFDILDPFLLSVVLFPFL
188 IPIFEVLNISIFPKSAVNFLTTSVKKIKESRLKDTQKPRVDFLQLMINSQNSKETD 355
356 NHKALSDQELMAQSIIFIFGGYETTSTSLSFIIYELATHPDVQQKLQEEIDATFPNK 526
527 APPTYDVLAQMEYLDMVVNETLRMFPIAVRLERFCKKDVEIHGVSIPKGTTVTVPISVL 703
704 HRDPQLWPELEEFRLERFSKKNKDSISPYVYLPFGTGPRNCIGMRFAIMNMKLAVVRV 877
878 LQNFSFKPCKETQIPLKIK 934
SQGLLRPEKPIFLKVVLRDETISGA*
Family CYP4
Subfamily 4A at least two genes
>cattle|TC188271 TC188272 TC188270 79% to 4A11 Length = 597 one diff to TC188272 probably same seq
3 EGQKWFQHRRMLTPAFRYDILKAYVGIMADSVRVMLDKWEELVSQDSHLEIFGHVSLMT 179
180 LDTIMKCAFSHQGSVQMDRSSQSYI
1 QAIRDLSHLIVSRLRNAFHQNDLIYRLTPEGRWNHRACQLTHQHTDAVIKERKAHLQKEG 180
181 ELEKVRSRRHLDFLDILLLARMENGSSLSDEDLRAEVDTFMFEGHDTTASGISWILYALA 360
361 SHPEHQQRCREEIQSLLGDGASITWDHLDQMRYTTMCIKEAMRLYPPVPFIGRELRKPIT 540
541 FPDGRSLPAGILVSLSFYGLHHNPNVWPNPEVFDPTRFSPGSTQHSYAFLPFSGGSRNCI 720
721 GKQFAMNELKVAVALTLLRFELSPDPSRVPVPTPIMVLRSKNGIHLQLKKLSDPGLL*
>cattle|TC188268 TC188267 72% to 4A11 Length = 1060
29 MSVSALSPSRALGGVSGLLQVVSLLGLVLLLIKAAQLYLRRQWLLKALHHFPSPPSHWFY 208
209 GHKRE
224 FQEEGELPHLLKRVEKYPRACVRWMWGTRALLLVYDPDYMKMVLGRSDPKAQIIHRFVKP 403
404 WIGTGLLLLEGQTWFQHRRMLTPAFHYDILKPYVGIMADSVRVMLDKWEELVSQDSHLE 580
581 IFGHVSLMTLDTIMKCAFSQQGSVQTDRNSQSYIQAIKDVSHLIISRLRNAFHQNDLIYR 760
761 LTPEGHWNHRACQLAHQHTDAVIKERKVRLQKEGELEKVRSRRHLDFLDILLFA 922
RMENGSSLSDEDLX
AEVDTFMFEGHYTTASGISWILYALASHPEHQQRCREEIQSLLADGASITWDHLDQMPYT
TMCIKEAMRLYPPVPVISRELSKPITFPDGRSLPAGILVSLSIYGLHHNPKVWPNPEVFD
PTRFAPGSTRHSHAFLPFSGGSRNCIGKQFAMNELKVAVALTLLRFELSPDSSRVPVPMP
VIVLRSKNGIHLQLRKLSDPGT*
Subfamily 4B probably a 4B1 and a 4B2
>CYP4B2 TC209863 82% to 4B1 Length = 1462 BF602740 94% to goat 4B2 only 82% to human 4B1
YDFFLQWIGKGLLVLQGPKWFQHRKLLTPGFHYDVLKPYVALFAESTRVMLDKWEKKARE
QKSFDIYSDVGHMALDSLMKCTFGKGTSGLNDRDNNYYLSVKELTLLMQQRIDSFQYHND
FIYFLTPHGRRFLRACHVAHDHTDQVIRERKAALQDE
474 KERERIQSKRHLDFLDILLGAWDEEGIKLSDEDLRAEVDTFMFEGHDTTTSAI 632
633 SWVLYCMSLYPEHQRRCREEIQEILGDRDTLKWDDLAEMTYLTMCIKESFRLYPPVPQVY 812
813 RQLSQPVNFVDGRSLPEGSLISLHIYALHRNSTVWPDPEVFDPLRFSPENVAGRHSFA 986
987 FIPFSAGPRNCIGQQFAMAEVKVVTALCLLRFEFS 1091
PDPSRLPIKMPQLVLRSKNGIHLHLKPLGPGSEK*
>CYP4B1? AW347853 89% to 4B2 Length = 181
3 DGRSLPAGSLISLHIYALHRNSAVWPDPEVFDPLRFSLENMAGRHPFAFLPFSAGPRNC 179
Subfamily 4F
>cattle|TC201350 80% to 4F2 Length = 543
1 PNFVAPLLQASATIIPKDMFFYSFLKPWLGDGLLLSAGDKWSSHRRLLTPAFHFEILKPY 180
181 MKIFNKSADIMHAKWQRLALEGSTRLDMFEHISLMTLDSLQKCVFSYDSNCQEKPSEY 354
355 IAAILELSALVMKRIKHIFLHVDFLYYLTRDGQRFYRACRLVHDFTDAIIQKRRRTLISQGS 540
>cattle|CB440231 75% to 4F2 Length = 344
343 DPFRFEPENIKGRSPLAIMPFSVGRRMCIGQTFAMTQMKVVLALTVLRFRVLPGEEPRR 167
166 KPELILRAEGGLWLRVEPLS 107
>cattle|TC215358 86% to 4F2 Length = 926
1 DLLRDRESKEIEWDDLAHLPFLTMCIKESLRVHPPVTSISRRCTQDIVLPDGRVIPKGVVCLID 194
195 IFGTHHNPSVWQDPEVYDPFRFDPENIKGRSPLAFIPFSAGPRNCIGQTFAMTEMKG 365
366 ILALTLLRFRVLPDKEPCRKPELILRTEGGLWLRVEPLSASQQ*
>cattle|TC213582 68% to 4F12 Length = 573 N-term
MLELSLSRLGLGPLAASPWLLPLLAGVSWILARVLAWTYTFYNNSRRLRCF
241 LQPPKPNWFLGHMNLVPSTEQGLIYFTQMAANYPRGYLIWFGPIIPMVIFCHPDMLRS 414
415 ITNASAAIAPKDMQFYGTLKPWLGDGLLLSAGDKWSSHRRMLTPAFHFNILKP 573
Subfamily 4V probably just one gene
>cattle|TC207398 82% to 4V2 Length = 913 N-term
2 AVSLAGAT
26 LTLNLLKMVASYARKWRQMRPVPTIGDPYPLVGHALMMKSDARDFFQQIIDFTEECR 196
197 HLPLLKLWLGPVPLVALYNAETVEVILSSSKHIEKSYMYKFLEPWLGLGLLTSTGNKW 370
371 RSRRKMLTPTFHFTILEDFLDV MNEQANILVTKLEKHVNQEAFNCFFYVTLCTLDIIC 544
545 ETAMGKNIGAQRNDDSEYVRAVYRMSDSIHQRMKMPWLWLDLIFYMFKNGREHRRSLKLN 724
>cattle|TC207399 63% to 4V2 Length = 799
1 HRRSLKIVHDFTNNVITERANEMKRHEEGTSNDKEKDFPPRKTKCRAFLDLLLNVTDDQGNKLSH 195
196 EDIREEVDTFMFE 234
>cattle|TC207400 72% to 4V2 Length = 788 aa 166-425
MNEQANILVTKLEKHVNQEAFNCFFYVTLCTLDIICETAMGKNIGAQRNDDSEYVRAVYR
MSDSIHQRMKMPWLWLDLIFYMFKNGREHRRSLKIVHDFTNNVITERANEMKRHEEGTSN
DKEKDFPPRKTKCRAFLDLLLNVT
433 DDQGNKLSHEDIREEVDTFMFEGHDTTAAAINWSLYLLGWYPEVQQKVDTELEEVFGKS 609
610 DRPVTLEDLKKLKYLDCVIKESLRLFPSVPFFARNLTEDCEVAGHKMVQGCQVIIVPYAL 789
>cattle|TC189439 89% to 4V2 Length = 1378
GSSRDPKYFPDPE
42 EFKPERVFPENLKGRHTYAYVPFSAGPRNCIGQKFAIMEEKTILSCILRHFWVESNQKRE 221
222 ELGLAGELILRPSNGIWIKLKRRNTDES* 308
>4V2 ortholog (77%) built from TC207398 TC207399 TC207400 TC189439 missing 20 aa at N-term
AVSLAGATLTLNLLKMVASYARKWRQMRPVPTIGDPYPLVGHALMMKSDARDFFQQIIDFTEECR
HLPLLKLWLGPVPLVALYNAETVEVILSSSKHIEKSYMYKFLEPWLGLGLLTSTGNKW
RSRRKMLTPTFHFTILEDFLDV
MNEQANILVTKLEKHVNQEAFNCFFYVTLCTLDIICETAMGKNIGAQRNDDSEYVRAVYR
MSDSIHQRMKMPWLWLDLIFYMFKNGREHRRSLKIVHDFTNNVITERANEMKRHEEGTSN
DKEKDFPPRKTKCRAFLDLLLNVT
DDQGNKLSHEDIREEVDTFMFEGHDTTAAAINWSLYLLGWYPEVQQKVDTELEEVFGKS
DRPVTLEDLKKLKYLDCVIKESLRLFPSVPFFARNLTEDCEVAGHKMVQGCQVIIVPYAL
GSSRDPKYFPDPE
EFKPERVFPENLKGRHTYAYVPFSAGPRNCIGQKFAIMEEKTILSCILRHFWVESNQKRE
ELGLAGELILRPSNGIWIKLKRRNTDES*
Subfamily 4X probably just one gene
>cattle|AW312765 67% to 4X1 Length = 463
55 RDGKMEILEKIVEKYPCAFSRWIGPFQAFLYIYDPDYAKTFLSRTDPKSNFLYKFMTASV 234
235 GKGLVNLSGPKWSQHRRLLTPGFHFNTLKSFVEVMAQSVNIMLNKWEKICGSQNTLLDIY 414
415 EHINLMTLDVLMKCIF 462
Family CYP5 just one gene
>CYP5A1 TC191301 76% to CYP5A1 Length = 1926
94 MEVLGLFRLEVSGPMVTVALSVVFLALLKWYSTSAFSRLEKLGIRHPKPSPFIGNLAFFRQ 276
277 GFWESHMELRKQYGPLSGYYLGRLMFIVISEPDMIEQVLVEKFSNFTNRMAT 432
433 GLEPKPVADSVLFLRDKRWEEVRSVLTVAFSPEKLSEMTPLISRACDVLLAHLERH 600
601 AQSGEAFDIQKTYCCYTTDVVASVAFGTEVNSQEAPEHPF 720
VEHCRRFFASSIPKPLLVLLLSFPSIMVPLARILPNKNRDELNGFFNKLIRNVIALRDQQ
AAEERRRDFLQMVQDVRHSAATVGVENFDIVRQVFSATKCPANPPRRHSPRPLSKP
1069 LSVDEVVGQAFIFLIAGYEIVTNTLSFATYLLATNPECQEKLLEEVDCFSKEHLAPEYCS 1248
1249 LQEGLPYLDMVIKETLRMYPPAFRFTRVAAQDCEVLGQRIPAGAVLETAVGALHYDPE 1422
1423 HWPNPENFNPERFTAEAQQRRRPYTYLPFGAGPRSCLGVRLGLLELKLTLLHILRKFR 1596
1597 FEACPETQVPLQLESKSALGPKNGVYIRIVSR*
Family CYP7
Subfamily 7A probably just one gene
>cattle|CB421103 92% to 7A1 Length = 707
60 LKSLDSIIKESLRLSSASLNIRTAKEDFTLHLQDGSYNIRKDDIIALYPQLMHLDPEIYP 239
240 DPLTFKYDRYLDENGKTKTTFYSNGLKLKYYYMPFGSGVTICPGRLFAVQEIKQFLILML 419
420 SYFELELVESCVKCPPLDQSRAGLGILPPLYDTEFRYKFKHS* 548
Subfamily 7B probably just one gene
>BZ933530 77% to 7B1 aa 144-287
3 FLQGKHLDILMESTMQNLKQVFEPQLLKTTSWSTEYLLPFCNSVIFEMTFTTIYGNILAX 182
183 XXKTFITELKDDFLKFDEKFTRLASGIPIELLGNIKSVRTKLIKDLTIESLAKLQGMSEV 362
363 VQRRNDILEKYYTPKDTEIGGKKL 434
Family CYP8
Subfamily 8A probably just one gene
>CYP8A1 TC208596 = D30718 89% to 8A1 Length = 2495
30 MSWAVVFGLLAALLLLLLLTRRRTRRPGEPPLDLGSIPWLGHALEFGKDAAGFLTRMKEK 209
210 HGDIFTVLVGGRHVTVLLDPHSYDAVVWEPRSRLDFHAYAVFLMERIFDVQLPHYNPGDE 389
390 KSKMKPTLLHKELQALTDAMYTNLRTVLLGDTVEAGSGWHEMGLLEFSYGFLLRAGYLTQ 569
570 YGVEAPPHTQESQAQDRVHSADVFHTFRQLDLLLPKLARGSLSAGDKDRVGKVKGRLWKL 749
750 LSPTRLASRAHRSRWLESYLLHLEEMGVSEEMQARALVLQLWATQGNMGPAAFWLLLFLL 929
930 KNPEALAAVRGELETVLLGAEQPISQMTTLPQKVLDSMPVLDSVLSESLRLTAAPFITRE 1109
1110 VVADLALPMADGREFSLRRGDRLLLFPFLSPQKDPEIYTDPEVFKYNRFLNPDGSEKKDF 1289
1290 YKDGKRLKNYSLPWGAGHNQCLGKGYAVNSIKQFVFLVLTQFDLELITPDVDIPEFDLSR 1469
1470 YGFGLMQPEHDVPVRYRIRP* 1532
>cattle|BE721760 90% to 8A1 Length = 510 N-term same seq as 8A1
RRPGE
289 PPLDLGSIPWLGHALEFGKDAAGFLTRMKEKHGDIFTVLVGGRHVTVLLDPHSYDAVVWE 468
469 PRSRLDFHAYAVFL 510
Subfamily 8B probably just one gene
>cattle|BF074464 87% to 8B1 Length = 545 aa 27-207
3 QRRPKEPPLDKGPVPWLGHAMAFRKNMFEFLRHMQAKHGDIFTVQLGGQYFTFVMDPLSF 182
183 GPILKDAQRKLDFVEYAQKLVLKVFGYRSVQGDYRMIHSASTKHLMGEGLEELNKVMLDT 362
363 LSLVMLGPIGPSLGTHHWREDGLFHFCYNILFKAGYLSLFGYTKDKEQDLLQAEELFLEF 542
543 R 545
Family CYP11
Subfamily 11A probably just one gene
>CYP11A1 TC189227 78% to 11A1 Length = 1959
MLARGLPLRSALVKACPPI
LSTVGEGWGHHRVGTGEGAGISTKTPRPYSEIPSPGDNGWLNLYHFWREKGSQRIHFRHI
ENFQKYGPIYREKLGNLESVYIIHPEDVAHLFKFEGSYPERYDIPPWLAYHRYYQKPIGV
LFKKSGTWKKDRVVLNTEVMAPEAIKNFIPLLNPVSQDFVSLLHKRIKQQGSGKFVGDIK
EDLFHFAFESITNVMFGERLGMLEETVNPEAQKFIDAVYKMFHTSVPLLNVPPELYRLFR
TKTWRDHVAAWDTIFNKAEKYTEIFYQDLRRKTEFRNYPGILYCLLKSEKMLLEDVKANI
TEMLAGGVNTTSMTLQWHLYEMARSLNVQEMLREEVLNARRQAEGDISKMLQMVPLLKAS
IKETLRLHPISVTLQRYPESDLVLQDYLIPAKTLVQVAIYAMGRDPAFFSSPDKFDPTRW
LSKDKDLIHFRNLGFGWGVRQCVGRRIAELEMTLFLIHILENFKVEMQHIGDVDTIFNLI
LTPDKPIFLVFRPFNQDPPQA*
Subfamily 11B there is evidence for at least 2 11B genes expressed in cattle
>CYP11Bx TC192860 74% to 11B2 73% to 11B1 Length = 1786
MALWAKARVRMAGPWLSLHEARLLGTRGAAAPKAVLPFEAMPRCPGNKWMRMLQIWKE
QGSENMHLDMHQTFQELGPIFRYDVGGRHMVFVMLPEDVERLQQADSHHPQRMILEPWLA
YRQARGHKCGVFLLNGPQWRLDRLRLNPDVLSLPALQKYTPLVDGVARDFSQTLKARVLQ
NARGSLTLDIAPSVFRYTIEASTLVLYGERLGLLTQQPNPDSLNFIHALEAMLKSTVQLM
FVPRRLSRWMSTNMWREHFEAWDYIFQYANRAIQRIYQELALGHPWHYSGIVAELLMRADMTLDTIKANT
932 IDLTAGSVDTTAFPLLMTLFELARNPEVQQAVRQESLVAEARISENPQRAITELPLLRA 1108
1109 ALKETLRLYPVGITLEREVSSDLVLQNYHIPAGTLVKVLLYSLGRNPAVFARPESYHPQ 1285
1286 RWLDRQGSGSRFPHLAFGFGVRQCLGRRVAEVEMLLLLHHVLKNFLVETLEQEDI 1450
1451 KMVYRFILMPSTLPLFTFRAIQ*
Family CYP17 just one gene
>CYP17A1 TC190136 71% to Steroid 17alphahydroxylase/17,20 lyase Length = 1725
MWLLLAVFLLTLAYLFWPKTKHS
116 GAKYPRSLPSLPLVGSLPFLPRRGQQHKNFFKLQEKYGPIYSFRLGSKTTVMIGHHQLAR 295
296 EVLLKKGKEFSGRPKVATLDILSDNQKGIAFADHGAHWQLHRKLALNAFALFKDG 460
461 NLKLEKIINQEANVLCDFLATQHGEAIDLSEPLSLAVTNIISFICFNFSFKNEDP 625
626 ALKAIQNVNDGILEVLSKEVLLDIFPVLKIFPSKAMEKMKGCVQTRNELLNEILEKCQEN 805
806 FSSDSITNLLHILIQAKVNADNNNAGPDQDSKLLSNRHMLATIGDIFGAGVETTTSVIK 982
983 WIVAYLLHHPSLKKRIQDDIDQIIGFNRTPTISDRNRLVLLEATIREVLRIRPVAPTLIP 1162
1163 HKAVIDSSIGDLTIDKGTDVVVNLWALHHSEKEWQHPDLFMPERFLDPTGTQLISPSL 1336
1337 SYLPFGAGPRSCVGEMLARQELFLFMSRLLQRFNLEIP 1450
DDGKLPSLEGHASLVLQIKPFKVKIEVRQAWKEAQAEGSTP*
Family CYP19 probably just one gene and a pseudogene
>CYP19A1 TC194849 88% to 19A1 Length = 2057
MLLEVLNPRHYNVTSMVSEVVPIASIAILLLTGFLLLVWNYEDTSSIPGPSYFLGIGPLI
SHCRFLWMGIGSACNYYNKMYGEFMRVWVCGEETLIISKSSSMFHVMKHSHYISRFGSKL
GLQFIGMHEKGIIFNNNPALWKAVRPFFTKALSGPGLVRMVTICADSITKHLDRLEEVCN
DLGYVDVLTLMRRIMLDTSNMLFLGIPLDESAIVVKIQGYFDAWQALLLKPDIFFKISWLCRKYEKSV
927 KDLKDAMEILIEEKRHRISTAEKLEDSIDFATELIFAEKRGELTRENVNQC 1079
1080 ILEMLIAAPDTMSVSVFFMLFLIAKHPQVEEAIIREIQTVVGERDIRIDDMQKLKVVEN 1256
1257 FINESMRYQPVVDLVMRKALEDDVIDGYPVKKGTNIILNLGRMHRLEFFPKPNEFTLE 1430
1431 NFAKNVPYRYFQPFGFGPRACAGKYITMVMMKVVLVTLLRR 1553
FHVQTLQGRCVEKMQKKNDLSLHPDETRDRLEMIFTPRNSDKCLER*
>CYP19A1P TC215440 = Z32813 79% to CYP19 Length = 983 possible pseudogene
127 MVLEVLNPTHYDVTGMVAKGVTVASIAILLLTGFLPLVKNSKNTTSIPGPSYFLGIGPLI 306
307 SYGRFLWMGIGSA*NYYNKMYGEFMRVWTHG*ETLIIT 420 (gap)
421 LSGPGLVRMVTICADSITKHLDRLEEVCNDLGYVDVLTLMRHIMLDTSNVLFLGIPLDE 597 (gap)
596 KRGELTGQNINQCILEMTIAAPDAMSVSVFFVLFLITTHPQVEEAVMKEIQTVVGERKIK 775
776 SDDIQKLTVVENFINESM*YQPVMDLVMSKALEDEVIDGYPVKKGTNIILNLGRMHRLEF 955
956 FPKPNKFT 979
Family CYP20 just one gene
>CYP20A1 TC207993 86% to CYP20 Length = 1564
MLDFAIFAVTFLLALVGAVLYLYPASRQAAGIP
GITPTEEKDGNLPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINP
NKTLDPFETMLKSLLRYQSDSGNVSENHMRKKLYENGVTNCLRSNFALLIKLSEELLDKW
LSYPESQHVPLCQHMLGFAMKSVTQMVMGSTFEDEQEVIRFQKNHGTVWSEIGKGFLDGS
LDKSTTRKKQ
752 YEDALMQLESILKKIIKERKGRNFSQHIFIDSLVQGNLNDQQILE 886
887 DTMIFSLASCMITAKLCTWAVCFLTTYEEIQKKLYEEIDQVLGKGPITSEKIEELRY 1057
1058 CRQVLCETVRTAKLTPVSARLQDIEGKIDKFIIPRETLVLYALGVVLQDPGTWSSPYKF 1234
1235 DPERFDDESVMKTFSLLGFSGTRECPELRFAYMVTAVLLSVLLRRL 1372
HLLSVEGQVIETKYELVTSSKEEAWITVSKRY*
Family CYP21 probably just one gene (may be a pseudogene with it like humans)
>CYP21A1 TC209496 79% to CYP21A2 Length = 2158
MVLAGLLLLLTLLSGAHLLWGRWKLRNLHLP
234 PLVPGFLHLLQPNLPIHLLSLTQKLGPVYRLRLGLQEVVVLNSKRTIEEAMIRKWVDFAG 413
414 RPQIPSYKLVSQRCQDISLGDYSLLWKAHKKLTRSALLLGTRSSMEPWVD 563
564 QLTQEFCERMRVQAGAPVTIQKEFSLLTCSIICYLTFGNKEDTLVHAFHDCVQDLMKT 737
738 WDHWSIQILDMVPFLRFFPNPGLWRLKQAIENRDHMVEKQLTRHKESMVAGQWRD 902
903 MTDYMLQGVGRQRVEEGPGQLLEGHVHMSVVDLFIGGTETTASTLSWAVAFLLHHPEIQ 1079
1080 RRLQEELDRELGPGASCSRVTYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSIF 1259
1260 GYDIPEGMVVIPNLQGAHLDETVWEQPHEFRPDRFLEPGANPSALAFGCGARVC 1421
1422 LGESLARLELFVVLLRLLQAFTLLPPPVGALPSLQPDPYCGVNLKVQPFQVRLQ 1583
PRGVEAGAWESASAQ*
>cattle|CB170517 80% to 21A2 Length = 640 same seq as CYP21A1
605 DRFLEPGANPSALAFGCGARVCLGESLARLELFVVLLRLLQAFTLLPPPVGALP 444
443 SLQPDPYCGVNLKVQPFQ
Family CYP26
Subfamily 26A probably just one gene
>cattle|TC196927 98% to 26A1 Length = 654 C-term
3 LEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVADIFT 182
183 NKEEFNPDRFLLPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLN 362
363 GPPTMKTSPTVYPVDDLPARFTRFQGEI* 449
Subfamily 26B probably just one gene
>cattle|TC200808 95% to 26B1 Length = 762 cannot fill in missing seq
(missing 221 aa N-term)
FSLPVDLPFSGYRRGIQARQTLQKG
77 LEKAIREKLQCTQGKDYSDALDIFIESSKEHGKEMTMQELKDGTLELIFAAYAT 238
239 TASASTSLIMQLLKHPAVLEKLREELRAKGLLHSGGCPCEGTLRLDTLSGLHYLDCVIKE 418
419 VMRLFTPVSGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFG 592
593 QARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVL 700
AVELASTSRFELATRTFPRK
(missing 38 aa C-term)
Subfamily 26C probably just one gene
>cattle|BE749195 CC773722 95% to 26C1 Length = 465 N-term
67 MLPWGLSCLSALGAVG
115 TALLGAGLLLSLAQHLWTLRWTLSRDRASALPLPKGSMGWPFFG 246
247 ETLHWLVQGSRFHSSRRERYGTVFKTHLLGRPVIRVSGAENVRTVLL
GEHRLVRSQWPQSAHILLGSHTLLGA 465
Family CYP27
Subfamily 27A probably just one gene
>cattle|TC206580 82% to 27A1 Length = 1614 missing about 58aa at N-term BF605237
ELSGPGQLRLLFQLLVQGYVLHLHQLQVLNKAKYGPIWINRVGPQMHVHLASAPLLEQVM
RQEGKYPVRDDMKLWKEHRDQQGLSYGPFTTMGEQWYRLRQTLNQRMLKPAEAALYTDAL
NEVINDFMDQLKQLRAESASGDHVPDIAHQFYFFALEAISYILFEKRIGCLERSIPKDTE
TFVRSVGLMFHNSLFVTFLPTWTRPLLPFWKRYLDGWNTIFSFGKKLIDQKLEEIEAQLK
TENPEKTQISGYLHFLLTSG
782 QLSPREAEGSLPELLLAGVDTTSNTLTWALYHLSKNPEIQAALHKEVVGVVPAGQVPQHK 961
962 DLARMPLLKAVLKETLRLYPVVPVNSRVVVDKEIEVGGFLFPKNTQFVLCHYVISRDPDI 1141
1142 YPEPDSFQPQRWLRKNQPDALKTQHPFGSVPFGYGVRACLGRRIAELEMQLLLTRLIQHYE 1324
VVLAPETGEVTSVARIVLVPNKKVGLRFLQRQS*
Subfamily 27C probably just one gene
>cattle|BE723057 86% to 27C1 Length = 519
297 LQQKHTREYGKIFKPHFGPQFVVSVADRDLVAQVLRAEGASPQRANMGSWQEYRDLRXRS 476
477 TGLISA 494
Family CYP39 just one gene
>cattle|TC197782 CB448362 79% to 39A1 Length = 911
192 MEFISPTVIIILSCVAVLLFLQWKN
267 LRRPPCIRGWIPWIGAGFEFGKTPLEFIEKARIKYGPVFTVIVMGTRMTFVTEEEGINVF 446
447 LKSKEINFELAVQNPVYHTASIAKNIFLKLHEKLYITVKGKMGIFNLYKFTG 602
603 QLTEELQEQLQNLGTHGTTDLNKFMRHLLYPVTVNILFKKGLFPTDERKIREFHQHFQAY 782
783 DEGFEYGSQLPECLLRNWSKSKKWLLAL 866
FEKNIPDIKTHKSAK 910
(gap of 36 aa)
582 NTVPVAFWTFAFVLSHPNIHRTIMEGISSVFGTAGKDKIKVSEDDLKKLPLIKWCILET 406
405 IRLRAPGVIARKVLKPVKILDYTVPSGDLLMLSPFWLHRNPKYFPEPDLFKP 232
(missing PERW to end, about 82 aa)
Family CYP46 just one gene
>cattle|AV663682 94% to CYP46 Length = 506 AV663681 CC523468 Bac end C-term
(missing 111 aa N-term)
19 IQTVFGERLFGQGLVSECXYERWHKQRRIMDLAFSRSSLVGLMGTFNEKAEQLVEILEAQ 198
199 ADGQTPVSMQDMLTCATMDILAKAAFGMETSMLLGAQKPLSRKVKLILEGISASRNTLAK 378
379 FMPGKWKQLXETRESVRFLRQVGKEWVXRXRXALQRGEDVPADILTQILKAEEGAQD
367 DEILLDNFVTFFIAGHETSANHLAFTVMELSRQPEILARLQAEVDEVIGSKRHLDCEDLG 188
187 RLQYLSQVLKESLR 146
LYPPAWGTFRLLEEETLIDGVRVPGNTPLL
FSTYVMGRMDTYFEDPLT (gap of 12 aa)
PKFTYFPFSLGPRSCIGQQFAQ
MEVKVVMAKLLQRLEFRLVPGQRFGLQEQATLKPLDPVLCTLQPRGWQPAPPPPPC
Family CYP51 just one gene
>cattle|TC221126 89% to CYP51 Length = 624 aa 4-169
PRVRLARPPPNTHLPASVPRWIRRSRGDLGLQRLWQIDAGV
124 AAGIVMLDLLQAGGSVLGQAMEQVTGGNLASMLLIACAFTLSLVYLFRLAVGHLAPPLPT 303
304 GAKSPPYIVSPIPFLGHAIAFGKSPIEFLEDAYEKYGPVFSFTMVGKTFTYLLGSEAAAL 483
484 LFNSKNEDLNAEEVYSRLTTPVFGKGVAYDVPNTVFLEQKKMLKSGL 624
>cattle|AW261175 97% to CYP51 Length = 298 aa 182-277
2 IRHEKETKEYFKSWGESGEKNLFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFS 190
191 HAAWLLPGWLPLPSFRRR 244
244 DRAHREIKNIFYKAIQKR 297
>cattle|TC211264 95% to CYP51 Length = 1401 C-term
1399 LDFNPDRYLEDSPASGEKFAYVPFGA (1?)
GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGY 1220
1219 FPTVNYTTMIHTPEKPIIRYKRRSK* 1142
>AC108175.2
22188 MLDLLQAGGSVLGQAMEQVTGGNLASMLLIACAFTLSLVYLFRLAVGHLAPPLPTGA (0) 22018
20194 KSPPYIVSPIPFLGHAIAFGKSPIEFLEDAYEK (0) 20096
17795 YGPVFSFTMVGKTFTYLLGSEAAALLFNSKNEDLNAEEVYSRLTTPVFGKGVAYDVPNT 17619
16395 VFLEQKKMLKSGLNIAHFRQHVSIIEKETKEYFKSWGESGEK 16270
14644 NLFEALSELIILTASHCLHGKEIRSQLNEKVAQLYADLDGGFSHAAWLLPGWLPLPSFR 14468
13586 RRDRAHREIKNIFYKAIQKRRESGEKIDDILQTLLESTYK 13467
13076 DGRPLTDDEVAGMLIGLLLAGQHTSSTTSAWMGFFLARDKTLQEKCFLEQKTVCGENLPPLTYDQ (0) 12882
11489 LKDLNLLDRCIKETLRLRPPIMTMMRLAKTP 11397
10597 QTVAGYTIPPGHQVCVSPTVNQRLKDSWVERLDFNPDRYLEDSPASGEKFAYVPFGA (1?) 10427
7020 GRHRCIGENFAYVQIKTIWSTMLRLYEFDLIDGYFPTVNYTTMIHTPEKPIIRYKRRSK* 6841