This is a survey of the opossum genome P450s

 

David Nelson June 10, 2005

 

There are 5 CYP4ABX family members shown here plus a CYP4A pseudogene

A CYP51 gene and a CYP51 processed pseudogene are also assembled.

 

66 genes have been found now.  This is nearly complete but I am still in

progress on the rest. I am still missing CYP46 cannot find it

Since CYP46 is in fish and other mammals it is probably in opossum too.

many genes are missing N-terminals or have small defects.

 

>73% to 1A1 65% to 1A2 Built_from_P56591_and_others

489177 - 493862 bp (489.2 Kb) on chromosome fragment scaffold_14927

This transcript is located in sequence: contig_43733

MTSILSLLGFSKSFTVTELLVVSAVFCLVFWIIDSYHQRVPKGFKSPPGPWAWPLIGNVL

TLGKNPHLVLTQMREKYGDVMQIQIGSTPVLVLSGLETIRHALVKQGDDFKGRPDLYSFS

LILDGESLSFGPDSGEVWAARRKLTQNALKAFSISSSPSSSFCYLEEHVIKEAEYLIQKF

QEQKGHFDPVRYIVVSVANVICAICFGQRYDHDDQELLNIVRLSNKFGEVAASGNPVDFI

PILRYLPNSKITAFRDLNEKIVAFTQKLVKEHYRKFEKGCIRDITDSLIEHCQEKKLDEN

ANIMLSEKKVVNVVIDLFGAGFDTVTTAISWGLMYLVAKPEVQKKIHEELDTVIGRERLP

QLSDKTQLPYMEAFILETFRHSSFLPFTIPHSTTRDITLNGFYIPKGRCVFVNQWQINHD

PKIWGDPSVFRPERFLSVDGTINKALSEKVIMFGLGKRKCIGETIARWEVFLFLSILLHR

MEFSVPSGVKVDLTPVYGLTMKHIPCEHFQTKLRS

 

>70% to 1A2, 65% to 1A1 Built_from_Q64391_and_others

451687 - 462429 bp (451.7 Kb) on chromosome fragment scaffold_14927

This transcript is located in sequence: contig_91822

MVSSLLASISISELLLASVIFCLVFWVTRSSHQRVPKGLKSPPGPWAWPLFGNVWTLGKN

PHLTLAQLSEKYGDVMKIHIGSTPVIVLSGLETIRQALVKQGEDFKGRPDLYSSTFVADG

YSLAFNPDSGEVWAVRRKLAQNALNTFSVSSSPSSSSCYLEEHVNKEVKHLIQKFQELME

GVGCFDPYRHIVASVANVISAMCFSQRYEDHKNPEFTTLINASHEFVESATSGNPVDFFP

ILRYIPNPQLQRFKEFNQRFLKFLQNTIREHHKAFDENNIQDITGALYKHSQDKAFGNTS

SSVPEMLIINLINDIFGAGFDTVTTAISWSLMYLVTNPKVQKKIQQELDTVIGRDRWPLL

SDRPQLPFMEAFILEIFRHTSFVPFTIPHSTTRATTLNNFYIPKGTCVFVNQWQTNHDPK

LWEDPSVFRPERFLSADGTVNKALSEKVILFGLGKRRCIGETIARWEVFLFLAILLHQIE

FSVPSGVKVDMTPTYGLTMKHPRCEHFQARPRFSR

 

>75% to 1B1 complete Built_from_Q64429_and_others

690855 - 694571 bp (690.9 Kb) on chromosome fragment scaffold_14803

This transcript is located in sequence: contig_70942

MATSRSSEETWLPGLLSTQQTTLLLLFSVLAVVHLGQWLLRRVRRTPWASCRPPGPFQWP

LIGNAVEVGSIPHLSFTRLARRYGDVFQIRLGNCPVVVLNGERAIRQALLQQGAAFASRP

PFASFQVVSNGRSLAFNKYSELWKVQRRVAHGTVRAFSTGQVRSRQVLEQHVLSETRELV

ELLVQGSAGGAFLNPGPLTVVAVANVMSAVCFGCRYSHSDEEFRELLSHNERFGRTVGAG

SLVDVLPWLQRFPNPVRTAFRDFQKLNQDFYSFVLDKFLKHKSSLQPGAPPRDMMDAFIH

TVGKEEENPGEVKPGLRLDTEYVPATVTDIFGASQDTLSTALQWLLILFIRYPKVQAQVQ

EELDRVVGRDRLPSLNDQPHLPYVMAFLYEAMRFSSFVPVTIPHATTIDTSVMGYHIPKD

TVVFINQWSVNHDPEKWQNPEDFNPARFLDKKGFIDKDLASGVMIFSMGKRRCIGEELSK

IQLFLFIAILAHQCNFLANPDEDPEMNFSYGLTIKPQSFTINVTLRESMELLDSTVQRLQ

EE

 

>72% to 1A8P not a pseudogene Built_from_Q9PTY7_and_others

405900 - 420186 bp (405.9 Kb) on chromosome fragment scaffold_15058

This transcript is located in sequence: contig_41044

MFVIETISKEVTISFLVLMIVFIFIRALGNRNKKHMSPPGPRPFPIIGNLLQLGDHPYLTFMEMKKKYG

DVFLIKLGMVPVVVVNGTEMVKKGLLKDGENFAGRPHMYTFSFFAEGKSLSFSVNYGESW

KLHKKIAMNALRNFSKAEAKSSTCSCVLEEHVTEEASELVKIFSKLSLKQGSFDPKSSIT

CAVANVVCALCFGKRYGHFDKEFLRIIKTNEEFLKASSAANPADFIPCFRYLPLRIIHAP

REFYCQLNHFIEQHVQDHITTFDKNHLRDITDALVSICRDKSATIKTATLSDNEIISTVS

DIFGAGFETVSGFLHWSFLYLIYYPEIQAKIHEEIDGIIGFKPPRFKDRKNLPYTEAFIN

EIFRHTTFVPFTIPHCTTKDTTLNGYFIPQKTCVFFNMYQVNHDETLWENPDSFQPERFL

NEKGEMNKNLVEKVLIFGMGIRKCLGEDVARNEVFIFIVSILQQLKLKKCPEVQLDLTPV

YGLVMKPKPYQLIVEPRFHVNSST*

 

>CYP2ABFGST cluster gene order same as human but subfamilies are single copy

except 2B

 

>65% to Cyp2t4 mouse scaffold_14886 426887-434448, 69kb from 2F1

434448 MWLLFLLCLLLLALVLQGRGLKGSQGRLPPGPMPLPLLGNLLQLGPSGLDKRLME (0) 434284

sequence Gap aa 56-138

433349 SIEERIREEAAALVQELAVTK (1) 433287

433027 EVPFNPLRHIRNAVANVICSVVFGERYTYEDPDFRTLLDLLNDNFQILSSQWGQ (0) 432864

432453 MYNIFPSLLDWIPGPHHRIFSNFKKLQAFISEEIQKHKERRQPEEPRNFIDFFLDQMEK (0) 432277

432179 EKQDPRTHFYLETLVMTTHNLFFGGTETTSTTIHYGLLILLKYPNIA (1) 432042

428778 EKMQEEIDSMVGRARPPCLEDRDRLPYTNAV 428686

428685 IHEIQRFISVVPMGLPHILTQDTHFRGHFLTK 428590

428265 GTNIIPLLISAHQDPTQFKDPENFNPNNFLDDEGAFQNNEAFMPFAL (1) 428125

427069 GKRICLGAGLARMEIFLFLTTILQHFTLCAVKRPEEIDLSPKSTGLGNVSPPYELRLKPR* 426887

 

>71% to 2F1 Built_from_Q91Y29_and_others_9 pep:novel scaffold:BROAD0.5:scaffold_14886:503290:534552:-1 (modified)

MMELGGGLGLALGLAVAAYLLLNWWRHRFSGLPPGPAPWPFLGNILGVDVVDLLKSLKE (0)

FHLGSRPCVVIAGYQALKESLIDRAEEFGGRGEFPAVQMWSHGDGIAFSNGEKWKVLRRF

SIQILRNFGMGKKSLEERILEEEAFLLEELKKTEGAPFDPTFVLSRSVSNIICSVVFGSR

FDYEDERLLKLVHLINENFKIMSSPWGEIYNIFPGLLQWIPGPHRSLFQNYGSMKTLIDR

IIHEHQINLDSTSPSDFIDCFLIKMAEEAKSPDSYFHMETLVKTTFILIFGGTETVGTTL

RHSFLLLMKYPQIQARVQAEIDSVVGRGRRPTLDDRSSMPYTDAVIHEVQRFADIIPMNL

PHRLTRDTLFRGFLLPKGTDIITLLNTVHYDPSQFKTPNEFNPAHFLDSKEEFKKSPAFM

PFSAGRRLCLGEPLARMELFLYLTGILQNFTLQPLCSPEEIDLTPLSSGLGNVPRPFQLR

MVPR

 

>80% to 2A6 complete Built_from_Q91Y29_and_others_10 pep:novel scaffold:BROAD0.5:scaffold_14886:523331:534582:-1

MLVSGLLLGALFFCLSALVVLSVWRQRKVRGSLPPGPTPLPFIGNYLQLNTKQMYDSIMK

LSLKYGPVYT

IGEKFGPVFTVHLGPRPIVVLSGYEAVKEALVDQAEEFSGRGEQATFDWLFKGFGVAFSN

GERARQLRRFSITTLRDFGMGKRGIEERIQEEAGFLVEALRGTKGAPIDPTFFLSRTVSN

VISSIVFGDRFEYEDKQFLMLRMMLGSFQFTATSMGQLYEMFSGVMKHLPGPQQQAFKEL

QGLEDFITEKVRENQATLDPNHPRNFIDSFLIKMQEEKKNPNTEFFMRNLTMTTLNLFFA

GTETVSTTLRYGFLLLMKHPEVEAKIHEEIDRVIGRNRSPKFEDKAKMPYTEAVIHEIQR

FGDMIPMGLARRVTKDTKFRGYLIPKGTEVYPMLGSVLRDQSFFACPQDFNPQHFLDEKG

QFKKSDAFVPFSIGKRYCFGEGLARMELFLFLTTILQNFRFQSPLPKEAIDISPKMVGFA

TIPRSYTISFLPRG

 

>82% to CYP2G2P Built_from_Q8SQ66_and_others_6 pep:novel scaffold:BROAD0.5:scaffold_14886:612386:623485:-1

MEFGGAITLFLALCVPCLVILIAWKRMHKGGKLPPGPTPLPFLGNLLQVRTDATFQSFLE

LSKKYGPVFTVYMGPRRVVVLCGHEAVKEALVDQAEEFSGRGELASIDRNFQGHGVALAN

GDRWRILRRFSLTVLRNFGMGKRSIEERIQEEAGFLMEEFRKTKGTPIDPTFFLSRTVSN

VISSVVFGSRFDYEDKQFLYLLHLINESFIEMSTPWAQVLYDMYSGIMQYLPGRHNKIYN

LIEELKDFIASRVKINEASLDPNNPRDFIDCFLVKMHQEKNNPKTEFNLKNLVLTTLNLF

FAGTETVSSTLRYGFLLLMKYPEVGAKVQKEIDHVIGQNRIPKAEDRMQMPYTDAVIHEI

QRLTDIVPMGVPHTVTQDTFRGYILPKGTDIFPLIGSALRDPKYFNPEAFEPQHFLDEEG

RFKKNEAFVPFASGKRVCLGEAMARMELFLYFTTILQNFSLQPLVPPSEIDITPQISGFG

NIPPTYKLCIVAR

 

 

>61% to 2B6 Built_from_Q8SQ66_and_others_8 pep:novel scaffold:BROAD0.5:scaffold_14886:627043:652084:-1

MNAGDLLLLLAVFLGFLLLLAKRPQKVNLPPGPTPLPLLGNLLQLGRHGLIKSFLEC

RDKYGDVFTVYLGTRRYFVICGPESVKKALVDNAEVFSGRGTLAISEMVFQRY

GFFVTYDGWKTLRRV

SIATLRDFGMGKRSIEEQIKAEAQCLVEELQKSQGALIDPTFIFHSVTANVICSIVFGER

FSYQDTQFREILNMFVEVFTILSSLWMQFFEQFPSVLKLLPGPHYRVLKIAQHIKDFISS

KIEQHQESLDPNSPRDFIDSFLLRIEKEKETPDSKYHVKNLVLTVLSLFFAGTETTSTTL

RYGFLLLLKYPQVAEKVQEEIDQVVGRDRAPEIKDRAKMPYTDAVIHEIQRFSDLLPMGI

PHMVTEDTSFQGYFLPKGTDVFILLSASLKDPCYFEKPHIFDPNHFLDAQGALKKNEAFI

PFSMGKRACLGEGIARTELFLFFTTILQNFSLVSSKPLEDLDIRPQCSGLGTLPQIYKLG

FLPR

 

 

>73% to 2B6 last half of gene, seq. Gap upstream

Built_from_Q8SQ66_and_others_9 pep:novel scaffold:BROAD0.5:scaffold_14886:661530:672893:-1

Missing exon 1 and exons 3,4 in two sequence gaps

REKYGDVFTIFLGSRPVVVLCGPDTVKEALVEKAEEFSGRGPIAMVDSVFQGLG

Sequence gap

LFELFSGFLKFFPGPHRRSQSNLEFINAYIADNVEKHRQSLDPNAPRDFIDIYLLRMEK

EKGISGTEFHHKNLILTVLSLFFAGTETTSTTLRYGCLILLKYPGVAEKVQDEIDQVIGK

DRTPEIKDRAKMPYTEAVIYEIQRFSDLLPLGVPHCVTQDTSFRGYLIPKGTEVYPLLST

VLHDPKYFKKPYDFDPNHFLDAQGSLKKNEAFIPFSSGKRICLGEGIARMELFLFLTTIL

QNFSLRSSMVPADIDLTPRESGVGNVPPTFQIQFLP

 

>62% to 2S1 Built_from_P24454_and_others_1 pep:novel scaffold:BROAD0.5:scaffold_14886:762460:771873:1

two frameshifts and a short seq gap in exon 1

MALATGVASPLLVLV

LVLVLLLALLVLRQGRPRSSRLPPGP

AALPLLGNVGQLWPGGXXXXXXX

(0) LSAKYGPVFTVYLGSRPVVVLSGYRAVKEALVDQAEAFSGRGKIAGLEKTFHEH

GLFFANGEQWRLRKFTTLSLRLGMGKKEGEEHIQEEARCLLEALRSTQGSPLDPSLLLSQ

AVSNITCLLLFGKRFDYEDKKFQAMVLATAGILTEISSPWGQVCEMFMGPVCYLSDILKY

LSAPHGRLTRHLSTLAAFVSDQIQQHQETLDPEAPVRDFIDDFLLKMRQEEKAQGVGPDH

TDFLLTTVNLLFAGTVTVSATLRFAFLLLLKYPEFQDRIHEELNQELGRERAPSLGDRGR

LPYTDAFLHEVQRFLALIPMGVPRTVTKPTIFQGYELPQGIEVFPLLGSVLHDPEFFERP

KEFYPRHFLNADGRFIKNEAFLPFSSGKRICLGEGLARTELFLFFTTILQNFSLESPSPL

GALSLHPAISGFANIPPTFQLRFRPR

 

>58% to 2C9 Built_from_Q29508_and_others_4 pep:novel scaffold:BROAD0.5:scaffold_13575:1131749:1157623:-1

21kb from CYP2E seq

MEPWALTTFFLVVCVSFLVFLSLWRKDYKGRNLPPGPFPLPIIGNLLQLGHNLSMSLCKL

SEKYGPVYTVYFGPQPVVVLHGYKALKEALTDQGDIFGERGHLPIIDDIYRGQGIVFSHG

EKWKQIRRFSLMTLRNFGMGKRSIEERVQEEAQFLLEELRKTNSQPFDPTFILGCAPCNV

ISSILFHQRFNYDDEEFLSMLRILNENVTLLNTPMAQLYNNFPWFLHYFPGPHKNFFSNV

KKLREFILKNAKKYQQTLDPNNLKNYIDCFLHKMQQDQKNPDSVFDLENLATAGMDLFDA

GTETTSTTLRYGLLLILKYPEVQNKIHEEIDQVIGRHRIPSIKDKLEMPYTEAVLHEILR

FVDLVPFSLPHEVTHDTQLQQHFIPKGTTVYPLLSSVLYDAKEFPNPKEFDPRHFLNKDG

SFKKSDYFVPFSIGKRACLGEGLAKMELFIFLATILQNFTLKSVIDSKEIDIKPGSTGLL

NVPPKYQLCLLPR

 

>62% to 2C19 Built_from_Q9UEH3_and_others_1 pep:novel scaffold:BROAD0.5:scaffold_13115:5872:11336:-1

exon 1 in a seq gap, exons 6-9 off the end of the scaffold

LAKKYGSIYTLYFGTQRVVVLHGYNIVKEALIDKGDIFMERGNVPIFEDTVK

VVFSRGERWKQIRRFSLMTLRNFGMGKRTIEERVQEEAQCLVEELRKTK

GQPNDPTFILGCAPCNVICSILFRERFNYKDEKFLYLMGILNENVQLFAKPWIQ

LYNFLPAFRVHLPGKHKQLFKNVEELKCFILERVKEHQEILDPNNPQDYIDCYLSKMQQ

 

90kb seq gap between these two genes may have another 2C gene

 

>62% to 2C19 complete Built_from_Q8QZW4_and_others_1 pep:novel scaffold:BROAD0.5:scaffold_13115:146250:169643:-1

MEPWGLTTTVLLTCVLFLIFLSLWNHGTKKGKLPPGPTPLPIFGNLLQFDFKNMAATMSK

LAKKYGSIYTLYFGMERVVVLHGYNIVKEALIDKGDIFMERGNVPIFEDAIKGQGVIFSR

GERWKQLRRFSLMTLRNFGMGKRSIEERVQEEAQCLVEELRKTKGQPNDPTFILGCAPCN

VICSILFRDRFKYKDEKFLYLMSLLNENFQLFTKPWIQFYNFLPAFRVHLPGKHNQFFKN

IGELKRFILERVKEHQEILDPNNPQDYIDCYLSKMQQEKNNPQSEFDVENLIMTGVDLFS

AGTETTSSTLRYGLLLILKHPEVQAKIHEEINRVIGHNRIPSIKDRQDMPYMDAVVHEVQ

RFIDLVPLNVPHAVNQDIQLQQYTIPKGTNVFPLLSPVLCDSKEFSNPDKFDLQHFLDKN

GSFKKSDYFMPFSAGKRACLGEGLARMELFLFLTTILQNFTLKPVGDPNEISVKNNHVGF

TNVPPYYQLCFLP

 

>oppossum EST DR038220 96% to scaffold_13115:146250:169643

PSIKDRQDMPYMDAVVREVQRFIDLIPLNLPHAVNQDIQLQQYTIPKGTNIFPLLSPVLR

DSKEFSNPDKFDPQHFLDKNGSFKKSDYFMPFSAGKRACLGEGLARMELFLFLTTILQNF

TLKPVGDPNEISVKNNHVGFTNVPPYYQLCFLP

 

>scaffold_13599 has three full 2C genes and two pseudogenes

This scaffold may be close to scaffold_13115 as part of the 2C cluster

 

 

>scaffold_13599 pseudogene parts between 5200000 and 5275000

related to CYP2C sequences

first pseudogene

GPVPFPTIGNTLQLDRRNIPESLCRVNK

VVILHGFKAVKEALIDGRNKFAARGSLPVFKFISGGLGMFLLDTDQKEGKE

 

Second pseudogene

LLLCISCLLILVWKRGFGKGKLPPGPVPLPIVGNLLQLDLKNIPES

LAKEYGPVFTLQLGLDRVVVLHGYKAIKEALIDHGDNFSSRGAMPIFQVINN

Missing exons

EKQQPQSEFTIDNLIWTVSDLFSAGTETMSTTLRYGLLILLKHPEI

VSEKIHEEIEHVIGRNRSPCMEDRNKMPYTNAVVQEIQRYVDLLPTGVPHAVSQDTQFRQYLIPK

GTTIIPLFTSILNDEEFPNSQQFDPGHFLDESGNFMKSDYFMPFST

GKRI*LGEGLARIELFLFFTTIL*NFTLKPLIDPKDIDTNPTANGFGKVPPPYKLCFQP

 

>67% to 2C19 scaffold_13599 between 5275000 and 5305000 not annotated in browser

MDPSVVNAFGLLFCISCLLLILAWKKDFRRGKLPPGPVPFPIIGNILQLDLKNIPESLCK

LAKEYGPVFTLQLGFTRTVVLHGYKAVKEALIDHGDQFAARGHMPVFEFISQGL

GIVSSNGERWKQLRRFSLMTLRNFGMGKKSIEENVQEEAKLLVEAIKQTK

XXPCDPTFILGCAPCNVICSLIFQKHFEYKDPKFLYLMKLLDDDLKLLSSPWIQ

VYNYFSPLIHYLPGLHHKLFKITDLQKKFILEEVKEHQQTLDPNNIRDFIDCFLMKMEQ

EKQKPLSEFTIGNLVNTAIDLFAAGTETTSTTLKYGFLMLLKHPEIT

VTEKIHEEIDRVIGYNRSPCMEDRNKVPYTNAVVHEIQRYIDHIPTSLPHAVTEDVQFRQYLIPK

GTTIIPLLTSVLYDDEEFPNPHQFDPGHFLDASGNFKKSDYFMPFST

GKRICLGEGLARMELFLFFTTVLQNFTLKSLIDPKDIDTTPVDSGFGKIPPSYKLCFLP

 

>Built_from_Q9JJ02_and_others_10 pep:novel scaffold:BROAD0.5:scaffold_13599:5307180:5340000 region :1

removed last three exons from another gene

added these back from correct location

MGPSVVTALGLLFCISCLLLILSWRKGFGKGKLPPGPVPLPIIGNMLQLNLKNIPESLCM

LAKEYGPVFTLQLGVQRIVVLHGYKAVKEALIEHGEQFAARGPMPIFELVSNGFGIGVSN

GERWKQLRRFSLMTLRNFGMGKRSIEERVQGEAKFLVEELKKTKGLPCDPTFILGCAPCN

VICSLIFQKHFEYNDQKFLYLMKLLHEQVRIGSSAWIQFYNCFPSLVQHLPGPHRKLLKL

FHFLHTFILEEIKEHQGTLDPSNPRDLIDCFLMKMEQEKQQPLSEFNIDNLVNTVADLFG

AGTETTSTTLRYGLLMLLKHPEIT

EKIHEEIDRVIGHNRSPCMEDRNKMPYTNAVVHEIQRYIDLIPTSLPHLVTEDTQFRQYIIPK

GTTIIPFLSSVLYDEKEFPNPNQFDPGHFLDENGNFKKSDYFMPFST

GKRICLGEGLARMELFLFFTTILQNFTLKSLIDPKDIDTTPIDSGFGKIPPSYKLCFLP

 

>70% to 2C18/2C19 Built_from_Q9JJ02_and_others_8 pep:novel

scaffold:BROAD0.5:scaffold_13599:5275928:5400120:1

in 5370000-5400000 region

removed first exon since it is from another gene and replaced it

MVTAFGLVCCTLCLLLISALRKRCGKGKLPPGPGPLPIIGNILQLDTKNIPKSLCM

LAKVYGPVFTLYLGSKSVVVLHGYKAMKEALIDHGEEFAGRGSFPIIDAINKGLGLAFSN

GERWKQIRRFSLMTLKNLGMGKRSIEERVQEEAKCLVEALKKTNGMPCDPTFILGCAPCN

VICSIIFQKRFEYHDQKFLHLMKLLDEKVKILSSPWIQIYNLLPLLAQYLPGSHHKLFKI

SQMMHNFFLEKVKEHQDALDPNNPQDLIDSFLIKMEQEKEKPQSEFTMENLVCTVSDIFG

AGTQTTSTTLRYGLLLLLKHPEITGKIHEEIDRVIGHNRSPCLKDRNSMPYTDAVIHEMQ

RYIDLVPANLTHSVIQDVKFRQYIIPKGTTIIPLLTSVLYDNEEFPNPDQFDPGHFLDES

GNFKKSDYFMPFSAGKRICIGEGLARMELFLFFTTILQNFTLKSLIDPKDIDTTPIASGF

GNIPPSFKLCFLPS

 

>Built_from_Q9JJ02_and_others_7 pep:novel scaffold:BROAD0.5:scaffold_13599:5275928:5470669:1

removed first 4 exons and last exon from another gene

and replaced them

this gene in the region 5400000-5440000 scaffold_13599

LILVLCISCLFLISSRKKSHGKGQLPPGPFPLPIVGNLLQLDTKHIDKSLGS

LTKVYGPVYTLHFGSERVVVLHGYEAVKEALIDHGEEFAARGSLPIIDAVSKGF

GLVFSKGERWKELRRFSLMTLRNLGMGKRSIEERVQEEAKYLVEEFKKS

XXPCDPKFILECVPCNVICSVIFQKRFEYSDRKLQTLMELLDENIKILTSPWIQ

VYNFIPSLVHYLPGPHRTFLNN

CKIMHNFIEEKVKEHQETLDSNNPQDFIDYFLIQMGQKKQNQQSEFTMENLILTVSDLFI

AGSETTSTTLRYGLLLLLKYPEITDKIHEEIDRVIGRDRSPCMKDRNSMPYTDAVIHEIQ

RHLDLIPFNLPHAVKQDTRFREYVIPKDTTIFTSLSSVLYDEKEFPNADQFDPGHFLDES

GNFKKSDYFMPFSI

XKRACVGEGLARMELFLFFTNILQNFTLKPLIDPKDIDTTPISNGFGCVPPSYKLHFLPV

 

>68% to 2C19 this gene in the region 5440000-5470000

scaffold_13599

MDPSVVTALGLIFCVSCLLLISAWRKGFGKGKLPPGPTPLPIIGNLLQLDTKNINKSFCE

LAKTYGSVFTLYLGSERAVVLHGQKAVKEALIGNGDAFAGRGSFPISETINKGL

GLLFSNGERWKQIRRFSLMTLRNFGMGKRSIEERVQEEAKRLVEALKNT

GLPCDPTFIFGCAPCNVICSVVFQKHFEYQDKKFLTLMEYLNENLQILSSPWIQ

VYNLFPSLIHHLPGIHHKVIKNFRALNDFVLERVKEHQETVDPNDPRDFIDCFLMKMEQ

EKQNPKSEFIIENLVSTTIDLFGAGTETTSTTLRYGFLLLLKHPQIV

DKIREEMDQVIGQNRSPCMKDRSSMPYTDAVIHEIQRYIDLVPTSLPHAVTQDVKFRQYLIPK

GTTIIPLLTSVLYDNEEFPNPEQFDPGHFLDESGNFKKSDYFVPFSI

GKRACVGESLAQMELFLFFTTILQNFTLKPLVDPKDIDVTPISNGFNHVPPCYELCFLPS

 

>CYP2C 57% to 2C18 Built_from_Q6PER7_and_others missing 216 aa

257423 - 310084 bp (257.4 Kb) on chromosome fragment scaffold_13485

This transcript is located in sequence: contig_23460

LYNMYPSLIKHLPGSHRTINKNVLEVRNFIMDEVKKHQETLDPNNPRDYIDGFLIKIQQE

 KLNPQSAFNYQELMATGSNLFSAGTETTSSTLRYGLLLLMKHPKIQDKVHEEIDRVLGSS

 RKPSMQDRVKMPYVDAVVHEIQRYIHLLPFSLPRLAAQDIHFQKYVIPKGTSVFPLLYSV

 LYDRKAFPNPYEFDPENFLDKSGNFQKNDHFVPFSLGKRLCLGESLARMEVFLFLTTILQ

 NFTLKPMVEPKELVTTPLRNGIVNIPSIYKLSLIP

 

>CYP2D

Database location  : contig_35735   16579 to 16764 (-)

Genomic location   : scaffold_11452 49597 to 49782 (-)

scaffold_11452:47242:51381:-1

GENSCAN00000009132 pep:Genscan scaffold:BROAD0.5:scaffold_11452:52083:52720:-1

this fragment 63% to 2D6

missing exon 1 in a sequence gap

SRRSSGKVFSVQLLWKPAVVLSRPDAVREALVHRSEDTAGRPPSLVYSHLGFGPKCPRQ

VVLAQYGEAWKEQRRFSLTTLRNLGLGKQSLERWVTAEADFLCSAFAAR

Sequence gap missing exons 4-5

This frag 80% to 2D6

AKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQ

KVQEEVDVVIGRSRRPTMKDQAHMPFTNAVIHEVQRFGDIVPLGIPHMTTRDTEIQGFFIPK

GTVLITNLSSVLKDEATWEQPYRFYPEHFLDAEGRFVKPEAFMPFSA

GRRACLGEPLARMELFLFFTCLLQRFSFSVPVGQPRPSTHGFFAFPVAPLPYQLCAVVR

 

>67% to 2E1 Built_from_Q29508_and_others_6 pep:novel scaffold:BROAD0.5:scaffold_13575:1164439:1178780:-1

21kb from CYP2C seq

MAFLGATLALLVWFVFLLFVSVWKQIQSSWKLPPGPFPLPIIGNLFTLDIRNFPKSFAKL

AKEYGPVFTVYVGSRRIVVLHGYKVVKEALLDLKNEFAGRADIPAFEAHQNSGIIFNDSE

TWRDTRRFGLTVLRDYGMGKQSNEERIQRESHFLVEALRNTNSQPFDPTFVLGGGPLNVI

HDILFHSRLDYSDKMCQRLLHLYNEDFYLLSSPWIQVYNNIGSYLRYLPGSHRKLLKNIS

EAKQYVFEKVKEHQEVVDLNHPYDLTDTLLIQMEKDQKKKKLGFNLENVTVTVADLLFAG

TETTSTTLRYGLLLLLKHPKVEEKLHEEIDRVIGPHRLPSMKDKINMPYMEAVVHEIQRV

INLVPSNLPHVMHEDIHFRGYLLPKGTTVYPTLDSVMLDSEEFPNPEQFDPGHFLNENGK

FRYSDYFKAFSAGKRVCVGEGLARMEIFLFLTIILQHFNLKPLISPEKIDTSPLTVGFGT

IPPIYKLCVLPRS

 

>61% to 2J2 Built_from_P12791_and_others_8 pep:novel

scaffold:BROAD0.5:scaffold_16806:796848:899843:1

be aware this seq is a hybrid of two genes, yellow region identical

LLLSAILLILASYLQRKRRQANYPPGPSPLPLLGNFFQIDFKKPQVAFQEVRSERGGPER

TETFNPEHFLENGQFTKREAFLPFSAVKLKILNIDRNYFHNLGISLSNGQIWKDQRRFTL

MTLRNFGLGKKTLEFRIQEEATYLTEAIKEEKGQPFDPHFQINNAVSNIICSVTFGYRFE

YHDSQFRELLKILDEVMVLHGRWECQLFEMFPWIMRFLPGPHHKLFREWKKLQSFVTQII

KQHKEDQNSEEAQDYIDAYLKELSKGDVSSSFSEGNLASCTLDIFFAGTETTSTTLRWAL

LYMALYPEIQGKIQAEIDRVIGQSRQPTMADKENMPYTNAAIHEVQRMGNIIPINVPRVA

TVDTTVAGYHVPKGTVLMTNLTALHRDPKEWATPETFNPEHFLENGQFKKRESFLPFSAG

KRVCLGEQLARAELFIFFTCLLQRFTFQAPPDTQLSLDFRTGVTISPVPYKICALPRET

 

>64% to 2J2 Built_from_P12791_and_others_10 pep:novel scaffold:BROAD0.5:scaffold_16806:860579:998071:1

be aware this seq is a hybrid of two genes, yellow region identical

MLLSLWEGASLLTLLLVSTVLLILAFYVQRGRQHPNFPPGPPLLPIFGNIFQMDPKKPQD

TFQEFAKKYGNIFCLKLFGAPMICVTGLPLIKEVLLKQGQVFIDRPQTPWSSYAFKHHGI

SLSNGQIWKDQRRFTLMTLRNFGLGKKTLEFRIQEEATYLTEAIKEEKGQPFDPHFQINN

AVSNIICSVTFGYRFEYHDSQFRELLKILDEVMVLHGRWECQLFEMFPWIMRFLPGPHHK

LFREWKKLQSFVTQIIKQHKEDQNSEEAQDYIDAYLKELSKGNMSSSFNEDNLVACTLDL

FFAGTETTSTTLRWALLYMALYPEIQGKIQAEIDRVIGQSRQPTMADRENMPYTHAAIHE

VQRMGDIVPLNVPRIATVDTTLAGYHVPKGTMLLTNLTALHRDPKEWATPDTFNPEHFLE

DGQFKKREAFLPFSAGKRVCLGEQLAQPELFIFFTCLLQRFTFQAPPDTQLSLDFQFGLT

ISPVPYQICALSRET

 

>63% to 2J2 Built_from_Q9YGF1_and_others_9 pep:novel, complete

scaffold:BROAD0.5:scaffold_1903:224902:254645:-1

MQFPSLWEGASLKSLLLFSAVFLMLASYLQRRRRHPNYPPGPFQLPFLGNFFHMDHKNPH

MAFYQLAKKYGNIFSLELVGCPVIVVTGLSLIKEVLVNQGQVFVDRPHTPLRSYVFKKLG

LIMSNGQEWKDQRRFTLMTLRNFGLGKRTLELRIQEEAMYLTEAIREEKGQPFDPHFQIN

NAVSNIICSVTFGNRFEYHDSQFRELLKILDEAMMLQTKWECQFFEMFPRIMKFLPGAHK

KLFREWKKLESFVLDIIRQHKENPNSEEAQDFIEAYLTELSKGNISPSFSEDNLVSCTLD

LFLAGTETTSTTLRWALLYMALCPEIQGKIQAEIDRVVGQSRLPTMADRENMPYTNAAVH

EIQRMGNIVPFNVPRMSTVDTTVAGYHVPKGTLLVTNLTALHRDPKEWATPDAFNPEHFL

EDGQFKKRESFLPFSA

GKRVCLGEQLARTELFIFFTCLLQRFTFQAPPDTQLTLDFRIGLTISPAPYKICAIPR

 

>65% to 2J2 Built_from_Q9YGF1_and_others_8 pep:novel missing last exon scaffold:BROAD0.5:scaffold_1903:224902:329336:-1

note: this assembly originally used the last exon of the above gene.

Look between 254645 and 279000 on – strand for C-term

TLLLFSAILLILASYVQRRRRRQQLKYPPGPLRLPIFGNVFQIDTTKAHVSVQEFVRKFG

NIFSMELFGVPMMTVTGLPLIKEVLINQGQVFVDRPQFPLQTYTFKSVGLLMSNGQEWKD

QRRFTLMTLRNFGLGKKTLELRIQEEATYLIEAIREEKGQPFDPHFQINNAVSNIICSVT

FGNRFEYHDSQFRELLKSLDQVSVLQASWECQFFNIIPWIMKFLPGAHKKLFREFKKLES

FVVHVIKQHKEDQNSEEARDFIDAYLKELSKGNNSSSFNEDNLVSCTLDLFFAGTETTST

TLRWALLYMALYPEIQGKIQSEIDRVIGQSRQPTMADRENMPYTNAAIHEVQRMGNIVPF

NVPRVAVVDTIVAGYYVPKGTLLVTNLNALHRDPKEWATPDTFNPEHFLENGQFKKKESF

LPFSM

GKRVCLGEQLARAELFIFFTCLLQRFTFQAPPDTQLSLDFRAGLTISPAPYKICALPRETQPKVGL*

 

>64% to 2J2 Built_from_Q9YGF1_and_others_5 pep:novel scaffold:BROAD0.5:scaffold_1903:224902:380321:-1

modified: only first two exons are new

look between 330000 and 378000 on – strand for last 7 exons

VLPSLQTLLLFSAILLILASYVQRRRRRQQSKYPPGPLRLPIFGNVFEIDIKKPHISIQE

CVRKYGNIFSMDLLSAPMIVVTGLPLIKEVLVNQGQVFVDRPRSPLQSYIFKIN

GLFMSNGQEWKDQRRFTLMTLRNFGLGKKTLELRIQEEAIYLTEAIREEKGE

PLNPHFQINNAVSNIICSVTFGNRFEYHDSQFRELLKSLNQVSVLQASWECQ (0)

FFDIFPRIMKLLPGAHKKHFR

NSRKLESFVVHVIKKHKEDQNSEEAQDFIDAYLKELSKXX

GNITSSFNENNLVACTLDLFFAGTETTSTTLRWALLYMAFYPEIQ

XKIQAEIDRVIGHSRWPTMADRENMPYTYAAIHEVQRMGNIVPLNAPRVATVDTTLAGYHVPK

GTMLLTNLTALHRDPKEWATPDTFNPEHFLEDGQFKKKEAFLPFSA (1)

GKRVCLGEQLARAELFIFFTCLLQRFTFQAPPDTQLSLDFQAGMTLSPVFYQICALPRETEPKLIL*

 

>83% to 2R1 Built_from_Q9QXF7_and_others_1 pep:novel

scaffold:BROAD0.5:scaffold_12644:255201:265755:1

LRSRRQARFPPGPAGLPFIGNIFSLAASAELPHVYMEQQSRVHGQIFSLDLGGLFTVVLN

GYDMVKECLIHQSEIFADRPALPLFKKLTKMGGLLNSTYGRGWLDHRRLAVNSFRCFGSG

QKSFESKIAEEAKCFIDAVDTYQGKPFDLKQLITNAVSNITNVIIFGERFSYEDTEFQHM

IEIFSENVELAASASVFLYNSFPWIGIFPFGKHQQLFKNAAVVYDFLSKIIEKVSANRKP

QSPQHFVDAYLDEMDKGRNDPGSTYSKENLIFSVGELIIAGTETTTNVLRWAILFMGLYP

NIQAQVHKEIDLIVGPNRTPSLEDKPQMPYTEAVLHEVLRFCNVVPLGIFHATSQDTVVR

GYSIPRGTTVITNLYSVHFDKKYWKDPEVFYPERFLDSQGQFVKKEALVPFSLGRRHCLG

EQLARMEMFLFFTSLLQRFHLHFPPDLVPNLKPKLGMTLQPLQYLICAEKR

 

>74% to 2U1 missing N-term Built_from_Q7Z449_and_others

1937657 - 1968473 bp (1.9 Mb) on chromosome fragment scaffold_14967

This transcript is located in sequence: contig_70233

RRRGVPPGPRPWPLVGNLGFMLLPAVFK

19 aa Gap here

SSQVVLEYLSRLYGPIFSFYLGPYLVVVLSDF

 HSVREALVQQGEVFSDRPRLPLISVFTKEKGIVFAHYGPIWRQQRKFSHSTLRHFGLGKL

 SLEPKIIEEFKYVKEEIQKHGGNPFSPFPIISNAVSNIICSLCFGQRFEYNNSDFKKMLH

 LMSRGLEISVSYQMMLINICSWFYYLPFGLFKEIRQIEKDLTVFLKGIIREHRETLDVEN

 PQDFIDMYLLHMEEEMKSNSNTSFDEDYLFYIIGDLFIAGTDTTTNTLLWCLLYMSLNPE

 VQEKVQKEIEKVIGPDRAPSLTDKVHMPYTEATIMEVQRMSAVVPFGIPRMTSEKTTLQG

 YTIPKGTMIIANLWAIHRDPAIWENPKNFSPERFLDEEGQLIKREHFIPFGIGKRVCMGE

 QLAKMELFLMFVSLMQNFIFTFPKDAKKPIMPGKFGLTLSPHPFNVIVSKR

 

>55% to CYP2W1 Built_from_Q6VVW9_and_others_1

scaffold:BROAD0.5:scaffold_5766:862320:898262:-1

ALLGLLLSWALFCLLTRSKERAGKWPPGPAPLPFIGNLHLLDLRRQDKSLMKISEKYGPV

FTIHFGLQKMVVLTGYQAVKEALVDSAEEFADRPPIPIFQRIQEGQGIFFSSGNLWKTTR

KFTMSSMHKLGMGKKLIAKKILEEFSFLEELIDSFKGEPFKLKLFNMAPTNVIFFLLFGE

RFDYQDPTFVTFIRLIDEVMVLLGSPFLHLFNFYPFLGWFLKPHKTVLVKIEEVRVILRK

YMVASRQNISRGYVTSYIDALIQKQNLLPHGEFSASIFHIFPWMKTGAEKPFCNGREWTQ

TKPFVSFLLAKVQDELDRVLEKSRLPEYEDQKALPYTNAVVHEIQRFIALLPHVPHSTSV

DTHFRGYFIPKGTPVIPLLTSVLLDKTQWETPNKFNPSHFLDADGNFVKKAAFLPFSIGH

RVCIGENLAKMEMFLFFASLLQRFTFHPPPGIQEADLDITPQLTFTMRPQPQAVCAVSR

 

>62% to CYP2AB1P Built_from_Q6P0T4_and_others_1 pep:novel scaffold:BROAD0.5:scaffold_15223:222913:242918:-1

MFSLATGLAILATSFLLLRMLAFFLARTQFPPGPCPLPILGNLLQLRFQLHPEKLSQLTR

KYGSIFTVWLGSTPVVVLNGFQAVKDALVTHSEDFADRPVTPLFEDLFGDKGIISTSGHA

WQQQRRFGLITLRALGMGKKVLEQRLQEEAQYLVEIFHRQNGTSFDPHVPIVRAAANVIC

ALVFGHRFPHGDPFFQELMKAIDFGLAFVNTIWRRLYDAFPWLLRQLPGPHRKIFRYQEI

VKSLICQEIERHKQRVPEDLEDFISCYLAQITKRKDDPASTFDEENLIQVIIDLFLGGTE

TTATTLRWALLYMIHHRDVQGKVQQELDTVLGPSRVISFKDRKLLPYTNAVLHEVQRFCS

VISVGAVRKCGTATTVQGFPIQKGTIVLPNLASVLCDPEHWETPWQFNPGHFLDGEGNFV

IHEAFLPFSAGHRVCLGELLAKVELFLVFAHLLREFRLRAPAGASTNERDYILWGTKQPR

PYDICASPRL

 

>64% to 2AC1P Built_from_Q6PA33_and_others_1 pep:novel

scaffold:BROAD0.5:scaffold_12611:1500312:1533637:-1

LDLLSILSGLSLILILILNMKLTLTKNFKKQSPPGPKPLPVIGNLHILNLKRPYQTMLEL

SKKYGPIFSLRMGPKTVVVLSGYETVKDALVNYSEQFGERARIPIFERIFEGKGIVFSHG

ENWKITRRFSLTTLRNFGMGKRVIEERILEECHHLIQVFESHQGKPFEISTIMSASVANI

IVSILFGKRFDYKDPQFLRLLHLIGENIRLAGGPSITIFNMFPVLGFLLQDLKRVLRNRD

ELFSFIRTTFLKHLRKLDKNDQRSFIDAFLIKQQEEKDKSDDYFNNDNLVALVSNLFAAG

TETTSSTLRWGILLMMKYPEIQKKVHNEITEVIGSAQPRIEHRTQMPYTDAVIHEIQRFS

NILPMNLSRETTTDVIFKNYYIPKGTEVITLLTSVLQDQTQWEKPCTFHPQHFTKEGKFI

KRDAFLLLFSSLFLTCVLAGQRMCAGESLAKMELFLFFTSLLQKFTFCPSPGVSNSDLDL

TPDIGFTTRPQPYKICALP

 

>55% to 3A4 Built_from_Q6LEQ2_and_others cyan insertion

2878085 - 2933432 bp (2.9 Mb) on chromosome fragment scaffold_12616

This transcript is located in sequence: contig_103558

GLSTETWTLLVAFVTLLILYGIWPYGIFKKLGIPGPRPLPFFGTFLEYRKGILEFDKQCF

 QKYGKMWGFYDGRLPILAILDPDIIKIVLVKEFYTLFTNRRNFGLNGILDSGITVAEGEK

 WKRIRSIISPTFTTGKLKEMFPIIKHHIDVLVNNIEKKVAQDESVNMKEHLWSLQVLDVI

 TTSFGVDIDSIHTKPNDPLLVHIKKLLSFSFMSPLLILICIPYQSLVLEPISVLRQKVMI

 YFKKKEGEKKGIDTKKDRVDFLQLMIDSQVMNGSRSEKRNNSPKALTEMEIVAQAVTFIF

 AGYETTSTTLNFITYNLATHPEIQKKLLEEIDSTLPNKAVPTYDTIFQMEYLDMVVNETL

 RLFPLGGRIERICQKTAEINGITIPKGTVMLIPVYVLHHDPEYWPEPEEFRPERFDQEGR

 KSIDPYVFLPFGAGPRNCVGMRFALLTLKTALVTLLQNFTMEPCKETPIPLELETKGFMQ

 PKKPIILKLVPRPRP

 

>64% to 3A4 Built_from_Q98T91_and_others grey = error region

941307 - 1075313 bp (941.3 Kb) on chromosome fragment scaffold_19210

This transcript is located in sequence: contig_73184 MNIIPSLSAGTWTLIVLFLTLLYLYGTRTHKLFKNLGIPGPKPLPFFGTVFSYRKGLVNF

 DYDCFKKYGKTWGFYDGRQPVLATMDPETIKTVMVKECYSVFTNRRSFGPVGSLESAITV

 AKDDQWKRIRTVLSPTFTSGKLKEMFPIINQYGDVLVKNMKKEAEKNKPVTMKDILGAYS

 MDVITSTSFGIHVDSLNNPNDPFVREIKKLIRFNFLDPLILSVAIFPFLIPLFNKLDLTV

 FPKEATDFLAKSIIKIKEERTKSTEKASRPLQCLEEEYVYNVNDLSDEEILAQSIIFIFA

 GYETTSSVLSFLFYHLATNPKIQEKLQKEIDAFLPNKEAVTYDALVQMEYLDMVINENLR

 LYPIAGRIERVAKKTVELNGLTIPKGTVVMAPPYVLHRDPEYWPEPEEFRPERFSKENKE

 SINPYVYLPFGAGPRNCIGMRFALMSMKVAVSRLLQEFSFRPCKETQIPLKLSNQPLLTP

 TVPIVLQAELRN

 

>66% to 3A4 Built_from_P51538_and_others

23961 - 170013 bp (24.0 Kb) on chromosome fragment scaffold_11888

This transcript is located in sequence: contig_58548

MNIIPNLSAGTWTLIILFLTILYLYGTRTHKLFKNIGIPGPTPFPFIGTILYYRKGIVGF

 DYGCYKKYGKTWGFFDGTKPVLAIMDPETIKTVLVKECYSVFTNRRMLGLSGILEKAISI

 AEDEEWKRIRTVLSPAFTSGKLKEMFPIINQYGDVLVKNMKKEAEKSKPVTMKEIFGAYS

 MDIIISTSFGIHVDSLNNPNDPFVREIRKLIRFSFLDPLILSITIFPFLIPLFKKLDITV

 FSKDATDFLGKSILRIKEERKKSTEKHRVDFLQLMMDSQTSKNSESHSQKDLSDEEILAQ

 SIIFIFAGYESTSSVLCFLFYQLATNPGIQEKLQKEIDAFLPNKEAVTYDALVQMEYLDM

 VINENLRLYPITGRIERIAKKPVELNGLMIPKGTVVMAPPYVLHRDPEYWPEPEEFRPER

 FSKENKESINPYVYLPFGVGPRNCLGMRFALMSMKVAVSRLLQEFSFRPCKETQIPLKLS

 YRPLLAPSVPIVLQAVLRNKKGN

 

There are 5 CYP4ABX family members shown here plus a CYP4A pseudogene

 

>Gene 4.1

Query location     : CYP3Aamp        430 to   456 (+)

Database location  : contig_55667  10964 to 11044 (+)

Genomic location   : scaffold_1650 18357 to 18437 (+)

 

(1) GKGLLSLEGEKWHYHRRLLTPAFHFNILKDYIYIMRNTVSLML (0)

AGGAAAAGGCCTTCTGAG

CTTAGAAGGTGAGAAATGGCATTACCACCGGCGCCTACTGACCCCAGCCTTCCACTTCAA

CATCCTGAAGGACTACATCTACATAATGAGGAACACTGTCAGCCTGATGCTGGT

 

(0) DKWEKLRTKDSSIDVFDFISYMSLDTSLKATFGLQDFNKEES (2)

AGGATAAATGGGAAAAACTCAGAACCAAGGACAGTTCCATAGACGTCTTTGACT

TTATCTCTTACATGTCCTTGGACACTTCCTTGAAAGCTACCTTCGGTCTCCAGGATTTCA

ATAAAGAAGAAAGGT

 

(2) LFSYFQNMNKLLYLVRKRMETFLYYSDFIYKLTSDHYEFQTTCKELKEEP (1)

AGCTTATTCAGCTATTTCCAG

AATATGAACAAACTTCTATATCTTGTGAGGAAGCGCATGGAAACCTTCCTCTATTACAGT

GATTTCATTTA CAAGCTGACTTCTGACCATTATGAATTCCAGACTACCTGCAAAGAATTA

AAGGAGGAGCCAGGT

 

(1) AKIIRHRRALYKNQSEKDKVPKKKYLNFLDILFQAK (0)

AGCTAAGATCATCCGGCATAGGAGGGCACTCTACAAGAACCAGAGTGAAAAAGACAAGGT

GCCAAAGAAGAAGTACCTGAATTTCCTGGACATTCTCTTCCAAGCCAAAGT

 

(0) DEDGGGFTNEELENEVNTFRFAGQDTVAV

GFSWTLYCLAMNPEYQKKCREEIQGILKDGESITW (2)

AGGATGAAGATGGAGGAGGCTTCACAAATGAGGAACTAGAAAACGAG

GTAAACACATTCAGGTTTGCAGGACAAGACACTGTAGCTGTTGGGATTCTCCTGGACCTT

ATACTGCC TGGCCATGAACCCAGAATATCAAAAGAAATGTAGAGAAGAGATCCAAGGTAT

TCTGAAAGATGGAGAATCCATAACCTGGT

 

(2) DHLTQMTYSTMCIKESFRMYSPIPKVARTLNQPITFPDGQTLPS (1)

AGGGACCA

TCTTACCCAGATGACTTACAGCACCATGTGCATCAAAGAATCCTTCCGCATGTATTCACC

TATCCCCAAAGTAGCCAGGACACTCAACCAGCCCATCACTTTCCCAGATGGACAGACTTT

ACCTTCAGGT

 

(1) DTTVTINIWALHYNPAIWENPK (0)

AGACACGACTGTGACCATAAACA

TCTGGGCTCTGCACTACAACCCTGCAATCTGGGAAAACCCAAAGGT

 

10961 (0) VFDPERFTPENTKKRHPYAFLPFSAGLR (2) 11044

 

AGGTCTTTGACCCTGAAAGATTCAC

CCCAGAGAATACTAAGAAGAGACACCCTTATGCTTTCTTACCATTTTCTGCTGGGCTCAG

GT

 

NCIGQQFAMNLIKVSLALTLLRFELLPDLEKPPIPISQIVLRSTSGFHFYLKPLR*

AGGAACTGCATTGGGCAGCAATTTGCCATGAATCTAATAAAGGTGAGCCTGGCC

CTGACACTGCTCCGCTTTGAGCTACTTCCAGATCTTGAGAAGCCTCCAATCCCCATATCC

CAAATAGTTCTCAGGTCTACGAGTGGTTTCCACTTCTATCTAAAGCCACTACGTTAG

 

49% to 4X1, N-term 3 exons in seq gap, 58% identical to gene 4.2

GKGLLSLEGEKWHYHRRLLTPAFHFNILKDYIYIMRNTVSLML (0)

DKWEKLRTKDSSIDVFDFISYMSLDTSLKATFGLQDFNKEES

LFSYFQNMNKLLYLVRKRMETFLYYSDFIYKLTSDHYEFQTTCKELKEEP

(1) AKIIRHRRALYKNQSEKDKVPKKKYLNFLDILFQAK (0)

DEDGGGFTNEELENEVNTFRFAGQDTVAV

GFSWTLYCLAMNPEYQKKCREEIQGILKDGESITW

DHLTQMTYSTMCIKESFRMYSPIPKVARTLNQPITFPDGQTLPS

(1) DTTVTINIWALHYNPAIWENPK (0)

10961 (0) VFDPERFTPENTKKRHPYAFLPFSAGLR (2) 11044

NCIGQQFAMNLIKVSLALTLLRFELLPDLEKPPIPISQIVLRSTSGFHFYLKPLR*

 

Gene4.2

Query location     : CYP4            126 to   168 (+)

Database location  : contig_55667  42355 to 42483 (+)

Genomic location   : scaffold_1650 49748 to 49876 (+)

 

MGADKIPSWLETHWTRPLHLALTFVLALLLLQVVKLYLRRQG

LLRALRLFPGPPPHWLFGNQRE (0)

ATGGGCGCGGATAAGATCCCTTCTTGGCTGGAGACGCACTGGACTCGGCCCTTGCACTTG

GCCTTGACGTTCGTTCTCGCGCTGCTGCTGTTGCAGGTCGTCAAGCTCTACCTCCGGCGC

CAGGGACTGTTGCGAGCCCTGCGCCTCTTCCCCGGACCCCCTCCACACTGGCTCTTTGGG

AACCAGAGAGAGGT

 

(0) FYLEKELQQFNVLPEQYPCALPLWVGAFQVLLNIYDPEYAKILLNRR (1)

AGTTTTACCTGGAAAAAGAACTCCAGCAATTTAATGTGTTG

CCAGAACAATACCCTTGTGCTCTCC

CTCTCTGGGTTGGGGCCTTTCAGGTGCTTTTAAATATCTATGACCCAGAATATGCGAAAA

TTCTTCTGAACCGAAGAGGT

 

(1) DPKIQHGYKFIIPWI (1)

AGATCCCAAAATACAGCATGGGTACAAGTTTATTATCCCCTGGATTGGT

 

(1) GGGLLSLEGKKWYQHRHLLTPAFHLSILKPYIHVMNDSVCRML

AGGAGGAGGACTGCTGAGCCT

GGAAGGAAAGAAGTGGTACCAGCATCGCCATCTGCTGACTCCTGCCTTCCACTTGAGCAT

TCTGAAGCCCTACATCCATGTGATGAATGATTCAGTCTGCAGGATGCTGGT

 

(0)           DTWEKLSTQDNSVEICEPIRLMTLDSIMKCAFSVQTSSQTES (2)

AGGATACATGGGAGAAGCTCAGCACCCAGGACAATTCTGTGG

AGATCTGTGAGCCCATTCGCCTGATGACCTTGGACAGCATCATGAAATGTGCCTTCAGTG

TCCAGACCAGCTCCCAAACAGAAAGGT

 

(2)  FSTNYLSTVTKLSELIFCRLNNYLHHNDLIYRWSSQGQEFQALCQIAHQLP (1)

AGCTTTTCTACCAACT

ACCTCTCAACTGTGACAAAACTCTCAGAACTAATCTTCTGCCGCCTGAACAACTACCTCC

ATCACAATGACTTGATTTACAGGTGGAGTTCTCAGGGGCAAGAATTCCAAGCTCTCTGCC

AAATAGCACATCAGCTCCCAGGT

 

(1) AKIIQERREALKNNSEQDKIRKKKFLDFLDVLLCAK (0)

AGCTAAGATCATC

CAGGAAAGGAGGGAAGCACTCAAGAATAATAGTGAACAGGACAAGATCCGAAAAAAGAAG

TTCTTGGATTTTCTAGATGTTCTTCTTTGTGCCAAAGT

 

(1) SENGEGLSNEELEAEVNTFVFGGHDTTASSLSWIFYCMAMNPEHQHQCREEIRNIIKYGDTITW (2)

AGAGTGAAAATGGAGAAGGCTTATCAAATGAAGAGCTAG

AGGCTGAGGTTAACACATTTGTGTTTGGTGGTCATGACACTACAGCCTCTAGTCTTTCCT

GGATCTTCTACTGTATGGCCATGAACCCAGAGCACCAACACCAATGTCGAGAAGAGATCA

GAAATATCATAAAATATGGGGATACCATTACCTGGT

 

(2) DHLDQMPYSTMCIKEALRLYPPSITIARELSKPITFPDGRFLPT (1)

AGGGACCACCTAGACCAG

ATGCCCTACAGCACCATGTGCATCAAGGAGGCCCTCCGCCTCTACCCACCTAGCATCACT

ATAGCCAGAGAACTTAGCAAACCCATCACCTTTCCAGATGGACGCTTCTTGCCCACAGGT

 

(1) GMTVVLNIWALHHNPTVWENPQ (0)

AGGCATGACAGTT

GTCCTGAATATCTGGGCTCTCCACCACAACCCTACTGTCTGGGAAAACCCACAGGT

 

(0) VFNPERFSQENSMKRHSYAFLPFSAGPR (2)

 

NCIGQQLAMLELKVGLALTLLRFELLPDLEKPPIPMPHLVLRSKNGIHLYLRPLH*

AGGAACTGCATTGGACAACAGCTTGCCATGTTGGAACTGAAGGTGGGACTGG

CCCTGACCCTGCTCCGTTTTGAGCTATTACCAGATCTGGAGAAGCCTCCTATTCCAATGC

CCCACTTGGTTCTCAGGTCTAAGAATGGGATTCATCTATACCTAAGGCCACTGCACTAG

 

54% to 4Z, 4X 55% to 4A11

    MGADKIPSWLETHWTRPLHLALTFVLALLLLQVVKLYLRRQG

    LLRALRLFPGPPPHWLFGNQRE (0)

(0) FYLEKELQQFNVLPEQYPCALPLWVGAFQVLLNIYDPEYAKILLNRR (1)

(1) DPKIQHGYKFIIPWI (1)

(1) GGGLLSLEGKKWYQHRHLLTPAFHLSILKPYIHVMNDSVCRML

(1) DTWEKLSTQDNSVEICEPIRLMTLDSIMKCAFSVQTSSQTES (2)

(2) FSTNYLSTVTKLSELIFCRLNNYLHHNDLIYRWSSQGQEFQALCQIAHQLP (1)

(1) AKIIQERREALKNNSEQDKIRKKKFLDFLDVLLCAK (0)

(1) SENGEGLSNEELEAEVNTFVFGGHDTTASSLSWIFYCMAMNPEHQHQCREEIRNIIKYGDTITW (2)

(2) DHLDQMPYSTMCIKEALRLYPPSITIARELSKPITFPDGRFLPT (1)

(1) GMTVVLNIWALHHNPTVWENPQ (0)

(0) VFNPERFSQENSMKRHSYAFLPFSAGPR (2)

(2) NCIGQQLAMLELKVGLALTLLRFELLPDLEKPPIPMPHLVLRSKNGIHLYLRPLH*

 

GENE 4.3 SCAFFOLD_14847 208370-233419 + STRAND, 62% to 4A11

MGTGLGLAWLPRDFSSFLQTSVLLSLVLLLLKGVQLYRRRQWLLRTFQNFPGPPAHWFNGHFWE (0)

YQKADEITVTLFWAKQFPSAFPRWFSGFTVALQVYDPEYMRILLGRP (1)

DPKADKFYRLLAPWI (1)

GKGLLILNGSTWFQHRRLLTPAFHYDILKPYVALMVDSVLVML (0)

KKWEKLITQDSSLEIFEHVSLMTLDTIMKCAFS*IHRNCQMER (2)

NADDYIQAVKEQAILVFSRVRNDLYHNDFVYWFSPQGYQARRWAHLAHNHT (1)

DQVIKKRKQHLHQEGGLEAILKKRHLDFLDILLCSK (0)

TENGDSLSDKELRAEVDTFMFEGHDTTASGISWLLYSLAMNPEHQQKCREEIRDLLGDEMAIGW (2)

EHLNRMPYTTMCIKESLRLYPPVTSISRDLSKPLTLSDGRYLPA (1)

GTIVTLHIHALHHNPSVWPEPE (0)

VFNPLRFSPENLTSRHTHSFLPFSAGTR (2)

NCIGQQFAMNEMKVAVALTLLHFHLEPDATQPPQLFPRVVLRSKNGIHLKLTRI*

 

MGTGLGLAWLPRDFSSFLQTSVLLSLVLLLLKGVQLYRRRQWLLRTFQNFPGPPAHWFNGHFWE (0)

ATGGGCACTGGCCTGGGATTAGCCTGGCTCCCCAGAGACTTCTCCAGCTT

CCTCCAGACTTCAGTGCTGCTGAGCTTAGTCCTGCTGCTGCTCAAGGGGGTCCAGCTGTA

CCGGCGCAGGCAGTGGCTTCTTAGAACCTTTCAGAACTTCCCAGGCCCTCCTGCCCACTG

GTTCAATGGACACTTCTGGGAGGT

 

YQKADEITVTLFWAKQFPSAFPRWFSGFTVALQVYDPEYMRILLGRPGEKETP

AGTATCAAAAAGCTGATGAAATAACGGTGACACTGTTCTGGG

CCAAGCAATTCCCTTCTGCCTTTCCTCGGTGGTTCTCTGGGTTCACAGTGGCCCTCCAAG

TCTATGACCCTGAATACATGAGGATTCTGCTGGGCAGACCAGGT

 

DPKADKFYRLLAPWI (1)

AGATCCCAAAGCTGATAAATTCTACAGATTATTGGCTCCCTGGATTGGT

 

KKWEKLITQDSSLEIFEHVSLMTLDTIMKCAFS*IHRNCQMER (2)

AGAAAAAATGGGAG

AAGCTCATCACCCAGGACTCGAGTCTAGAGATCTTTGAGCATGTCAGCCTGATGACACTG

GACACCATCATGAAATGTGCCTTTAGCTGAATCCATCGCAACTGCCAGATGGAGAGGT

 

NADDYIQAVKEQAILVFSRVRNDLYHNDFVYWFSPQGYQARRWAHLAHNHT (1)

AGGAATGCTGATGACTATATCCAGGCTGTGAAGGAACAGGCAATCCT

CGTATTCTCTAGAGTTCGAAATGATCTCTACCACAATGACTTCGTCTACTGGTTCAGTCC

TCAGGGCTACCAGGCCCGCCGGTGGGCCCACCTGGCCCATAACCACACAGGT

 

DQVIKKRKQHLHQEGGLEAILKKRHLDFLDILLCSK (0)

AGACCAGGTGATTAAGAAAAGGAAGCAGCACCTCCATCAG

GAAGGAGGCTTGGAGGCTATCTTGAAGAAAAGGCACTTGGATTTCCTGGACATTCTCCTC

TGTTCCAAGGT

 

TENGDSLSDKELRAEVDTFMFEGHDTTASGISWLLYSLAMNPEHQQKCREEIRDLLGDEMAIGW