This is a
survey of the opossum genome P450s
David
Nelson June 10, 2005
There are
5 CYP4ABX family members shown here plus a CYP4A pseudogene
A CYP51
gene and a CYP51 processed pseudogene are also assembled.
66 genes
have been found now. This is
nearly complete but I am still in
progress
on the rest. I am still missing CYP46 cannot find it
Since
CYP46 is in fish and other mammals it is probably in opossum too.
many genes
are missing N-terminals or have small defects.
>73% to
1A1 65% to 1A2 Built_from_P56591_and_others
489177 -
493862 bp (489.2 Kb) on chromosome fragment scaffold_14927
This
transcript is located in sequence: contig_43733
MTSILSLLGFSKSFTVTELLVVSAVFCLVFWIIDSYHQRVPKGFKSPPGPWAWPLIGNVL
TLGKNPHLVLTQMREKYGDVMQIQIGSTPVLVLSGLETIRHALVKQGDDFKGRPDLYSFS
LILDGESLSFGPDSGEVWAARRKLTQNALKAFSISSSPSSSFCYLEEHVIKEAEYLIQKF
QEQKGHFDPVRYIVVSVANVICAICFGQRYDHDDQELLNIVRLSNKFGEVAASGNPVDFI
PILRYLPNSKITAFRDLNEKIVAFTQKLVKEHYRKFEKGCIRDITDSLIEHCQEKKLDEN
ANIMLSEKKVVNVVIDLFGAGFDTVTTAISWGLMYLVAKPEVQKKIHEELDTVIGRERLP
QLSDKTQLPYMEAFILETFRHSSFLPFTIPHSTTRDITLNGFYIPKGRCVFVNQWQINHD
PKIWGDPSVFRPERFLSVDGTINKALSEKVIMFGLGKRKCIGETIARWEVFLFLSILLHR
MEFSVPSGVKVDLTPVYGLTMKHIPCEHFQTKLRS
>70% to
1A2, 65% to 1A1 Built_from_Q64391_and_others
451687 - 462429
bp (451.7 Kb) on chromosome fragment scaffold_14927
This
transcript is located in sequence: contig_91822
MVSSLLASISISELLLASVIFCLVFWVTRSSHQRVPKGLKSPPGPWAWPLFGNVWTLGKN
PHLTLAQLSEKYGDVMKIHIGSTPVIVLSGLETIRQALVKQGEDFKGRPDLYSSTFVADG
YSLAFNPDSGEVWAVRRKLAQNALNTFSVSSSPSSSSCYLEEHVNKEVKHLIQKFQELME
GVGCFDPYRHIVASVANVISAMCFSQRYEDHKNPEFTTLINASHEFVESATSGNPVDFFP
ILRYIPNPQLQRFKEFNQRFLKFLQNTIREHHKAFDENNIQDITGALYKHSQDKAFGNTS
SSVPEMLIINLINDIFGAGFDTVTTAISWSLMYLVTNPKVQKKIQQELDTVIGRDRWPLL
SDRPQLPFMEAFILEIFRHTSFVPFTIPHSTTRATTLNNFYIPKGTCVFVNQWQTNHDPK
LWEDPSVFRPERFLSADGTVNKALSEKVILFGLGKRRCIGETIARWEVFLFLAILLHQIE
FSVPSGVKVDMTPTYGLTMKHPRCEHFQARPRFSR
>75% to
1B1 complete Built_from_Q64429_and_others
690855 -
694571 bp (690.9 Kb) on chromosome fragment scaffold_14803
This transcript
is located in sequence: contig_70942
MATSRSSEETWLPGLLSTQQTTLLLLFSVLAVVHLGQWLLRRVRRTPWASCRPPGPFQWP
LIGNAVEVGSIPHLSFTRLARRYGDVFQIRLGNCPVVVLNGERAIRQALLQQGAAFASRP
PFASFQVVSNGRSLAFNKYSELWKVQRRVAHGTVRAFSTGQVRSRQVLEQHVLSETRELV
ELLVQGSAGGAFLNPGPLTVVAVANVMSAVCFGCRYSHSDEEFRELLSHNERFGRTVGAG
SLVDVLPWLQRFPNPVRTAFRDFQKLNQDFYSFVLDKFLKHKSSLQPGAPPRDMMDAFIH
TVGKEEENPGEVKPGLRLDTEYVPATVTDIFGASQDTLSTALQWLLILFIRYPKVQAQVQ
EELDRVVGRDRLPSLNDQPHLPYVMAFLYEAMRFSSFVPVTIPHATTIDTSVMGYHIPKD
TVVFINQWSVNHDPEKWQNPEDFNPARFLDKKGFIDKDLASGVMIFSMGKRRCIGEELSK
IQLFLFIAILAHQCNFLANPDEDPEMNFSYGLTIKPQSFTINVTLRESMELLDSTVQRLQ
EE
>72% to
1A8P not a pseudogene
Built_from_Q9PTY7_and_others
405900 -
420186 bp (405.9 Kb) on chromosome fragment scaffold_15058
This transcript
is located in sequence: contig_41044
MFVIETISKEVTISFLVLMIVFIFIRALGNRNKKHMSPPGPRPFPIIGNLLQLGDHPYLTFMEMKKKYG
DVFLIKLGMVPVVVVNGTEMVKKGLLKDGENFAGRPHMYTFSFFAEGKSLSFSVNYGESW
KLHKKIAMNALRNFSKAEAKSSTCSCVLEEHVTEEASELVKIFSKLSLKQGSFDPKSSIT
CAVANVVCALCFGKRYGHFDKEFLRIIKTNEEFLKASSAANPADFIPCFRYLPLRIIHAP
REFYCQLNHFIEQHVQDHITTFDKNHLRDITDALVSICRDKSATIKTATLSDNEIISTVS
DIFGAGFETVSGFLHWSFLYLIYYPEIQAKIHEEIDGIIGFKPPRFKDRKNLPYTEAFIN
EIFRHTTFVPFTIPHCTTKDTTLNGYFIPQKTCVFFNMYQVNHDETLWENPDSFQPERFL
NEKGEMNKNLVEKVLIFGMGIRKCLGEDVARNEVFIFIVSILQQLKLKKCPEVQLDLTPV
YGLVMKPKPYQLIVEPRFHVNSST*
>CYP2ABFGST
cluster gene order same as human but subfamilies are single copy
except 2B
>65%
to Cyp2t4 mouse scaffold_14886 426887-434448, 69kb from 2F1
434448
MWLLFLLCLLLLALVLQGRGLKGSQGRLPPGPMPLPLLGNLLQLGPSGLDKRLME (0) 434284
sequence
Gap aa 56-138
433349
SIEERIREEAAALVQELAVTK (1) 433287
433027
EVPFNPLRHIRNAVANVICSVVFGERYTYEDPDFRTLLDLLNDNFQILSSQWGQ (0) 432864
432453
MYNIFPSLLDWIPGPHHRIFSNFKKLQAFISEEIQKHKERRQPEEPRNFIDFFLDQMEK (0) 432277
432179
EKQDPRTHFYLETLVMTTHNLFFGGTETTSTTIHYGLLILLKYPNIA (1) 432042
428778
EKMQEEIDSMVGRARPPCLEDRDRLPYTNAV 428686
428685
IHEIQRFISVVPMGLPHILTQDTHFRGHFLTK 428590
428265
GTNIIPLLISAHQDPTQFKDPENFNPNNFLDDEGAFQNNEAFMPFAL (1) 428125
427069
GKRICLGAGLARMEIFLFLTTILQHFTLCAVKRPEEIDLSPKSTGLGNVSPPYELRLKPR* 426887
>71%
to 2F1 Built_from_Q91Y29_and_others_9 pep:novel
scaffold:BROAD0.5:scaffold_14886:503290:534552:-1 (modified)
MMELGGGLGLALGLAVAAYLLLNWWRHRFSGLPPGPAPWPFLGNILGVDVVDLLKSLKE
(0)
FHLGSRPCVVIAGYQALKESLIDRAEEFGGRGEFPAVQMWSHGDGIAFSNGEKWKVLRRF
SIQILRNFGMGKKSLEERILEEEAFLLEELKKTEGAPFDPTFVLSRSVSNIICSVVFGSR
FDYEDERLLKLVHLINENFKIMSSPWGEIYNIFPGLLQWIPGPHRSLFQNYGSMKTLIDR
IIHEHQINLDSTSPSDFIDCFLIKMAEEAKSPDSYFHMETLVKTTFILIFGGTETVGTTL
RHSFLLLMKYPQIQARVQAEIDSVVGRGRRPTLDDRSSMPYTDAVIHEVQRFADIIPMNL
PHRLTRDTLFRGFLLPKGTDIITLLNTVHYDPSQFKTPNEFNPAHFLDSKEEFKKSPAFM
PFSAGRRLCLGEPLARMELFLYLTGILQNFTLQPLCSPEEIDLTPLSSGLGNVPRPFQLR
MVPR
>80% to 2A6 complete Built_from_Q91Y29_and_others_10 pep:novel scaffold:BROAD0.5:scaffold_14886:523331:534582:-1
MLVSGLLLGALFFCLSALVVLSVWRQRKVRGSLPPGPTPLPFIGNYLQLNTKQMYDSIMK
LSLKYGPVYT
IGEKFGPVFTVHLGPRPIVVLSGYEAVKEALVDQAEEFSGRGEQATFDWLFKGFGVAFSN
GERARQLRRFSITTLRDFGMGKRGIEERIQEEAGFLVEALRGTKGAPIDPTFFLSRTVSN
VISSIVFGDRFEYEDKQFLMLRMMLGSFQFTATSMGQLYEMFSGVMKHLPGPQQQAFKEL
QGLEDFITEKVRENQATLDPNHPRNFIDSFLIKMQEEKKNPNTEFFMRNLTMTTLNLFFA
GTETVSTTLRYGFLLLMKHPEVEAKIHEEIDRVIGRNRSPKFEDKAKMPYTEAVIHEIQR
FGDMIPMGLARRVTKDTKFRGYLIPKGTEVYPMLGSVLRDQSFFACPQDFNPQHFLDEKG
QFKKSDAFVPFSIGKRYCFGEGLARMELFLFLTTILQNFRFQSPLPKEAIDISPKMVGFA
TIPRSYTISFLPRG
>82%
to CYP2G2P Built_from_Q8SQ66_and_others_6 pep:novel
scaffold:BROAD0.5:scaffold_14886:612386:623485:-1
MEFGGAITLFLALCVPCLVILIAWKRMHKGGKLPPGPTPLPFLGNLLQVRTDATFQSFLE
LSKKYGPVFTVYMGPRRVVVLCGHEAVKEALVDQAEEFSGRGELASIDRNFQGHGVALAN
GDRWRILRRFSLTVLRNFGMGKRSIEERIQEEAGFLMEEFRKTKGTPIDPTFFLSRTVSN
VISSVVFGSRFDYEDKQFLYLLHLINESFIEMSTPWAQVLYDMYSGIMQYLPGRHNKIYN
LIEELKDFIASRVKINEASLDPNNPRDFIDCFLVKMHQEKNNPKTEFNLKNLVLTTLNLF
FAGTETVSSTLRYGFLLLMKYPEVGAKVQKEIDHVIGQNRIPKAEDRMQMPYTDAVIHEI
QRLTDIVPMGVPHTVTQDTFRGYILPKGTDIFPLIGSALRDPKYFNPEAFEPQHFLDEEG
RFKKNEAFVPFASGKRVCLGEAMARMELFLYFTTILQNFSLQPLVPPSEIDITPQISGFG
NIPPTYKLCIVAR
>61%
to 2B6 Built_from_Q8SQ66_and_others_8 pep:novel
scaffold:BROAD0.5:scaffold_14886:627043:652084:-1
MNAGDLLLLLAVFLGFLLLLAKRPQKVNLPPGPTPLPLLGNLLQLGRHGLIKSFLEC
RDKYGDVFTVYLGTRRYFVICGPESVKKALVDNAEVFSGRGTLAISEMVFQRY
GFFVTYDGWKTLRRV
SIATLRDFGMGKRSIEEQIKAEAQCLVEELQKSQGALIDPTFIFHSVTANVICSIVFGER
FSYQDTQFREILNMFVEVFTILSSLWMQFFEQFPSVLKLLPGPHYRVLKIAQHIKDFISS
KIEQHQESLDPNSPRDFIDSFLLRIEKEKETPDSKYHVKNLVLTVLSLFFAGTETTSTTL
RYGFLLLLKYPQVAEKVQEEIDQVVGRDRAPEIKDRAKMPYTDAVIHEIQRFSDLLPMGI
PHMVTEDTSFQGYFLPKGTDVFILLSASLKDPCYFEKPHIFDPNHFLDAQGALKKNEAFI
PFSMGKRACLGEGIARTELFLFFTTILQNFSLVSSKPLEDLDIRPQCSGLGTLPQIYKLG
FLPR
>73%
to 2B6 last half of gene, seq. Gap upstream
Built_from_Q8SQ66_and_others_9
pep:novel scaffold:BROAD0.5:scaffold_14886:661530:672893:-1
Missing
exon 1 and exons 3,4 in two sequence gaps
REKYGDVFTIFLGSRPVVVLCGPDTVKEALVEKAEEFSGRGPIAMVDSVFQGLG
Sequence
gap
LFELFSGFLKFFPGPHRRSQSNLEFINAYIADNVEKHRQSLDPNAPRDFIDIYLLRMEK
EKGISGTEFHHKNLILTVLSLFFAGTETTSTTLRYGCLILLKYPGVAEKVQDEIDQVIGK
DRTPEIKDRAKMPYTEAVIYEIQRFSDLLPLGVPHCVTQDTSFRGYLIPKGTEVYPLLST
VLHDPKYFKKPYDFDPNHFLDAQGSLKKNEAFIPFSSGKRICLGEGIARMELFLFLTTIL
QNFSLRSSMVPADIDLTPRESGVGNVPPTFQIQFLP
>62%
to 2S1 Built_from_P24454_and_others_1 pep:novel
scaffold:BROAD0.5:scaffold_14886:762460:771873:1
two
frameshifts and a short seq gap in exon 1
MALATGVASPLLVLV
LVLVLLLALLVLRQGRPRSSRLPPGP
AALPLLGNVGQLWPGGXXXXXXX
(0)
LSAKYGPVFTVYLGSRPVVVLSGYRAVKEALVDQAEAFSGRGKIAGLEKTFHEH
GLFFANGEQWRLRKFTTLSLRLGMGKKEGEEHIQEEARCLLEALRSTQGSPLDPSLLLSQ
AVSNITCLLLFGKRFDYEDKKFQAMVLATAGILTEISSPWGQVCEMFMGPVCYLSDILKY
LSAPHGRLTRHLSTLAAFVSDQIQQHQETLDPEAPVRDFIDDFLLKMRQEEKAQGVGPDH
TDFLLTTVNLLFAGTVTVSATLRFAFLLLLKYPEFQDRIHEELNQELGRERAPSLGDRGR
LPYTDAFLHEVQRFLALIPMGVPRTVTKPTIFQGYELPQGIEVFPLLGSVLHDPEFFERP
KEFYPRHFLNADGRFIKNEAFLPFSSGKRICLGEGLARTELFLFFTTILQNFSLESPSPL
GALSLHPAISGFANIPPTFQLRFRPR
>58%
to 2C9 Built_from_Q29508_and_others_4 pep:novel
scaffold:BROAD0.5:scaffold_13575:1131749:1157623:-1
21kb
from CYP2E seq
MEPWALTTFFLVVCVSFLVFLSLWRKDYKGRNLPPGPFPLPIIGNLLQLGHNLSMSLCKL
SEKYGPVYTVYFGPQPVVVLHGYKALKEALTDQGDIFGERGHLPIIDDIYRGQGIVFSHG
EKWKQIRRFSLMTLRNFGMGKRSIEERVQEEAQFLLEELRKTNSQPFDPTFILGCAPCNV
ISSILFHQRFNYDDEEFLSMLRILNENVTLLNTPMAQLYNNFPWFLHYFPGPHKNFFSNV
KKLREFILKNAKKYQQTLDPNNLKNYIDCFLHKMQQDQKNPDSVFDLENLATAGMDLFDA
GTETTSTTLRYGLLLILKYPEVQNKIHEEIDQVIGRHRIPSIKDKLEMPYTEAVLHEILR
FVDLVPFSLPHEVTHDTQLQQHFIPKGTTVYPLLSSVLYDAKEFPNPKEFDPRHFLNKDG
SFKKSDYFVPFSIGKRACLGEGLAKMELFIFLATILQNFTLKSVIDSKEIDIKPGSTGLL
NVPPKYQLCLLPR
>62%
to 2C19 Built_from_Q9UEH3_and_others_1 pep:novel
scaffold:BROAD0.5:scaffold_13115:5872:11336:-1
exon
1 in a seq gap, exons 6-9 off the end of the scaffold
LAKKYGSIYTLYFGTQRVVVLHGYNIVKEALIDKGDIFMERGNVPIFEDTVK
VVFSRGERWKQIRRFSLMTLRNFGMGKRTIEERVQEEAQCLVEELRKTK
GQPNDPTFILGCAPCNVICSILFRERFNYKDEKFLYLMGILNENVQLFAKPWIQ
LYNFLPAFRVHLPGKHKQLFKNVEELKCFILERVKEHQEILDPNNPQDYIDCYLSKMQQ
90kb seq
gap between these two genes may have another 2C gene
>62%
to 2C19 complete Built_from_Q8QZW4_and_others_1 pep:novel scaffold:BROAD0.5:scaffold_13115:146250:169643:-1
MEPWGLTTTVLLTCVLFLIFLSLWNHGTKKGKLPPGPTPLPIFGNLLQFDFKNMAATMSK
LAKKYGSIYTLYFGMERVVVLHGYNIVKEALIDKGDIFMERGNVPIFEDAIKGQGVIFSR
GERWKQLRRFSLMTLRNFGMGKRSIEERVQEEAQCLVEELRKTKGQPNDPTFILGCAPCN
VICSILFRDRFKYKDEKFLYLMSLLNENFQLFTKPWIQFYNFLPAFRVHLPGKHNQFFKN
IGELKRFILERVKEHQEILDPNNPQDYIDCYLSKMQQEKNNPQSEFDVENLIMTGVDLFS
AGTETTSSTLRYGLLLILKHPEVQAKIHEEINRVIGHNRIPSIKDRQDMPYMDAVVHEVQ
RFIDLVPLNVPHAVNQDIQLQQYTIPKGTNVFPLLSPVLCDSKEFSNPDKFDLQHFLDKN
GSFKKSDYFMPFSAGKRACLGEGLARMELFLFLTTILQNFTLKPVGDPNEISVKNNHVGF
TNVPPYYQLCFLP
>oppossum
EST DR038220 96% to scaffold_13115:146250:169643
PSIKDRQDMPYMDAVVREVQRFIDLIPLNLPHAVNQDIQLQQYTIPKGTNIFPLLSPVLR
DSKEFSNPDKFDPQHFLDKNGSFKKSDYFMPFSAGKRACLGEGLARMELFLFLTTILQNF
TLKPVGDPNEISVKNNHVGFTNVPPYYQLCFLP
>scaffold_13599
has three full 2C genes and two pseudogenes
This
scaffold may be close to scaffold_13115 as part of the 2C cluster
>scaffold_13599
pseudogene parts between 5200000 and 5275000
related
to CYP2C sequences
first
pseudogene
GPVPFPTIGNTLQLDRRNIPESLCRVNK
VVILHGFKAVKEALIDGRNKFAARGSLPVFKFISGGLGMFLLDTDQKEGKE
Second
pseudogene
LLLCISCLLILVWKRGFGKGKLPPGPVPLPIVGNLLQLDLKNIPES
LAKEYGPVFTLQLGLDRVVVLHGYKAIKEALIDHGDNFSSRGAMPIFQVINN
Missing
exons
EKQQPQSEFTIDNLIWTVSDLFSAGTETMSTTLRYGLLILLKHPEI
VSEKIHEEIEHVIGRNRSPCMEDRNKMPYTNAVVQEIQRYVDLLPTGVPHAVSQDTQFRQYLIPK
GTTIIPLFTSILNDEEFPNSQQFDPGHFLDESGNFMKSDYFMPFST
GKRI*LGEGLARIELFLFFTTIL*NFTLKPLIDPKDIDTNPTANGFGKVPPPYKLCFQP
>67%
to 2C19 scaffold_13599 between 5275000 and 5305000 not annotated in browser
MDPSVVNAFGLLFCISCLLLILAWKKDFRRGKLPPGPVPFPIIGNILQLDLKNIPESLCK
LAKEYGPVFTLQLGFTRTVVLHGYKAVKEALIDHGDQFAARGHMPVFEFISQGL
GIVSSNGERWKQLRRFSLMTLRNFGMGKKSIEENVQEEAKLLVEAIKQTK
XXPCDPTFILGCAPCNVICSLIFQKHFEYKDPKFLYLMKLLDDDLKLLSSPWIQ
VYNYFSPLIHYLPGLHHKLFKITDLQKKFILEEVKEHQQTLDPNNIRDFIDCFLMKMEQ
EKQKPLSEFTIGNLVNTAIDLFAAGTETTSTTLKYGFLMLLKHPEIT
VTEKIHEEIDRVIGYNRSPCMEDRNKVPYTNAVVHEIQRYIDHIPTSLPHAVTEDVQFRQYLIPK
GTTIIPLLTSVLYDDEEFPNPHQFDPGHFLDASGNFKKSDYFMPFST
GKRICLGEGLARMELFLFFTTVLQNFTLKSLIDPKDIDTTPVDSGFGKIPPSYKLCFLP
>Built_from_Q9JJ02_and_others_10
pep:novel scaffold:BROAD0.5:scaffold_13599:5307180:5340000 region :1
removed
last three exons from another gene
added
these back from correct location
MGPSVVTALGLLFCISCLLLILSWRKGFGKGKLPPGPVPLPIIGNMLQLNLKNIPESLCM
LAKEYGPVFTLQLGVQRIVVLHGYKAVKEALIEHGEQFAARGPMPIFELVSNGFGIGVSN
GERWKQLRRFSLMTLRNFGMGKRSIEERVQGEAKFLVEELKKTKGLPCDPTFILGCAPCN
VICSLIFQKHFEYNDQKFLYLMKLLHEQVRIGSSAWIQFYNCFPSLVQHLPGPHRKLLKL
FHFLHTFILEEIKEHQGTLDPSNPRDLIDCFLMKMEQEKQQPLSEFNIDNLVNTVADLFG
AGTETTSTTLRYGLLMLLKHPEIT
EKIHEEIDRVIGHNRSPCMEDRNKMPYTNAVVHEIQRYIDLIPTSLPHLVTEDTQFRQYIIPK
GTTIIPFLSSVLYDEKEFPNPNQFDPGHFLDENGNFKKSDYFMPFST
GKRICLGEGLARMELFLFFTTILQNFTLKSLIDPKDIDTTPIDSGFGKIPPSYKLCFLP
>70%
to 2C18/2C19 Built_from_Q9JJ02_and_others_8 pep:novel
scaffold:BROAD0.5:scaffold_13599:5275928:5400120:1
in
5370000-5400000 region
removed
first exon since it is from another gene and replaced it
MVTAFGLVCCTLCLLLISALRKRCGKGKLPPGPGPLPIIGNILQLDTKNIPKSLCM
LAKVYGPVFTLYLGSKSVVVLHGYKAMKEALIDHGEEFAGRGSFPIIDAINKGLGLAFSN
GERWKQIRRFSLMTLKNLGMGKRSIEERVQEEAKCLVEALKKTNGMPCDPTFILGCAPCN
VICSIIFQKRFEYHDQKFLHLMKLLDEKVKILSSPWIQIYNLLPLLAQYLPGSHHKLFKI
SQMMHNFFLEKVKEHQDALDPNNPQDLIDSFLIKMEQEKEKPQSEFTMENLVCTVSDIFG
AGTQTTSTTLRYGLLLLLKHPEITGKIHEEIDRVIGHNRSPCLKDRNSMPYTDAVIHEMQ
RYIDLVPANLTHSVIQDVKFRQYIIPKGTTIIPLLTSVLYDNEEFPNPDQFDPGHFLDES
GNFKKSDYFMPFSAGKRICIGEGLARMELFLFFTTILQNFTLKSLIDPKDIDTTPIASGF
GNIPPSFKLCFLPS
>Built_from_Q9JJ02_and_others_7
pep:novel scaffold:BROAD0.5:scaffold_13599:5275928:5470669:1
removed
first 4 exons and last exon from another gene
and
replaced them
this
gene in the region 5400000-5440000 scaffold_13599
LILVLCISCLFLISSRKKSHGKGQLPPGPFPLPIVGNLLQLDTKHIDKSLGS
LTKVYGPVYTLHFGSERVVVLHGYEAVKEALIDHGEEFAARGSLPIIDAVSKGF
GLVFSKGERWKELRRFSLMTLRNLGMGKRSIEERVQEEAKYLVEEFKKS
XXPCDPKFILECVPCNVICSVIFQKRFEYSDRKLQTLMELLDENIKILTSPWIQ
VYNFIPSLVHYLPGPHRTFLNN
CKIMHNFIEEKVKEHQETLDSNNPQDFIDYFLIQMGQKKQNQQSEFTMENLILTVSDLFI
AGSETTSTTLRYGLLLLLKYPEITDKIHEEIDRVIGRDRSPCMKDRNSMPYTDAVIHEIQ
RHLDLIPFNLPHAVKQDTRFREYVIPKDTTIFTSLSSVLYDEKEFPNADQFDPGHFLDES
GNFKKSDYFMPFSI
XKRACVGEGLARMELFLFFTNILQNFTLKPLIDPKDIDTTPISNGFGCVPPSYKLHFLPV
>68%
to 2C19 this gene in the region 5440000-5470000
scaffold_13599
MDPSVVTALGLIFCVSCLLLISAWRKGFGKGKLPPGPTPLPIIGNLLQLDTKNINKSFCE
LAKTYGSVFTLYLGSERAVVLHGQKAVKEALIGNGDAFAGRGSFPISETINKGL
GLLFSNGERWKQIRRFSLMTLRNFGMGKRSIEERVQEEAKRLVEALKNT
GLPCDPTFIFGCAPCNVICSVVFQKHFEYQDKKFLTLMEYLNENLQILSSPWIQ
VYNLFPSLIHHLPGIHHKVIKNFRALNDFVLERVKEHQETVDPNDPRDFIDCFLMKMEQ
EKQNPKSEFIIENLVSTTIDLFGAGTETTSTTLRYGFLLLLKHPQIV
DKIREEMDQVIGQNRSPCMKDRSSMPYTDAVIHEIQRYIDLVPTSLPHAVTQDVKFRQYLIPK
GTTIIPLLTSVLYDNEEFPNPEQFDPGHFLDESGNFKKSDYFVPFSI
GKRACVGESLAQMELFLFFTTILQNFTLKPLVDPKDIDVTPISNGFNHVPPCYELCFLPS
>CYP2C
57% to 2C18 Built_from_Q6PER7_and_others missing 216 aa
257423 -
310084 bp (257.4 Kb) on chromosome fragment scaffold_13485
This
transcript is located in sequence: contig_23460
LYNMYPSLIKHLPGSHRTINKNVLEVRNFIMDEVKKHQETLDPNNPRDYIDGFLIKIQQE
KLNPQSAFNYQELMATGSNLFSAGTETTSSTLRYGLLLLMKHPKIQDKVHEEIDRVLGSS
RKPSMQDRVKMPYVDAVVHEIQRYIHLLPFSLPRLAAQDIHFQKYVIPKGTSVFPLLYSV
LYDRKAFPNPYEFDPENFLDKSGNFQKNDHFVPFSLGKRLCLGESLARMEVFLFLTTILQ
NFTLKPMVEPKELVTTPLRNGIVNIPSIYKLSLIP
>CYP2D
Database
location : contig_35735 16579 to 16764 (-)
Genomic
location : scaffold_11452
49597 to 49782 (-)
scaffold_11452:47242:51381:-1
GENSCAN00000009132
pep:Genscan scaffold:BROAD0.5:scaffold_11452:52083:52720:-1
this
fragment 63% to 2D6
missing
exon 1 in a sequence gap
SRRSSGKVFSVQLLWKPAVVLSRPDAVREALVHRSEDTAGRPPSLVYSHLGFGPKCPRQ
VVLAQYGEAWKEQRRFSLTTLRNLGLGKQSLERWVTAEADFLCSAFAAR
Sequence
gap missing exons 4-5
This
frag 80% to 2D6
AKGNPESSFNDENLRMVVVDLFTAGMVTTATTLTWALLLMILYPDVQ
KVQEEVDVVIGRSRRPTMKDQAHMPFTNAVIHEVQRFGDIVPLGIPHMTTRDTEIQGFFIPK
GTVLITNLSSVLKDEATWEQPYRFYPEHFLDAEGRFVKPEAFMPFSA
GRRACLGEPLARMELFLFFTCLLQRFSFSVPVGQPRPSTHGFFAFPVAPLPYQLCAVVR
>67%
to 2E1 Built_from_Q29508_and_others_6 pep:novel
scaffold:BROAD0.5:scaffold_13575:1164439:1178780:-1
21kb
from CYP2C seq
MAFLGATLALLVWFVFLLFVSVWKQIQSSWKLPPGPFPLPIIGNLFTLDIRNFPKSFAKL
AKEYGPVFTVYVGSRRIVVLHGYKVVKEALLDLKNEFAGRADIPAFEAHQNSGIIFNDSE
TWRDTRRFGLTVLRDYGMGKQSNEERIQRESHFLVEALRNTNSQPFDPTFVLGGGPLNVI
HDILFHSRLDYSDKMCQRLLHLYNEDFYLLSSPWIQVYNNIGSYLRYLPGSHRKLLKNIS
EAKQYVFEKVKEHQEVVDLNHPYDLTDTLLIQMEKDQKKKKLGFNLENVTVTVADLLFAG
TETTSTTLRYGLLLLLKHPKVEEKLHEEIDRVIGPHRLPSMKDKINMPYMEAVVHEIQRV
INLVPSNLPHVMHEDIHFRGYLLPKGTTVYPTLDSVMLDSEEFPNPEQFDPGHFLNENGK
FRYSDYFKAFSAGKRVCVGEGLARMEIFLFLTIILQHFNLKPLISPEKIDTSPLTVGFGT
IPPIYKLCVLPRS
>61%
to 2J2 Built_from_P12791_and_others_8 pep:novel
scaffold:BROAD0.5:scaffold_16806:796848:899843:1
be
aware this seq is a hybrid of two genes, yellow region identical
LLLSAILLILASYLQRKRRQANYPPGPSPLPLLGNFFQIDFKKPQVAFQEVRSERGGPER
TETFNPEHFLENGQFTKREAFLPFSAVKLKILNIDRNYFHNLGISLSNGQIWKDQRRFTL
MTLRNFGLGKKTLEFRIQEEATYLTEAIKEEKGQPFDPHFQINNAVSNIICSVTFGYRFE
YHDSQFRELLKILDEVMVLHGRWECQLFEMFPWIMRFLPGPHHKLFREWKKLQSFVTQII
KQHKEDQNSEEAQDYIDAYLKELSKGDVSSSFSEGNLASCTLDIFFAGTETTSTTLRWAL
LYMALYPEIQGKIQAEIDRVIGQSRQPTMADKENMPYTNAAIHEVQRMGNIIPINVPRVA
TVDTTVAGYHVPKGTVLMTNLTALHRDPKEWATPETFNPEHFLENGQFKKRESFLPFSAG
KRVCLGEQLARAELFIFFTCLLQRFTFQAPPDTQLSLDFRTGVTISPVPYKICALPRET
>64%
to 2J2 Built_from_P12791_and_others_10 pep:novel
scaffold:BROAD0.5:scaffold_16806:860579:998071:1
be
aware this seq is a hybrid of two genes, yellow region identical
MLLSLWEGASLLTLLLVSTVLLILAFYVQRGRQHPNFPPGPPLLPIFGNIFQMDPKKPQD
TFQEFAKKYGNIFCLKLFGAPMICVTGLPLIKEVLLKQGQVFIDRPQTPWSSYAFKHHGI
SLSNGQIWKDQRRFTLMTLRNFGLGKKTLEFRIQEEATYLTEAIKEEKGQPFDPHFQINN
AVSNIICSVTFGYRFEYHDSQFRELLKILDEVMVLHGRWECQLFEMFPWIMRFLPGPHHK
LFREWKKLQSFVTQIIKQHKEDQNSEEAQDYIDAYLKELSKGNMSSSFNEDNLVACTLDL
FFAGTETTSTTLRWALLYMALYPEIQGKIQAEIDRVIGQSRQPTMADRENMPYTHAAIHE
VQRMGDIVPLNVPRIATVDTTLAGYHVPKGTMLLTNLTALHRDPKEWATPDTFNPEHFLE
DGQFKKREAFLPFSAGKRVCLGEQLAQPELFIFFTCLLQRFTFQAPPDTQLSLDFQFGLT
ISPVPYQICALSRET
>63%
to 2J2 Built_from_Q9YGF1_and_others_9 pep:novel, complete
scaffold:BROAD0.5:scaffold_1903:224902:254645:-1
MQFPSLWEGASLKSLLLFSAVFLMLASYLQRRRRHPNYPPGPFQLPFLGNFFHMDHKNPH
MAFYQLAKKYGNIFSLELVGCPVIVVTGLSLIKEVLVNQGQVFVDRPHTPLRSYVFKKLG
LIMSNGQEWKDQRRFTLMTLRNFGLGKRTLELRIQEEAMYLTEAIREEKGQPFDPHFQIN
NAVSNIICSVTFGNRFEYHDSQFRELLKILDEAMMLQTKWECQFFEMFPRIMKFLPGAHK
KLFREWKKLESFVLDIIRQHKENPNSEEAQDFIEAYLTELSKGNISPSFSEDNLVSCTLD
LFLAGTETTSTTLRWALLYMALCPEIQGKIQAEIDRVVGQSRLPTMADRENMPYTNAAVH
EIQRMGNIVPFNVPRMSTVDTTVAGYHVPKGTLLVTNLTALHRDPKEWATPDAFNPEHFL
EDGQFKKRESFLPFSA
GKRVCLGEQLARTELFIFFTCLLQRFTFQAPPDTQLTLDFRIGLTISPAPYKICAIPR
>65%
to 2J2 Built_from_Q9YGF1_and_others_8 pep:novel missing last exon
scaffold:BROAD0.5:scaffold_1903:224902:329336:-1
note:
this assembly originally used the last exon of the above gene.
Look
between 254645 and 279000 on – strand for C-term
TLLLFSAILLILASYVQRRRRRQQLKYPPGPLRLPIFGNVFQIDTTKAHVSVQEFVRKFG
NIFSMELFGVPMMTVTGLPLIKEVLINQGQVFVDRPQFPLQTYTFKSVGLLMSNGQEWKD
QRRFTLMTLRNFGLGKKTLELRIQEEATYLIEAIREEKGQPFDPHFQINNAVSNIICSVT
FGNRFEYHDSQFRELLKSLDQVSVLQASWECQFFNIIPWIMKFLPGAHKKLFREFKKLES
FVVHVIKQHKEDQNSEEARDFIDAYLKELSKGNNSSSFNEDNLVSCTLDLFFAGTETTST
TLRWALLYMALYPEIQGKIQSEIDRVIGQSRQPTMADRENMPYTNAAIHEVQRMGNIVPF
NVPRVAVVDTIVAGYYVPKGTLLVTNLNALHRDPKEWATPDTFNPEHFLENGQFKKKESF
LPFSM
GKRVCLGEQLARAELFIFFTCLLQRFTFQAPPDTQLSLDFRAGLTISPAPYKICALPRETQPKVGL*
>64%
to 2J2 Built_from_Q9YGF1_and_others_5 pep:novel
scaffold:BROAD0.5:scaffold_1903:224902:380321:-1
modified:
only first two exons are new
look
between 330000 and 378000 on – strand for last 7 exons
VLPSLQTLLLFSAILLILASYVQRRRRRQQSKYPPGPLRLPIFGNVFEIDIKKPHISIQE
CVRKYGNIFSMDLLSAPMIVVTGLPLIKEVLVNQGQVFVDRPRSPLQSYIFKIN
GLFMSNGQEWKDQRRFTLMTLRNFGLGKKTLELRIQEEAIYLTEAIREEKGE
PLNPHFQINNAVSNIICSVTFGNRFEYHDSQFRELLKSLNQVSVLQASWECQ
(0)
FFDIFPRIMKLLPGAHKKHFR
NSRKLESFVVHVIKKHKEDQNSEEAQDFIDAYLKELSKXX
GNITSSFNENNLVACTLDLFFAGTETTSTTLRWALLYMAFYPEIQ
XKIQAEIDRVIGHSRWPTMADRENMPYTYAAIHEVQRMGNIVPLNAPRVATVDTTLAGYHVPK
GTMLLTNLTALHRDPKEWATPDTFNPEHFLEDGQFKKKEAFLPFSA
(1)
GKRVCLGEQLARAELFIFFTCLLQRFTFQAPPDTQLSLDFQAGMTLSPVFYQICALPRETEPKLIL*
>83%
to 2R1 Built_from_Q9QXF7_and_others_1 pep:novel
scaffold:BROAD0.5:scaffold_12644:255201:265755:1
LRSRRQARFPPGPAGLPFIGNIFSLAASAELPHVYMEQQSRVHGQIFSLDLGGLFTVVLN
GYDMVKECLIHQSEIFADRPALPLFKKLTKMGGLLNSTYGRGWLDHRRLAVNSFRCFGSG
QKSFESKIAEEAKCFIDAVDTYQGKPFDLKQLITNAVSNITNVIIFGERFSYEDTEFQHM
IEIFSENVELAASASVFLYNSFPWIGIFPFGKHQQLFKNAAVVYDFLSKIIEKVSANRKP
QSPQHFVDAYLDEMDKGRNDPGSTYSKENLIFSVGELIIAGTETTTNVLRWAILFMGLYP
NIQAQVHKEIDLIVGPNRTPSLEDKPQMPYTEAVLHEVLRFCNVVPLGIFHATSQDTVVR
GYSIPRGTTVITNLYSVHFDKKYWKDPEVFYPERFLDSQGQFVKKEALVPFSLGRRHCLG
EQLARMEMFLFFTSLLQRFHLHFPPDLVPNLKPKLGMTLQPLQYLICAEKR
>74% to
2U1 missing N-term Built_from_Q7Z449_and_others
1937657 -
1968473 bp (1.9 Mb) on chromosome fragment scaffold_14967
This
transcript is located in sequence: contig_70233
RRRGVPPGPRPWPLVGNLGFMLLPAVFK
19 aa Gap
here
SSQVVLEYLSRLYGPIFSFYLGPYLVVVLSDF
HSVREALVQQGEVFSDRPRLPLISVFTKEKGIVFAHYGPIWRQQRKFSHSTLRHFGLGKL
SLEPKIIEEFKYVKEEIQKHGGNPFSPFPIISNAVSNIICSLCFGQRFEYNNSDFKKMLH
LMSRGLEISVSYQMMLINICSWFYYLPFGLFKEIRQIEKDLTVFLKGIIREHRETLDVEN
PQDFIDMYLLHMEEEMKSNSNTSFDEDYLFYIIGDLFIAGTDTTTNTLLWCLLYMSLNPE
VQEKVQKEIEKVIGPDRAPSLTDKVHMPYTEATIMEVQRMSAVVPFGIPRMTSEKTTLQG
YTIPKGTMIIANLWAIHRDPAIWENPKNFSPERFLDEEGQLIKREHFIPFGIGKRVCMGE
QLAKMELFLMFVSLMQNFIFTFPKDAKKPIMPGKFGLTLSPHPFNVIVSKR
>55%
to CYP2W1 Built_from_Q6VVW9_and_others_1
scaffold:BROAD0.5:scaffold_5766:862320:898262:-1
ALLGLLLSWALFCLLTRSKERAGKWPPGPAPLPFIGNLHLLDLRRQDKSLMKISEKYGPV
FTIHFGLQKMVVLTGYQAVKEALVDSAEEFADRPPIPIFQRIQEGQGIFFSSGNLWKTTR
KFTMSSMHKLGMGKKLIAKKILEEFSFLEELIDSFKGEPFKLKLFNMAPTNVIFFLLFGE
RFDYQDPTFVTFIRLIDEVMVLLGSPFLHLFNFYPFLGWFLKPHKTVLVKIEEVRVILRK
YMVASRQNISRGYVTSYIDALIQKQNLLPHGEFSASIFHIFPWMKTGAEKPFCNGREWTQ
TKPFVSFLLAKVQDELDRVLEKSRLPEYEDQKALPYTNAVVHEIQRFIALLPHVPHSTSV
DTHFRGYFIPKGTPVIPLLTSVLLDKTQWETPNKFNPSHFLDADGNFVKKAAFLPFSIGH
RVCIGENLAKMEMFLFFASLLQRFTFHPPPGIQEADLDITPQLTFTMRPQPQAVCAVSR
>62%
to CYP2AB1P Built_from_Q6P0T4_and_others_1 pep:novel
scaffold:BROAD0.5:scaffold_15223:222913:242918:-1
MFSLATGLAILATSFLLLRMLAFFLARTQFPPGPCPLPILGNLLQLRFQLHPEKLSQLTR
KYGSIFTVWLGSTPVVVLNGFQAVKDALVTHSEDFADRPVTPLFEDLFGDKGIISTSGHA
WQQQRRFGLITLRALGMGKKVLEQRLQEEAQYLVEIFHRQNGTSFDPHVPIVRAAANVIC
ALVFGHRFPHGDPFFQELMKAIDFGLAFVNTIWRRLYDAFPWLLRQLPGPHRKIFRYQEI
VKSLICQEIERHKQRVPEDLEDFISCYLAQITKRKDDPASTFDEENLIQVIIDLFLGGTE
TTATTLRWALLYMIHHRDVQGKVQQELDTVLGPSRVISFKDRKLLPYTNAVLHEVQRFCS
VISVGAVRKCGTATTVQGFPIQKGTIVLPNLASVLCDPEHWETPWQFNPGHFLDGEGNFV
IHEAFLPFSAGHRVCLGELLAKVELFLVFAHLLREFRLRAPAGASTNERDYILWGTKQPR
PYDICASPRL
>64%
to 2AC1P Built_from_Q6PA33_and_others_1 pep:novel
scaffold:BROAD0.5:scaffold_12611:1500312:1533637:-1
LDLLSILSGLSLILILILNMKLTLTKNFKKQSPPGPKPLPVIGNLHILNLKRPYQTMLEL
SKKYGPIFSLRMGPKTVVVLSGYETVKDALVNYSEQFGERARIPIFERIFEGKGIVFSHG
ENWKITRRFSLTTLRNFGMGKRVIEERILEECHHLIQVFESHQGKPFEISTIMSASVANI
IVSILFGKRFDYKDPQFLRLLHLIGENIRLAGGPSITIFNMFPVLGFLLQDLKRVLRNRD
ELFSFIRTTFLKHLRKLDKNDQRSFIDAFLIKQQEEKDKSDDYFNNDNLVALVSNLFAAG
TETTSSTLRWGILLMMKYPEIQKKVHNEITEVIGSAQPRIEHRTQMPYTDAVIHEIQRFS
NILPMNLSRETTTDVIFKNYYIPKGTEVITLLTSVLQDQTQWEKPCTFHPQHFTKEGKFI
KRDAFLLLFSSLFLTCVLAGQRMCAGESLAKMELFLFFTSLLQKFTFCPSPGVSNSDLDL
TPDIGFTTRPQPYKICALP
>55% to
3A4 Built_from_Q6LEQ2_and_others cyan insertion
2878085 -
2933432 bp (2.9 Mb) on chromosome fragment scaffold_12616
This
transcript is located in sequence: contig_103558
GLSTETWTLLVAFVTLLILYGIWPYGIFKKLGIPGPRPLPFFGTFLEYRKGILEFDKQCF
QKYGKMWGFYDGRLPILAILDPDIIKIVLVKEFYTLFTNRRNFGLNGILDSGITVAEGEK
WKRIRSIISPTFTTGKLKEMFPIIKHHIDVLVNNIEKKVAQDESVNMKEHLWSLQVLDVI
TTSFGVDIDSIHTKPNDPLLVHIKKLLSFSFMSPLLILICIPYQSLVLEPISVLRQKVMI
YFKKKEGEKKGIDTKKDRVDFLQLMIDSQVMNGSRSEKRNNSPKALTEMEIVAQAVTFIF
AGYETTSTTLNFITYNLATHPEIQKKLLEEIDSTLPNKAVPTYDTIFQMEYLDMVVNETL
RLFPLGGRIERICQKTAEINGITIPKGTVMLIPVYVLHHDPEYWPEPEEFRPERFDQEGR
KSIDPYVFLPFGAGPRNCVGMRFALLTLKTALVTLLQNFTMEPCKETPIPLELETKGFMQ
PKKPIILKLVPRPRP
>64% to
3A4 Built_from_Q98T91_and_others grey = error region
941307 -
1075313 bp (941.3 Kb) on chromosome fragment scaffold_19210
This
transcript is located in sequence: contig_73184
MNIIPSLSAGTWTLIVLFLTLLYLYGTRTHKLFKNLGIPGPKPLPFFGTVFSYRKGLVNF
DYDCFKKYGKTWGFYDGRQPVLATMDPETIKTVMVKECYSVFTNRRSFGPVGSLESAITV
AKDDQWKRIRTVLSPTFTSGKLKEMFPIINQYGDVLVKNMKKEAEKNKPVTMKDILGAYS
MDVITSTSFGIHVDSLNNPNDPFVREIKKLIRFNFLDPLILSVAIFPFLIPLFNKLDLTV
FPKEATDFLAKSIIKIKEERTKSTEKASRPLQCLEEEYVYNVNDLSDEEILAQSIIFIFA
GYETTSSVLSFLFYHLATNPKIQEKLQKEIDAFLPNKEAVTYDALVQMEYLDMVINENLR
LYPIAGRIERVAKKTVELNGLTIPKGTVVMAPPYVLHRDPEYWPEPEEFRPERFSKENKE
SINPYVYLPFGAGPRNCIGMRFALMSMKVAVSRLLQEFSFRPCKETQIPLKLSNQPLLTP
TVPIVLQAELRN
>66% to
3A4 Built_from_P51538_and_others
23961 -
170013 bp (24.0 Kb) on chromosome fragment scaffold_11888
This
transcript is located in sequence: contig_58548
MNIIPNLSAGTWTLIILFLTILYLYGTRTHKLFKNIGIPGPTPFPFIGTILYYRKGIVGF
DYGCYKKYGKTWGFFDGTKPVLAIMDPETIKTVLVKECYSVFTNRRMLGLSGILEKAISI
AEDEEWKRIRTVLSPAFTSGKLKEMFPIINQYGDVLVKNMKKEAEKSKPVTMKEIFGAYS
MDIIISTSFGIHVDSLNNPNDPFVREIRKLIRFSFLDPLILSITIFPFLIPLFKKLDITV
FSKDATDFLGKSILRIKEERKKSTEKHRVDFLQLMMDSQTSKNSESHSQKDLSDEEILAQ
SIIFIFAGYESTSSVLCFLFYQLATNPGIQEKLQKEIDAFLPNKEAVTYDALVQMEYLDM
VINENLRLYPITGRIERIAKKPVELNGLMIPKGTVVMAPPYVLHRDPEYWPEPEEFRPER
FSKENKESINPYVYLPFGVGPRNCLGMRFALMSMKVAVSRLLQEFSFRPCKETQIPLKLS
YRPLLAPSVPIVLQAVLRNKKGN
There are
5 CYP4ABX family members shown here plus a CYP4A pseudogene
>Gene
4.1
Query
location :
CYP3Aamp 430 to 456 (+)
Database
location : contig_55667 10964 to 11044 (+)
Genomic
location : scaffold_1650
18357 to 18437 (+)
(1)
GKGLLSLEGEKWHYHRRLLTPAFHFNILKDYIYIMRNTVSLML (0)
AGGAAAAGGCCTTCTGAG
CTTAGAAGGTGAGAAATGGCATTACCACCGGCGCCTACTGACCCCAGCCTTCCACTTCAA
CATCCTGAAGGACTACATCTACATAATGAGGAACACTGTCAGCCTGATGCTGGT
(0)
DKWEKLRTKDSSIDVFDFISYMSLDTSLKATFGLQDFNKEES (2)
AGGATAAATGGGAAAAACTCAGAACCAAGGACAGTTCCATAGACGTCTTTGACT
TTATCTCTTACATGTCCTTGGACACTTCCTTGAAAGCTACCTTCGGTCTCCAGGATTTCA
ATAAAGAAGAAAGGT
(2) LFSYFQNMNKLLYLVRKRMETFLYYSDFIYKLTSDHYEFQTTCKELKEEP
(1)
AGCTTATTCAGCTATTTCCAG
AATATGAACAAACTTCTATATCTTGTGAGGAAGCGCATGGAAACCTTCCTCTATTACAGT
GATTTCATTTA
CAAGCTGACTTCTGACCATTATGAATTCCAGACTACCTGCAAAGAATTA
AAGGAGGAGCCAGGT
(1)
AKIIRHRRALYKNQSEKDKVPKKKYLNFLDILFQAK (0)
AGCTAAGATCATCCGGCATAGGAGGGCACTCTACAAGAACCAGAGTGAAAAAGACAAGGT
GCCAAAGAAGAAGTACCTGAATTTCCTGGACATTCTCTTCCAAGCCAAAGT
(0)
DEDGGGFTNEELENEVNTFRFAGQDTVAV
GFSWTLYCLAMNPEYQKKCREEIQGILKDGESITW
(2)
AGGATGAAGATGGAGGAGGCTTCACAAATGAGGAACTAGAAAACGAG
GTAAACACATTCAGGTTTGCAGGACAAGACACTGTAGCTGTTGGGATTCTCCTGGACCTT
ATACTGCC
TGGCCATGAACCCAGAATATCAAAAGAAATGTAGAGAAGAGATCCAAGGTAT
TCTGAAAGATGGAGAATCCATAACCTGGT
(2)
DHLTQMTYSTMCIKESFRMYSPIPKVARTLNQPITFPDGQTLPS (1)
AGGGACCA
TCTTACCCAGATGACTTACAGCACCATGTGCATCAAAGAATCCTTCCGCATGTATTCACC
TATCCCCAAAGTAGCCAGGACACTCAACCAGCCCATCACTTTCCCAGATGGACAGACTTT
ACCTTCAGGT
(1)
DTTVTINIWALHYNPAIWENPK (0)
AGACACGACTGTGACCATAAACA
TCTGGGCTCTGCACTACAACCCTGCAATCTGGGAAAACCCAAAGGT
10961 (0)
VFDPERFTPENTKKRHPYAFLPFSAGLR (2) 11044
AGGTCTTTGACCCTGAAAGATTCAC
CCCAGAGAATACTAAGAAGAGACACCCTTATGCTTTCTTACCATTTTCTGCTGGGCTCAG
GT
NCIGQQFAMNLIKVSLALTLLRFELLPDLEKPPIPISQIVLRSTSGFHFYLKPLR*
AGGAACTGCATTGGGCAGCAATTTGCCATGAATCTAATAAAGGTGAGCCTGGCC
CTGACACTGCTCCGCTTTGAGCTACTTCCAGATCTTGAGAAGCCTCCAATCCCCATATCC
CAAATAGTTCTCAGGTCTACGAGTGGTTTCCACTTCTATCTAAAGCCACTACGTTAG
49% to
4X1, N-term 3 exons in seq gap, 58% identical to gene 4.2
GKGLLSLEGEKWHYHRRLLTPAFHFNILKDYIYIMRNTVSLML
(0)
DKWEKLRTKDSSIDVFDFISYMSLDTSLKATFGLQDFNKEES
LFSYFQNMNKLLYLVRKRMETFLYYSDFIYKLTSDHYEFQTTCKELKEEP
(1)
AKIIRHRRALYKNQSEKDKVPKKKYLNFLDILFQAK (0)
DEDGGGFTNEELENEVNTFRFAGQDTVAV
GFSWTLYCLAMNPEYQKKCREEIQGILKDGESITW
DHLTQMTYSTMCIKESFRMYSPIPKVARTLNQPITFPDGQTLPS
(1)
DTTVTINIWALHYNPAIWENPK (0)
10961 (0)
VFDPERFTPENTKKRHPYAFLPFSAGLR (2) 11044
NCIGQQFAMNLIKVSLALTLLRFELLPDLEKPPIPISQIVLRSTSGFHFYLKPLR*
Gene4.2
Query
location :
CYP4
126 to 168 (+)
Database
location : contig_55667 42355 to 42483 (+)
Genomic
location : scaffold_1650
49748 to 49876 (+)
MGADKIPSWLETHWTRPLHLALTFVLALLLLQVVKLYLRRQG
LLRALRLFPGPPPHWLFGNQRE
(0)
ATGGGCGCGGATAAGATCCCTTCTTGGCTGGAGACGCACTGGACTCGGCCCTTGCACTTG
GCCTTGACGTTCGTTCTCGCGCTGCTGCTGTTGCAGGTCGTCAAGCTCTACCTCCGGCGC
CAGGGACTGTTGCGAGCCCTGCGCCTCTTCCCCGGACCCCCTCCACACTGGCTCTTTGGG
AACCAGAGAGAGGT
(0) FYLEKELQQFNVLPEQYPCALPLWVGAFQVLLNIYDPEYAKILLNRR
(1)
AGTTTTACCTGGAAAAAGAACTCCAGCAATTTAATGTGTTG
CCAGAACAATACCCTTGTGCTCTCC
CTCTCTGGGTTGGGGCCTTTCAGGTGCTTTTAAATATCTATGACCCAGAATATGCGAAAA
TTCTTCTGAACCGAAGAGGT
(1)
DPKIQHGYKFIIPWI (1)
AGATCCCAAAATACAGCATGGGTACAAGTTTATTATCCCCTGGATTGGT
(1)
GGGLLSLEGKKWYQHRHLLTPAFHLSILKPYIHVMNDSVCRML
AGGAGGAGGACTGCTGAGCCT
GGAAGGAAAGAAGTGGTACCAGCATCGCCATCTGCTGACTCCTGCCTTCCACTTGAGCAT
TCTGAAGCCCTACATCCATGTGATGAATGATTCAGTCTGCAGGATGCTGGT
(0)
DTWEKLSTQDNSVEICEPIRLMTLDSIMKCAFSVQTSSQTES
(2)
AGGATACATGGGAGAAGCTCAGCACCCAGGACAATTCTGTGG
AGATCTGTGAGCCCATTCGCCTGATGACCTTGGACAGCATCATGAAATGTGCCTTCAGTG
TCCAGACCAGCTCCCAAACAGAAAGGT
(2) FSTNYLSTVTKLSELIFCRLNNYLHHNDLIYRWSSQGQEFQALCQIAHQLP
(1)
AGCTTTTCTACCAACT
ACCTCTCAACTGTGACAAAACTCTCAGAACTAATCTTCTGCCGCCTGAACAACTACCTCC
ATCACAATGACTTGATTTACAGGTGGAGTTCTCAGGGGCAAGAATTCCAAGCTCTCTGCC
AAATAGCACATCAGCTCCCAGGT
(1)
AKIIQERREALKNNSEQDKIRKKKFLDFLDVLLCAK (0)
AGCTAAGATCATC
CAGGAAAGGAGGGAAGCACTCAAGAATAATAGTGAACAGGACAAGATCCGAAAAAAGAAG
TTCTTGGATTTTCTAGATGTTCTTCTTTGTGCCAAAGT
(1)
SENGEGLSNEELEAEVNTFVFGGHDTTASSLSWIFYCMAMNPEHQHQCREEIRNIIKYGDTITW (2)
AGAGTGAAAATGGAGAAGGCTTATCAAATGAAGAGCTAG
AGGCTGAGGTTAACACATTTGTGTTTGGTGGTCATGACACTACAGCCTCTAGTCTTTCCT
GGATCTTCTACTGTATGGCCATGAACCCAGAGCACCAACACCAATGTCGAGAAGAGATCA
GAAATATCATAAAATATGGGGATACCATTACCTGGT
(2)
DHLDQMPYSTMCIKEALRLYPPSITIARELSKPITFPDGRFLPT (1)
AGGGACCACCTAGACCAG
ATGCCCTACAGCACCATGTGCATCAAGGAGGCCCTCCGCCTCTACCCACCTAGCATCACT
ATAGCCAGAGAACTTAGCAAACCCATCACCTTTCCAGATGGACGCTTCTTGCCCACAGGT
(1)
GMTVVLNIWALHHNPTVWENPQ (0)
AGGCATGACAGTT
GTCCTGAATATCTGGGCTCTCCACCACAACCCTACTGTCTGGGAAAACCCACAGGT
(0)
VFNPERFSQENSMKRHSYAFLPFSAGPR (2)
NCIGQQLAMLELKVGLALTLLRFELLPDLEKPPIPMPHLVLRSKNGIHLYLRPLH*
AGGAACTGCATTGGACAACAGCTTGCCATGTTGGAACTGAAGGTGGGACTGG
CCCTGACCCTGCTCCGTTTTGAGCTATTACCAGATCTGGAGAAGCCTCCTATTCCAATGC
CCCACTTGGTTCTCAGGTCTAAGAATGGGATTCATCTATACCTAAGGCCACTGCACTAG
54% to 4Z,
4X 55% to 4A11
MGADKIPSWLETHWTRPLHLALTFVLALLLLQVVKLYLRRQG
LLRALRLFPGPPPHWLFGNQRE (0)
(0)
FYLEKELQQFNVLPEQYPCALPLWVGAFQVLLNIYDPEYAKILLNRR (1)
(1)
DPKIQHGYKFIIPWI (1)
(1)
GGGLLSLEGKKWYQHRHLLTPAFHLSILKPYIHVMNDSVCRML
(1)
DTWEKLSTQDNSVEICEPIRLMTLDSIMKCAFSVQTSSQTES (2)
(2)
FSTNYLSTVTKLSELIFCRLNNYLHHNDLIYRWSSQGQEFQALCQIAHQLP (1)
(1)
AKIIQERREALKNNSEQDKIRKKKFLDFLDVLLCAK (0)
(1)
SENGEGLSNEELEAEVNTFVFGGHDTTASSLSWIFYCMAMNPEHQHQCREEIRNIIKYGDTITW (2)
(2)
DHLDQMPYSTMCIKEALRLYPPSITIARELSKPITFPDGRFLPT (1)
(1)
GMTVVLNIWALHHNPTVWENPQ (0)
(0)
VFNPERFSQENSMKRHSYAFLPFSAGPR (2)
(2)
NCIGQQLAMLELKVGLALTLLRFELLPDLEKPPIPMPHLVLRSKNGIHLYLRPLH*
GENE 4.3
SCAFFOLD_14847 208370-233419 + STRAND, 62% to 4A11
MGTGLGLAWLPRDFSSFLQTSVLLSLVLLLLKGVQLYRRRQWLLRTFQNFPGPPAHWFNGHFWE
(0)
YQKADEITVTLFWAKQFPSAFPRWFSGFTVALQVYDPEYMRILLGRP
(1)
DPKADKFYRLLAPWI
(1)
GKGLLILNGSTWFQHRRLLTPAFHYDILKPYVALMVDSVLVML
(0)
KKWEKLITQDSSLEIFEHVSLMTLDTIMKCAFS*IHRNCQMER
(2)
NADDYIQAVKEQAILVFSRVRNDLYHNDFVYWFSPQGYQARRWAHLAHNHT
(1)
DQVIKKRKQHLHQEGGLEAILKKRHLDFLDILLCSK
(0)
TENGDSLSDKELRAEVDTFMFEGHDTTASGISWLLYSLAMNPEHQQKCREEIRDLLGDEMAIGW
(2)
EHLNRMPYTTMCIKESLRLYPPVTSISRDLSKPLTLSDGRYLPA
(1)
GTIVTLHIHALHHNPSVWPEPE
(0)
VFNPLRFSPENLTSRHTHSFLPFSAGTR
(2)
NCIGQQFAMNEMKVAVALTLLHFHLEPDATQPPQLFPRVVLRSKNGIHLKLTRI*
MGTGLGLAWLPRDFSSFLQTSVLLSLVLLLLKGVQLYRRRQWLLRTFQNFPGPPAHWFNGHFWE
(0)
ATGGGCACTGGCCTGGGATTAGCCTGGCTCCCCAGAGACTTCTCCAGCTT
CCTCCAGACTTCAGTGCTGCTGAGCTTAGTCCTGCTGCTGCTCAAGGGGGTCCAGCTGTA
CCGGCGCAGGCAGTGGCTTCTTAGAACCTTTCAGAACTTCCCAGGCCCTCCTGCCCACTG
GTTCAATGGACACTTCTGGGAGGT
YQKADEITVTLFWAKQFPSAFPRWFSGFTVALQVYDPEYMRILLGRPGEKETP
AGTATCAAAAAGCTGATGAAATAACGGTGACACTGTTCTGGG
CCAAGCAATTCCCTTCTGCCTTTCCTCGGTGGTTCTCTGGGTTCACAGTGGCCCTCCAAG
TCTATGACCCTGAATACATGAGGATTCTGCTGGGCAGACCAGGT
DPKADKFYRLLAPWI
(1)
AGATCCCAAAGCTGATAAATTCTACAGATTATTGGCTCCCTGGATTGGT
KKWEKLITQDSSLEIFEHVSLMTLDTIMKCAFS*IHRNCQMER
(2)
AGAAAAAATGGGAG
AAGCTCATCACCCAGGACTCGAGTCTAGAGATCTTTGAGCATGTCAGCCTGATGACACTG
GACACCATCATGAAATGTGCCTTTAGCTGAATCCATCGCAACTGCCAGATGGAGAGGT
NADDYIQAVKEQAILVFSRVRNDLYHNDFVYWFSPQGYQARRWAHLAHNHT
(1)
AGGAATGCTGATGACTATATCCAGGCTGTGAAGGAACAGGCAATCCT
CGTATTCTCTAGAGTTCGAAATGATCTCTACCACAATGACTTCGTCTACTGGTTCAGTCC
TCAGGGCTACCAGGCCCGCCGGTGGGCCCACCTGGCCCATAACCACACAGGT
DQVIKKRKQHLHQEGGLEAILKKRHLDFLDILLCSK
(0)
AGACCAGGTGATTAAGAAAAGGAAGCAGCACCTCCATCAG
GAAGGAGGCTTGGAGGCTATCTTGAAGAAAAGGCACTTGGATTTCCTGGACATTCTCCTC
TGTTCCAAGGT
TENGDSLSDKELRAEVDTFMFEGHDTTASGISWLLYSLAMNPEHQQKCREEIRDLLGDEMAIGW