Biochemistry 811

A Second Lecture on Evolution

D. Nelson, last modified Nov. 25, 2003 3PM

Reading: Berg, Tymoczko and Stryer 5th edition, parts of Chapter 7

Note: due to a reorganization of this course, four lectures on evolution have been reduced to three. This means that the next two lectures have been assembled from three previous ones. Some less interesting material has been ommited. The four full length lectures are still online as the 2001 notes.

SOME FEATURES THAT ARE NOT CONSERVED AMONG ALL THREE DOMAINS

One feature that is not well conserved in all three domains of life is ribonucleotide reductase. As we discussed in the section on ribonucleotide reductase, there are three very different enzymes that use different cofactors to catalyze this important reaction, removal of the 2' OH group from ribonucleotides. Sequence similarity is very low between the three classes. As shown in Fig. 7.3, p. 172, 3-D structure is preserved even when sequences diverge greatly. Sequence similarity has now been shown between the class I (tyrosyl radical, bacteria, eukarya) and class II enzymes (adenosylcobalamine, bacteria, archaea, uses NTPs). The crystal structure for a class III enzyme (glycyl radical, anaerobes) has been solved and the fold is a 10 stranded beta/alpha barrel as in the class I enzyme, even though the sequences are very different. (Science 283, 1499-1504 1999) The class III enzyme has a glycyl radical instead of a tyrosine radical and it is poisoned by oxygen (it actually cuts off its own tail). The class I and class II forms apparently evolved from the anaerobic class III enzyme after oxygen arose in the atmosphere in order to make an oxygen stable radical that would not self destruct. The fact that there are de novo pathways for ribonucleotides but not for deoxyribonucleotides (they are only made from ribonucleotides) suggests that DNA came after RNA in evolution.

A related issue is the question of which came first DNA or protein. In an article discussing this (Science 286, issue of 22 Oct. 1999, pp. 690-692) RNA is assumed to evolve before DNA or protein, but which came next? The answer seems to be that protein came next because protein catalysts are apparently necessary to make ribonucleotide reduction possible and this had to happen before DNA was ever made. The mechanism is a rare and energetically costly one and it looks like it may be the only possible way to do this reaction, since no other enzyme type has been found to do it. The possibility that RNA could catalyze a free radical reaction seems unlikely, therefore protein was needed before ribonucleotide reductase could exist and therefore before DNA could be made.

The lipids of modern archaea are ether linked lipids, while bacteria and eukarya have ester linked lipids. The archaea with the highest heat tolerance have ether linked lipids that are covalently coupled across the bilayer. This adds to their membrane stability at high temperatures. Christian de Duve suggests that the common ancestor was an ether linked species that had to switch to ester linkages to survive a drop in temperature. However, Tom Cavalier-Smith suggests that ether linked lipids replaced ester linked lipids in Archaea during adaptation to hyperthermophilic environments. The present day archaea have some members that can grow at 121 degrees (in an autoclave), but the most thermophilic bacteria can only stand about 80 degrees. De Duve suggests this is related to membrane lipid composition. If the last common ancestor had ether linked lipids, then the transition to ester linked lipids had to occur twice, once in the bacteria and again in the eukarya. If the chimeric model of eukarya is correct, the gram negative bacteria that fused with an archaea could have brought the ester linked lipid biosynthetic genes along. It is less complicated if the evolution of ether linked lipids happened only once in the Archaea.

Murein is only found in bacteria not archaea. It is a cell wall component made of sugars and D and L amino acids. The presence of both D and L amino acids suggests an ancient origin for this material. Archaea may have lost the ability to make murein, however, it is possible that the synthesis of murein evolved after bacteria and archaea split. Cavalier-Smith views loss of murein as an adaptation to thermophily.

Cholesterol and related sterols are eukaryotic lipids. Yeast make ergosterol, not cholesterol, and plants make cycloartenol derivatives, though palms do make cholesterol. Two or three bacteria are known that make demethylated lanosterol products, on the way to sterols, but the gene required (CYP51) is argued to be derived from a lateral gene transfer from plants. So sterol biosynthesis seems to be a eukaryotic phenomenon. This makes sterols (and steranes) a biomarker for eukaryotes.

Archaea and eukarya share some distinctive features, including N-linked glycoproteins, absence of N-formylmethionine and introns in some of their tRNAs(see PNAS 92, 5761-5764 1995). Bacteria use N-formylmethionine at the beginning of all their proteins. There is a special tRNA for this amino acid. Eukarya and archaea do not have N-formylmethionine. Cavalier-Smith identifies 19 shared traits which unite these two groups and exclude bacteria.

The archaea Methanococcus jannaschii has five histone genes, This implies the DNA is packaged in nucleosomes as in eukaryotes. The 3-dimensional structure of these histones show they share a common ancestor with eukaryotic histones. This requires a more complex transcription machinery to get at the DNA. Bacteria don't have histones and they have a much simpler transcriptional apparatus.

Archaea and bacteria both have polycistronic operons, and some of these have the genes arranged in a similar order in both lineages. This implies that the operons existed in the common ancestor. Recently, there has been some evidence that C. elegans, a model organism for genome sequencing also has operons. This was unheard of in animals before these C. elegans operons were described. The recent sequencing of Caenorhabditis briggsae (diverged about 100 MYA from C. elegans) shows these operons are highly conserved.

DEVELOPMENT OF PHOTOSYNTHESIS AND THE OXYGEN CRISIS

Life in the early earth was anaerobic. In the history of photosynthesis, photosystem I must have developed first. PSI cannot split water to make O2, so the first source of electrons for this pathway had to be mineral compounds. Eventually, PSII evolved from PSI and the ability to split water and form O2 changed the planet. Here is a quote "The creation of a photosynthetic apparatus capable of splitting water into oxygen, protons and electrons was the pivotal innovation in the evolution of life on Earth. This event literally changed the face of the Earth." (Dismukes, G.C. et al. The origin of atmospheric oxygen on Earth: The innovation of oxygenic photosynthesis. PNAS 98, 2170-2175 2001) O2 is the source of many toxic oxygen compounds such as superoxide anion, hydrogen peroxide and hydroxyl radical. These are known as reactive oxygen species (ROS). Cells had to protect themselves from these oxygen byproducts of photosynthesis. This required the evolution of free radical scavengers like vitamins C and E, and enzymes like superoxide dismutase and catalase.

Oxygen released to the environment by photosynthesizing cyanobacteria would react with any oxidizable substance. The early oceans were believed to contain a lot of Fe2+ that could react with oxygen to form rust. There is evidence from 3.75 billion year old sedimentary deposits called banded-iron formations, that Fe2+ served as an oxygen sink. These deposits formed from 3.75 billion years to about 1.7 billion years. As these deposits began to lessen, oxygen began to appear in the atmosphere, starting about 2 billion years ago. It has remained pretty constant since 1.5 billion years ago. The inference is that iron in the oceans served as an oxygen trap for almost 2 billion years until it was all oxidized. As the iron became depleted, oxygen concentrations in the atmosphere could rise.

Organisms that were not O2 producers either had to hide from O2 or adapt. There are still anaerobes that hid from the toxic effects of O2. The other organisms adapted and developed protections from oxygen. These organisms eventually learned how to exploit O2 as the ultimate electron acceptor of their electron transport chains and gained a great advantage in the amount of energy that could be recovered in the form of ATP.

The presence of oxygen also opened up the possibilities of oxygen chemistry that were not available before. Note that none of the amino acid biosynthesis pathways requires molecular oxygen, but many of the breakdown pathways do. You should also be aware that cholesterol biosynthesis requires oxygen to remove a 14 alpha methyl group from the cholesterol ring structure. This is needed to make cholesterol planar, which is an important feature of this critical eukaryotic lipid. Cholesterol cannot be made without oxygen and the cytochrome P450 CYP51 (lanosterol 14 alpha demethylase).

THE ACQUISITION OF MITOCHONDRIA AND CHLOROPLASTS BY EUKARYOTES

HYDROGENOSOMES PROVIDE CLUES

It is thought that the earliest eukaryotes had no mitochondria or chloroplasts, but they acquired these organelles by endosymbiosis. Today the most ancient branches on the eukaryotic tree all represent groups without mitochondria. The earliest eukaryotes seem to be the Archaezoans diplomonads (includes Giardia), and the trichomonads. The evidence is very strong that mitochondria were taken in as endosymbionts of the alpha proteobacteria. Phylogenetic trees of the rRNAs from bacteria and mitochondria show this plainly. A paper published in Nature (Nature 396, 133-143 1998) shows the Rickettsia are the closest living relatives of mitochondria. A similar case is made for the chloroplast's origin among cyanobacteria. Hydrogenosomes and peroxisomes may also be endosymbionts, but they do not contain a genome of their own today.

Note on microsporidians: Sequences of the largest subunit of RNA polymerase II show that microsporidians are really related to fungi. Their deep branching on rRNA trees is artifactual. PNAS 96, 580-585, 1999 Microsporidia are related to Fungi:Evidence from the largest subunit of RNA polymerase II and other proteins. Microsporidians also contain a characteristic insertion in their EF1 alpha protein that is only seen in the fungal and animal clade (Opisthokonta). This strongly suggests they must belong to this clade.

An article published in Science 275, 790 1997 strongly suggests that the amitochondriate eukaryotes mentioned above actually did have mitochondria, but they lost them, or they became hydrogenosomes. Mitochondrial-like heat shock protein genes are found in the genomes of these organisms. A second article from 2002 identifies some additional HSP70 like genes from amitohondriate cells. (Parasitol. Int. 51, 9-16, 2002) Because of this, the use of Giardia as a model of the primitive eukaryote before mitochondria were acquired may not be viable. In fact, there does not seem to be any living model for the earliest eukaryote that existed before mitochondria. Recent work has shown that Giardia are part of a very early branch on the eukaryotic tree. All eukaryotic alanyl tRNA synthetases in eukaryotes have been replaced by a bacterial version probably from the mitochondrial endosymbionts genome. This did not happen in Giardia where the synthetase is like an archaeal gene. The authors of this paper say that Giardia did not complete the transition to mitochondria, but lost the mitochondria before this key event took place (Origin of mitochondria in relation to evolutionary history of eukaryotic alanyl-tRNA synthetase PNAS 97, 12153-12157 2000 (Oct. 24 issue). Since this is the only organism with this property, they (diplomonads) must be the first branch on the eukaryotic line.

Hydrogenosomes are organelles found only in eukaryotes that do not have mitochondria. These organelles are surrounded by a double membrane like mitochondria. They are the site of pyruvate fermentation to produce acetate, CO2 and H2. ATP is formed by substrate level phosphorylation, so these organelles resemble mitochondria in that they produce ATP. They do not have pyruvate dehydrogenase. Instead they use an enzyme called pyruvate ferredoxin oxidoreductase, and they have hydrogenase. These enzymes are not found in mitochondria, but they are seen in anaerobic bacteria. Hydrogenosomes do not have an electron transport chain and F1F0 ATPase. They do not perform oxidative phosphorylation. But they do have an ADP/ATP carrier to exchange ATP and ADP with the cytosol. Recently a ciliate (Nyctotherus ovalis) was found with a hydrogenosome that still has a genome of its own (Nature 396, 527-528 and commentary on 517-519 1998). The small subunit rRNA gene was PCRed out and found to match with other mitochondrial versions of this gene.

One set of proteins found in hydrogenosomes are the heat shock proteins Hsp70, Hsp60 and Hsp10. These are ideal sequences to use for phylogenetic analysis. As we saw in the first evolution lecture, Gupta And Golding used them to argue for a hybrid eukaryote genome. A sequence analysis of the hydrogenosome Hsp70 and Hsp60 proteins showed a signature sequence characteristic of these proteins in mitochondria and gram negative purple bacteria. Trees made with other available sequences of all three Hsp proteins placed each of them in the mitochondrial Hsp group. Since this happened in all three cases, the hydrogenosome Hsps and the mitochondrial Hsps appear to have a common bacterial ancestor. In anaerobic environments where these organisms live, it was not useful to retain the OXPHOS genes encoded in the mitochondrial genome, which may explain why most hydrogenosomes have no genome of their own. These organelles are apparently a degenerate version of the mitochondrial ancestor.

Before the eukaryotic ancestor could engulf these bacteria, it would have to evolve into a phagocytic cell, with the ability to take a whole living bacterium inside itself. This is not possible with a rigid cell wall. Therefore, the precursor of the eukaryote host had to lose the ancestral cell wall.

An example of symbiosis between amebas and a bacterium has been documented in the lab. Kwang Jeon of our own UT (at Knoxville) was studying amebas when he noticed a bacterial infection had killed most of the cells in one of his cultures. He isolated the bacterium and tested it on another group of amebas. Again it killed most of them, but some survived and had 40,000 bacteria living inside the ameba cells. He cultured the amebas for many generations and then tried an experiment where he removed the nucleus from these infected amebas and transplanted it to the original uninfected strain. The amebas died unless he "infected" them with the bacteria. In other words, the host had become dependent on the bacteria and needed them to survive. LINK TO AN ENDOSYMBIOSIS PAGE

HOMEOBOX GENES AND MACROEVOLUTION

(see Molecular Biology of the Cell, chapter 21)

Humans, worms and flies don't look very similar and they do not go through the same developmental stages. Yet the genes that control their body shape and organization are related in sequence. These genes all share a common sequence called the homeobox. The homeobox was only recognized as a special sequence in 1984. This 183 nucleotide sequence codes for 61 amino acids found in these proteins. The rest of the proteins may be very different, but this 61 amino acid piece is crucial for their function. The homeodomain is a helix turn helix DNA binding domain that recognizes a specific DNA sequence. The homeodomain targets the remainder of the protein to regulate the gene expression of any genes with the appropriate recognition sequence in their control regions. There are at least 50 homeobox genes in Drosophila. They fall into two main divisions, the complex superclass and the dispersed superclass. Those in the complex group are found in clusters, the dispersed group are solo genes.

One subset of these genes are called homeotic selector genes. In Drosophila, there are 8 genes arranged in a series along 650,000 base pairs of DNA. This whole region is called the HOM complex. There are two smaller subsets of these genes in the HOM complex, the antennapedia complex (5 genes) and the bithorax complex (3 genes). Other insects have these genes all in one complex, so it looks as though the HOM complex became split in Drosophila.

Mutations in the 8 genes of the HOM complex cause large scale mutations in flies. A mutation in bithorax causes a fly to have an extra set of wings. Mutation in antennapedia causes a leg to grow where an antenna should be. These genes are not master switches for making wings or legs, but they specify position in the fly's body. The order of the genes on the chromosome is the same as the order of segments in the fly's body where they are expressed. The left most gene is expressed in the head, the right most gene is expressed in the abdomen. When a gene is deleted or mutated, the segment where it is normally expressed cannot tell where it is because its position clue is gone, so it behaves like the closest segment to it. (A dominant mutant behaves like the next posterior segment. A recessive mutant behaves like the next anterior segment.) That is why a bithorax mutation causes an extra set of wings. The segments adjacent to the bithorax segment dictated what should be made.

An amazing fact is that these HOM genes have clear homologs in vertebrates. These are called hox gene clusters. Mice have four hox gene clusters on four different chromosomes. These are called HoxA ,B, C and D. HoxB has all the same genes as HOM plus one more. They are in exactly the same order. The other three segments are missing some of the HOM genes, but they have some extra homeobox genes not in the HOM cluster.

The HOM cluster seems to have arisen by gene duplication of a single homeobox gene long ago. This whole cluster then was duplicated in total four times in the lineage of vertebrates . Some additional gene duplication and deletion resulted in the present day set of Hox genes in mammals. These genes specify position in mouse embryos, just like they did in flies. They seem to have a similar function to the HOM cluster in flies, except it is more complicated in mammals because there are four clusters. Two sets of hox gene clusters are expressed in limb buds in perpendicular directions. The gene products from one cluster are expressed along a left to right axis in the limb bud(HoxD) and the other gene cluster is expressed top to bottom in the same bud(HoxA). This creates a checkerboard pattern that makes each position in the limb bud unique, like the elements of a mathematical array that are described by x, y and z coordinates. If a single gradient in the fly can specify the development of different symmetrical segments, like head, thorax and abdomen, then a dual gradient in the limb bud can specify the development of asymmetry in the limbs, things like the bones and muscles of the hand, the layout of nerves and blood vessels, what is to be skin and fingernails.

The HoxA cluster in mouse has 11 genes, Drosophila has eight genes in the HOM cluster. HoxA has added three extra genes. Probably, if one looks back at simpler organisms there will be some that have fewer homeotic genes in these clusters, or fewer clusters. The Annual Review of Biochemistry 1994 has an article on homeodomain proteins (Vol. 63, 487-526). There, evidence is cited for one hox cluster in acorn worms (a hemichordate), two hox clusters in amphioxus (a cephalochordate) and three (or 4) in lamprey (a primitive vertebrate). More recent work shows three probable clusters in a fresh water lamprey (Sharman AC, Holland PW Estimation of Hox gene cluster number in lampreys. Int J Dev Biol 42,617-20 1998) Another paper shows one Hox cluster in ribbon worms PNAS 95, 3030 1998, that are more primitive invertebrate animals. One hox cluster is seen in sea urchins. It is tempting to extrapolate that gain of hox genes in a cluster increases the complexity of an organism by allowing additional segments to be specified. Initially these would be just like adjacent segments, but there would be opportunity to evolve into more specialized functions. For example, if there are three sets of legs in insects, could another set of legs be added just by duplicating a hox gene that specified a leg segment of the body? What do the hox gene clusters of spiders, centipedes and millipedes look like? Are there dozens of duplicated hox genes that specify many identical segments? This provides the possibility of macroevolution. Duplication of hox genes, or whole hox gene clusters, followed by deletion and mutation might alter a species very dramatically in a short time period.

Note added in 1997: The International Society of Developmental Biologists and the Society for Developmental Biology met in July 1997 at Alta, Utah. Researchers reported that centipedes and onychophorans, primitive, wormlike creatures believed to be the closest living relatives of the organisms that gave rise to the arthropods, including insects, have the same eight homeobox (Hox) genes as insects themselves. This indicates that the diverse body segments of manibulata (myriapoda[inclues centipedes and millipedes] + hexpoda[inclues insects and springtails]) did not evolve as a result of Hox gene duplication as previously thought, but may instead have arisen as a result of changes in Hox gene regulation. (Science 277, 639 1997). This is very exciting, because it offers many opportunities to evolve by changing the regulation of the hox genes and not the number of these genes.

Note added in 1998: The 21 August issue of Science p. 1119 says that there are seven Hox clusters in zebrafish on seven different chromosomes. There are two copies of Hoxa, Hoxb and Hoxc with only one copy of Hoxd. The interpretation is that the whole fish genome duplicated to give eight Hox clusters and one Hoxd cluster was lost. In Mol Phylogenet Evol 9, 375-381 1998, Hox genes in the simplest known animal Trichoplax adhaerens are discussed. At least 5 Hox genes in sea anemone were detected by PCR Biol Bull 193, 62-76 1997.

Another homeobox gene in Drosophila is eyeless. This gene appears to be a master switch gene that turns on eye formation(see Science 267, 1766-1767 and 1788-1792 1995). If eyeless is expressed in tissues where it normally would not be active, whole functional eyes form. These may be on the end of antenna, on the wings or on the legs. This gene has homologs in mouse (small eye, Pax-6) and man (aniridia) that also affect the formation of eyes. In fact, the mouse gene can substitute in Drosophila for the eyeless gene. This means that eye formation is controlled by a gene that evolved before eyes evolved. (invertebrates and vertebrates diverged 600 million years ago). The common ancestor apparently had a light sensitive tissue that later evolved into different types of eyes in insects and vertebrates. Eyeless controlled the development of that light sensitive tissue and it has continued in that role for at least 600 million years.

SOLVING THE MAMMALIAN RADIATION

The Oct. 97 Trends in Genetics has an article on mapping the cat genome. A genetic map of the cat is being made along with other mammalian species, such as cattle and mouse. When the maps are compared, there are regions that have the same order of genes in different species. These regions are called syntenic regions. The regions are descended from the mammalian ancestor without rearrangement caused by chromosome translocations, breaks, fusions or inversions. Surprisingly, the whole X chromosome and chromosome 11 seems to be retained intact in cats and humans, though chromosome 11 does seem to have an internal rearrangement in it. Between cats and humans there are 32 recognizable syntenic blocks. This number is about 200 for mouse and human and about 50 for cattle and humans, though the maps are not completed yet. By using these blocks to compare genomes of the mammals, it will be possible to decide the evolutionary history of mammals. This is not easily done using protein sequences because there was a rapid evolution of mammals after the dinosaurs became extinct about 65 million years ago. This has been called the mammalian radiation. This makes the percent differences between the gene sequences very similar, so conventional trees do not work to make the determination. The mapping of mammalian genomes to identify unique chromosomal breaks and rearrangements do not suffer from this same problem, so the order of branching on the mammalian tree should be firmly established by comparative genome mapping.

Comparative genomics was the subject of a major article and Genome Map X in the 1999 Science genome issue (15 Oct.). This article carried forward the concept of comparative mapping to decide the history of the mammalian radiation. The chromosome rearrangements are similar to intron insertions or losses in that they are exceedingly rare events. Therefore they can act as evolutionary tags saying this only happened once. For example, the comparison of primates suggests that all large scale differences between chromosomes can be explained by only 7 translocation events in the past 60 million years. Cats and humans have only 13 events that can explain the differences in their chromosomes. One very significant event in human evolution was the fusion of two ape chromosomes (12 and 13) to form human chromosome 2. All other great apes have 24 sets of chromosomes and humans have 23. This is a defining event in our history. (Genome, the autobiography of a species in 23 chapters Matt Ridley Harper Collins, New York, 1999 p. 24) for more detail see Comparison of the Human and Great Ape Chromosomes

THE PLANT, ANIMAL, FUNGI TRICHOTOMY

The use of unique evolutionary events can resolve other difficult branchings on the tree of life. One good example is the order of branching of plants, animals and fungi. Gene trees based on single genes from these Kingdoms can support all 3 possible branch patterns. However, EF-1alpha has a 12 amino acid insertion that is only seen in fungi and animals, but not plants or protists. Proc. Natl. Acad. Sci. USA 90, 11558-11562 (1993). These rare events can clarify evolutionary events in the history of life. In this case plants are shown to branch before fungi and animals. This has been supported by other evidence.

THYMIDYLATE SYNTHETASE FUSION TO DHFR

Dihydrofolate reductase is required to recycle dihydrofolate to tetrahydrofolate in the synthesis of thymidylate from dUMP. In bacteria these two enzymes are in an operon. In plants and some protists, these genes are fused together so that they make a single protein with both activities. Animals and fungi do not have the fused gene. Since this is an extremely rare event, all the eukaryotes with the fusion probably shared a common ancestor. (Science 297, 89-91 2002). This type of digital evolutionary signature is helping to unravel the history of eukaryotic evolution.

A LINK TO A PAGE ON THE ORIGINS AND EVOLUTION OF THE PROTISTS

MITOCHONDRIAL EVE

Mitochondrial DNA mutates at a rate that is about 17 times greater than nuclear DNA. This is probably due to lack of effective repair mechanisms. Because this DNA changes so rapidly, it can be used as a monitor of evolution on a time scale of a few hundred thousand years rather than millions or billions of years. It is a fast molecular clock. In addition, mitochondria are inherited maternally, so there is no genetic recombination to account for. The line of descent is direct from mother to mother, because fathers do not contribute mitochondria to the egg on fertilization. By comparing DNA from people from around the world and especially in Africa, it is possible to build a tree showing the divergence of human mtDNA over time. This tree can be rooted by using chimpanzee mitochondrial DNA as an outgroup sequence, and the time of the last common ancestor can be estimated. This was first done in 1987 (Nature 325, p. 31), but it was criticized for inadequate sampling of populations and weak methods for making the tree. The process was repeated with DNA sequence from 189 people (121 from Africa) using a hypervariable region of mitochondrial DNA. (Science 253, 1503-1507 1991). The results were similar in both cases, though the critics could not find as much to fault in the second paper.

The results were, that the 14 deepest branches on the tree were all of African origin. This implies that modern humans evolved in Africa. The time of the last common ancestor of all human mitochondrial DNA types is 166,000 to 249,000 years ago assuming that chimpanzees and humans diverged from 4-6 million years ago. A more recent date is 145,000 +- 20,000. Am J Hum Genet 2000 Apr;66(4):1362-83

A new study has completely sequenced 53 human mitochondrial genomes (Nature Dec 7, 2000) with similar results but with better statistics.

One inference from this conclusion is that there was one woman whose mitochondria gave rise to all present human mitochondrial genomes. This is the concept of a mitochondrial Eve. This idea was immediately misunderstood to mean that there was only one woman alive at this time. The result does not suggest that. Such a finding would create a tremendous genetic bottleneck in human history. What the evidence does say is that the present population of human mitochondrial DNA did have one founder mother. She was the lucky one whose mitochondria have survived 200,000 years. All her contemporaries have had their mtDNA lineages fizzle, by not having children, or not having female children to be more specific. There could have been a large population of humans 200,000 years ago, in fact the population is estimated to have been about 100,000 people based on statisitics of polymorphisms found in modern humans.

The newest data comes from an Australian skeleton called Mungo Man 62,000 years old. This DNA does not fit with any modern mtDNA, but matches best to a mtDNA insert on human chromosome 11. It seems to be an extinct sequence with no modern relatives. This is causing some controversy. PNAS 98 537-542 2001

Y CHROMOSOME ADAM

Males do not have to be left out of this analysis. Portions of the Y chromosome are unique to males and are inherited paternally. As long as a region of the Y chromosome is out of the pseudoautosomal region, it cannot recombine with other alleles and so it is very analogous to mitochondrial DNA, except it evolves at a much slower rate. To do these same types of calculations with Y chromosome DNA, a 729bp fragment of the ZFY gene has been sequenced from 38 men. There was no difference found. The numbers of samples is too small yet, or perhaps a more variable region should be used, like an intron.

However, the project has been continued and in 1997 advances were made in finding polymorphisms on the Y chromosome. At least 93 polymorphic sites have now been identified in 900 men scanned (Science 278, 804-805 1997). One of these is an A in non-human primates and a few African men. 15% of the Khoisan people have A, 5-10% of Ethiopians and Sudanese have A. All peoples from outside Africa have T and most Africans also have T. The interpretation is that this A-T mutation occurred very early in human evolution after humans split from apes. The majority of living humans now have the T allele. The Khoisan may be the closest living relatives to the Y chromosome Adam. The interpretation is that humans evolved in Africa between 100,000 to 200,000 years ago. The most recent data Dec. 7 2000 Nature gives an even younger age (about 59,000 years ago) This fits with the mitochondrial Eve data. A second group of researchers found a similar result in studying 1544 men (Mol Biol Evol 15, 427-41 1998). They found an A present in chimpanzees and only a few Africans. Most modern humans have G at this site. The Khosian people also had the highest frequency of the A polymorphism. They estimated an African origin for humans between 150,000 and 200,000 years ago.

The newest data on Y chromosome Adam can be found in Nature Genetics 26, 358-61 2000. This study looks at 160 polymorphic markers on 1062 men from around the world and concludes a more recent origin for modern humans of about 59,000 years in Africa. A related study done on 1007 European men shows migration patterns in Europe over the last 40,000 years (Science Nov. 10, news article 1080 research article on 1155, 2000). An award winning book on the subject has appeared [Mapping Human History Steve Olsen 2002, Mariner Books]

HOW MANY GENES DOES IT TAKE TO MAKE A LIVING CELL?

The sequences of >150 genomes are now complete at Genbank and many more are in progress. Go to TIGR Genome Projects Page Go to Magpie Genome Projects Page At private companies like ERGO in Chicago, hundreds of bacterial genomes are sequenced. Mycoplasma genitalium is the smallest known genome that is not a virus. It codes for 468 proteins, that have been called the minimal set for life. This is not strictly true, since there are probably some genes in this set that are specific for M. genitalium and won't be found in other unrelated genomes. Mushegian and Koonin compared the M. genitalium genome with the H. influenza genome (1703 protein coding genes)to identify those genes common to both.(see PNAS 93, 10268-10273 1996) These are gram positive and gram negative organisms, so they diverged about 1.5 billion years ago. Any homologous genes are probably essential. They found 240 genes. Some essential genes were missing, because the same function in some pathways was performed by different non-orthologous genes in the two organisms, so these missing genes had to be accounted for. That added back 22 genes. Then they looked for redundant functions and parasite specific genes, since both organisms are parasites, and subtracted 6 more genes to get a total of 256 protein coding genes as the minimal set. Later gene knockout experiments showed about 15% of these genes could be knocked out of Mycoplasma genitalium and the cells survived, so the number is closer to 220 genes. (Koonin, How many genes can make a cell: The minimal-gene-set concept. Annu. Rev. Genomics and Hum. Genet. 1, 99-116 2000). Experiments in disrupting these genes and their orthologs in other Mycoplasma species gives the surprising result that 111 of these confirmed essential genes have no known function. This is almost half of the total in a bacterial cell. The conclusion must be that we really are missing many fundamental aspects of biochemistry, even after sequencing over 150 genomes. (Science 286, 2165-2168 1999)

The authors point out that parasites import metabolic intermediates, but they do not import proteins, so the minimum number of genes illustrates what must be done after all possible intermediates are imported from a rich environment.

Another study (Proc Natl Acad Sci U S A. 2003 Apr 15;100(8):4678-83. Essential Bacillus subtilis genes.) found 192 indispensible genes in B. subtilis, but they predicted another 79 would be essential (271 total).

A news article from Mid November 2002 says that Craig Venter has a $3 million grant from the Dept. of Energy to construct the minimal cell. This will then be engineered to make specific products. He is interested in producing hydrogen as a fuel from a bioreactor. If this were scaled up, hydrogen could offer a non-polluting alternative to oil.