Lactate Dehydrogenase (LDH) Gene Duplication During Chordate ...

24 downloads 0 Views 2MB Size Report
Aug 29, 1997 - L-Lactate dehydrogenase (L-LDH, E.C. 1.1.1.27) is encoded by two or three loci in all vertebrates examined, with the exception of lampreys, ...
Lactate Dehydrogenase (LDH) Gene Duplication During Chordate Evolution: The cDNA Sequence of the LDH of the Tunicate StyeZaplicata David W. Stock, *iclJoseph M. Quattro, *2 Gregory S. Whitt,t_ and Dennis A. Powers* *Department of Biological Sciences, Hopkins Marine Station, Stanford University; Evolution, University of Illinois at Urbana-Champaign

and TDepartment

of Ecology,

Ethology

and

L-Lactate dehydrogenase (L-LDH, E.C. 1.1.1.27) is encoded by two or three loci in all vertebrates examined, with the exception of lampreys, which have a single LDH locus. Biochemical characterizations of LDH proteins have suggested that a gene duplication early in vertebrate evolution gave rise to Ldh-A and Ldh-B and that an additional locus, Ldh-C arose in a number of lineages more recently. Although some phylogenetic studies of LDH protein sequences have supported this pattern of gene duplication, others have contradicted it. In particular, a number of studies have suggested that Ldh-C represents the earliest divergence among vertebrate LDHs and that it may have diverged from the other loci well before the origin of vertebrates. Such hypotheses make explicit statements about the relationship of vertebrate and invertebrate LDHs, but to date, no closely related invertebrate LDH sequences have been available for comparison. We have attempted to provide further data on the timing of gene duplications leading to multiple vertebrate LDHs by determining the cDNA sequence of the LDH of the tunicate Styelu plicatu. Phylogenetic analyses of this and other LDH sequences provide strong support for the duplications giving rise to multiple vertebrate LDHs having occurred after vertebrates diverged from tunicates. The timing of these LDH duplications is consistent with data from a number of other gene families suggesting widespread gene duplication near the origin of vertebrates. With respect to the relationships among vertebrate LDHs, our data are not consistent with previous claims that Ldh-C represented the earliest divergence. However, the precise relationships among some of the main lineages of vertebrate LDHs were not resolved in our analyses.

Introduction Comparisons of nuclear genome sizes and numbers of loci encoding isozymes and other proteins in a variety of chordates led Ohno (1970) to propose that episodes of large-scale gene duplication had occurred during the evolution of the group. The more recent discovery that many genes involved in the control of development are members of multigene families with more representatives in vertebrates than in invertebrates has led to increased interest in documenting the patterns of gene duplication in chordate evolution (Holland 1992; Holland et al. 1994). The sizes of some of these families, coupled with the difficulty in determining gene number in a large sample of taxa, suggest the value of continued examination of smaller, more extensively studied gene families in detecting episodes of gene duplication in chordate evolution. One such gene family encodes L-lactate dehydrogenase (L-LDH, E.C. 1.1.1.27), which catalyzes the interconversion of pyruvate (the end product of glycolysis) and L-lactate with the concomitant interconversion of NADH and NAD+ (Markert 1984). Extensive surveys of LDH locus number, expression pattern, kiand immunochemical similarities in netic properties, representatives of the major taxa of jawed vertebrates (Marker& Shaklee, and Whitt 1975; Fisher et al 1980; Whitt 1984) have suggested that they all possess orthologs of Ldh-A and Ldh-B. LDH-A is expressed pre’Present address: Department State University.

of Anthropology,

* Present address: Department of Biology, ence, University of South Carolina. Key words: tunicate,

molecular

phylogeny,

The Pennsylvania

Program

in Marine Sci-

gene duplication.

Address for correspondence and reprints: David W. Stock, Department of Anthropology, The Pennsylvania State University, University Park, Pennsylvania 16802. E-mail: [email protected]. Mol. Biol.

Evol. 14(12): 1273-1284. 1997 0 1997 by the Society for Molecular Biology

and Evolution.

ISSN: 0737-4038

dominantly in white skeletal muscle and is kinetically suited to the reduction of pyruvate to lactate under conditions of low oxygen availability, while LDH-B is predominantly expressed in more aerobic tissues and is tailored to the oxidation of lactate to pyruvate (Markert 1984). An additional locus encoding an isozyme with more variable kinetic properties (LDH-C) is expressed in the sperm of mammals and columbid birds and in a variety of tissues or the liver or eye of ray-finned fishes (Markert, Shaklee, and Whitt 1975). Jawless fishes possess either a single locus (lampreys) or, in the case of hagfishes, two loci with tissue expression patterns similar to those of jawed vertebrate Ldh-A and Ldh-B (Markert, Shaklee, and Whitt 1975). Markert, Shaklee, and Whitt (1975) proposed that an initial gene duplication gave rise to Ldh-A and Ldh-B early in vertebrate evolution and that Ldh-C is the result of a more recent duplication. Subsequent phylogenetic analyses of vertebrate LDH sequence data have supported some aspects of this “traditional” scenario for LDH evolution and challenged others (Li et al. 1983; Hendriks et al. 1988; Crawford, Constantino, and Powers 1989; Stock and Whitt 1992; Quattro, Woods, and Powers 1993; Tsuji et al. 1994; Quattro et al. 1995). Some points that remain controversial or poorly resolved are the phylogenetic position of the LDH-Cs, the relationship of cyclostome LDHs to gnathostome LDHs, and the monophyly of gnathostome LDH-As. The former two issues involve the earliest duplication events leading to the multigene family in vertebrates. Such duplications would be further elucidated by an examination of the relationship of invertebrate and protochordate LDHs to those of vertebrates. To date, the only complete LDH sequence available from an invertebrate animal is that of the nematode Caenorhabditis elegans (Tsoi and Li 1994), a represen1273

1274

Stock et al.

tative of a group that is quite distant from vertebrates. Among protochordates, tunicates and amphioxus have been reported to possess single LDH loci (Fisher et al. 1980; Baldwin, Mortimer, and Patak 1988). Immunochemical analyses and comparisons of amino acid composition were interpreted as evidence that tunicate LDH was more closely related to the LDH-C of a teleost fish than to either LDH-A or LDH-B (Baldwin, Mortimer, and Patak 1988), while Fisher et al. (1980) suggested that the single-locus condition of amphioxus and tunicates predated the divergence of these groups from vertebrates. We have attempted to provide additional resolution of the early duplication history of chordate LDHs by cDNA sequence determination and phylogenetic analysis of an LDH locus from the tunicate Sty& plicata.

Adult specimens of the tunicate Styelu plicata were obtained from Gulf Specimen Company (Panacea, Fla.). A section of the body wall was cut from a single specimen, and the inner and outer surfaces were removed with a scalpel. The remaining portion was ground to a fine powder in liquid N2 using a mortar and pestle. Total RNA was extracted from 2 g of powdered tissue by homogenization in the presence of sodium tri-isopropylnaphthalene sulfonate followed by phenol/chloroform extraction, ethanol precipitation, and precipitation from sodium acetate, as previously described (Stock et al. 1991). Transcription

Total RNA was reverse transcribed with MMLV reverse transcriptase and either random hexamers or a dT adaptor primer (5’-GATGCTGCAGAGCTCAAGC(T)is3’) as described by Stock and Powers (1995). RT-PCR Amplification Styela Ldh

RACE Amplification

of Internal

Fragment

of

A number of degenerate PCR primers (designed from an alignment of bacterial and vertebrate LDH protein sequences-Stock and Whitt 1992; Stock and Powers 1995) were tested for their ability to amplify a product of the expected size from Styela first-strand cDNA. Of these degenerate primers, only Ldhll3FhF (5’GCGAATTCGTNGGNATGGCNTGYGCNRT-3’, where the underlined sequences indicate restriction sites, and ambiguous bases are indicated by the appropriate IUPAC codes) and LdhB736FhR (S’-GCCGCTGCAGCCCANBWNGTRTANCCYTT-3’) amplified a fragment of Styela L+dh. The sequence obtained from this fragment was used to design gene-specific primers for additional RT-PCR (S’-GCCGGAGCTCAAGGTGAAGTAATGGACT-3 ‘, used in combination with primer LdhB736FhR) and RACE amplifications (see below). Conditions for RT-PCR were as described by Stock and Powers (1995). Weak bands were reamplified in a similar PCR reaction using as a template either a dilution of a band cut from a low-melting-point agarose gel (NuSieve-FMC) or the DNA adhering to a syringe

of the 3’ End of Styela Ldh

The 3’ end of the Styelu LDH cDNA was amplified by 3’ RACE, essentially as described by Frohman (1990). A dT-adaptor reverse transcription was used as the template for an initial PCR with an adaptor primer (5’-GATGCTGCAGAGCTCAAGCTT-3’) and the gene-specific primer 5’-GCCGGAGCTCGRTGGMCCWGAAGGYTGGGA-3’. PCR conditions were as described by Stock and Powers (1995). The initial PCR was then used as a template for a second PCR with the adaptor primer 5’GATGCTGCAGAGCTCAAG-3’ and the gene-specific primer 5’-GCCGGAGCTCACAYAARCARGTYRTYGATG-3’. RACE Amplification

Materials and Methods RNA Extraction

Reverse

needle stabbed into a band of the expected size (band stab technique of Bjourson and Cooper 1992).

of the 5’ End of Styela Ldh

5’ RACE amplification followed Frohman (1990). RNA was reverse transcribed with the degenerate LDH primer 5’-GCCGCTGCAGTCNACNACYTGYTTRTG3’. The first-strand cDNA was tailed with dATP using terminal deoxynucleotidyl transferase. PCR amplification of the dA-tailed template was with the gene-specific primer 5’-GCCGCTGCAGTGGCCGATACGGAGTAATCT-3’, the dT adaptor primer, and one of the adaptor primers used above. The PCR reaction was subjected to electrophoresis on an agarose gel, and the band stab technique (Bjourson and Cooper 1992) was used to reampufy a faint band of the expected size with the nested LDH primer 5’-GCCGCTGCAGCCRTGYTGNARRTCCAT-3’ and the same adaptor primer as in the first reaction. Cloning PCR products were cloned into Ml3 mp18 and mp 19 or pBluescript KS+ and KS - (Stratagene) using standard procedures (Sambrook, Fritsch, and Maniatis 1989). PCR products were either digested with restriction enzymes (recognizing sites at the 5’ ends of the primers or in the region between the primers), made blunt-ended with the Klenow fragment of DNA polymerase I, or cloned directly using the t-vector technique (Marchuk et al. 1991). Sequencing Single-stranded DNA for sequencing was either extracted from Ml 3 phage particles or obtained from pBluescript-containing cultures infected with VCSM13 helper phage (Stratagene) according to Sambrook, Fritsch, and Maniatis (1989). DNA sequencing was by dideoxy chain termination using a modified T7 DNA polymerase (Sequenase, version 2.0, United States Biochemicals). In order to minimize errors associated with the sequencing of cloned PCR products, each nucleotide was determined from at least six clones, with at least three sequences representing each strand. Sequence

Assembly

and Alignment

The cDNA sequence of the LDH of Styelu plicata was assembled from the consensus of overlapping clones using the SEQMAN contig assembly program in

Tunicate

the DNASTAR package (DNASTAR, Inc.). The sequence described in this paper has been deposited in GenBank under accession number AF023168 (fig. 1). Additional lactate dehydrogenase sequences were obtained from the GenBank (release 92) or PIR database. D-LDHs (E.C. 1.1.1.28) were not included in the analyses, as they represent an, at best, distantly related protein family or families (Kochhar et al. 1992). The LLDH sequences used and their accession numbers (GenBank, unless otherwise noted) are: lamprey (Petromyzon marinus) LDH (M74064); dogfish shark (Squalus acanthias) LDH-A (U38893); killifish (Fundulus heteroclitus) LDH-A (I-43525), LDH-B (M33969), and LDH-C (L07336); frog (Xenopus Zaevis) LDH-A (UO7179), LDH-B (UO7176), and LDH-C (U07175); duck LDH-B (503869); chicken LDH-A (X53828) and LDH-B (PIR accession A00346); cattle LDH-A (D90143); pig LDH-A (U07178) and LDH-B (UO7180); rabbit LDH-A (M22585); human LDH-A (X02152), LDH-B (YOO711), and LDH-C (502938); mouse LDHA (YOO309), LDH-B (X51905), and LDH-C (X04752); rat LDH-A (X01964) and LDH-C (U07177); fox (Vulpes vulpes) LDH-C (U19868); nematode (C. elegans) LDH (U15420); barley LDH-A (M55685) and LDH-B (M55684); rice LDH (D13817); maize LDH (211754); the LDHs of the apicomplexan protists ToxopZusma gondii (U35 118) and Plasmodium falciparum (M93720); and the L-LDHs of the eubacteria Streptococcus mutuns (M72545), S. thermophilus (D 13405), Luctococcus lactis (M88490), Luctobacillus casei (M76708), Luctobacillus sake (U26688), Lactobacillus plantarum (D90340), Pediococcus acidilactici (X70927), Bacillus stearothermophilus (M19396), B. caldolyticus (M19394), B. caldotenax (M19395), B. psychrosacchar118 and LDH-B-X55 119), B. olyticus (LDH-A-X55 subtilis (PIR accession A25805), B. megaterium (M22305), Mycoplasma hyopneumoniae (X67286), M. genitalium (U39733), Thermus aquaticus (D00585), T. caldophilus (X045 19), Deinococcus radiodurans (D63899), Thermotoga maritima (X74302), BiJidobacterium longum (M33585), and Synechocystis sp. (D64003). Nucleotide sequences were translated into protein sequences according to the authors’ feature specifications. An alignment of amino acid sequences was constructed using CLUSTAL W (Thompson, Higgins, and Gibson 1994) and the default settings. Length variability among chordate LDH sequences in this alignment was limited to the amino terminal arm (positions l-43 in fig. 2, as defined by Li et al. 1983) and the extreme carboxy terminus. No attempt was made to improve the alignment of the former region because of extensive sequence variability, but the alignment of the later region was adjusted manually. Invertebrate and plant LDH sequences require the addition of only a few additional gaps to the alignment of chordate LDHs, but the alignment of bacterial and protist sequences appeared less reliable. Rather than attempting additional alignment methods or parameters, we conducted phylogenetic analyses with and without the latter two groups of LDHs.

Phylogenetic

LDH

1275

Analyses

Positions l-44 in the alignment of LDH sequences (fig. 2) were omitted from all analyses because of the difficulty in obtaining a reliable alignment of this region. Phylogenetic analyses were conducted using both neighbor-joining and parsimony methods. Amino acids were used as characters in both types of analysis because of the large amount of sequence divergence among some of the taxa and the fact that we were interested in early branching events in LDH evolution (Kumar, Tamura, and Nei 1993). Neighbor-joining trees were constructed from pairwise amino acid distances (gamma corrected with a = 2.0) using the MEGA package of programs (Kumar, Tamura, and Nei 1993). Parsimony analyses were conducted with PAUP, version 3.0s (Swofford 1991), and in most analyses, amino acids were treated as unordered characters with all possible transformations weighted equally. Preliminary analyses with amino acid replacements weighted according to the minimum number of nonsynonymous nucleotide substitutions required (PROTPARS weighting-Felsenstein 1993) gave similar results but were not pursued extensively because of the computational time required. Parsimony analyses with equal weighting were performed with 100 random addition sequences of taxa and TBR branch swapping (Swofford 1991), while those with PROTPARS weighting used 20 random addition sequences. The CONSTRAINTS feature of PAUP was used to determine the lengths of the shortest trees matching a number of alternative phylogenetic hypotheses, In the case of constrained analyses using PROTPARS weighting, the PROTPARS program of the PHYLIP package (Felsenstein 1993) was used to test whether the length differences between constrained trees and the best unconstrained tree were significant according to the method of Templeton (1983). Bootstrapping was used to assess the degree of support for particular nodes. Neighborjoining analyses were conducted with 500-1,000 bootand bootstrap parsimony analyses strap replications, were conducted with 10 replications of a random addition sequence for each of 100 bootstrap replications.

Results Features

of Styelu Ldh

The 1,168-nt cDNA sequence of S. plicatu LDH is shown in figure 1. Three of 10 clones of the 3’ end had an additional 10 nucleotides before the poly-A tail (not shown). No consensus polyadenylation signal (Sheets, Ogg, and Wickens 1990) was found in either class of clone, suggesting that the reverse transcription primer may have bound upstream of the true poly-A tail. Of the sites at which nucleotide differences from the sequence shown in figure 1 were found in individual clones of PCR products, only seven exhibited the same difference in more than one clone. Two of these differences occurred in the 3’ untranslated region (positions 116 1 and 1164), three were silent (positions 923, 97 1, and 1037), and only two encoded different amino acids (N and P at nucleotide positions 454 and 583, respectively). These latter two differences were only found in

1276

Stock et al.

GATTGGAGTATTTGGTTGTATTAAGAATTTTGATAATTAAACACAA TCATGGCTGATCTGAAGGGATTCGATATAATGTCCGAGTTATTCA MADLKGFDIMSELF CGGAAGTCTGTCCAGACATTCCAAAATCTGGACCATCCAAAGTCA TEVCPDIPKSGPSKV CTGTAGTTGGAGTTGGGATGGTGGGGATGGCGTGTGGGATGAGTG TVVGVGMVGMACGMS TGGTGTTGAAGGGACTATGTACAGATCTTGTACTAGTCGATGTTG VVLKGLCTDLVLVDV TTCAGGACAAACTTCAAGGTGAAGTAATGGACTTGCAACATGGTA VQDKLQGEVMDLQHG GTTTGTTTCTAGAGAATATTCTATGGAGACAAAGATTACT SLFLENIKVYGDKDY CCGTATCGGCCAACAGTCGGATTGTCATAGTAACTGCTGGTGCTC SVSANSRIVIVTAGA GTCAACZUXCTGGGGAATCTAGATTGAGTTCAACGG RQQPGESRLSLVQRN TCAACATTTTCTTCCACAAATTGCGAAATATAGTC VNIFKHIIPQIAKYS CAAGn=CTATTCTTGTTATn=TTTCAAATCCAGTTGACTT PSAILVIVSNPVDLM CATATGTTGCCTGGAAATTGTCAAACTTTCCTCGTAATCGTGTGA TYVAWKLSNFPRNRV TTGGTTCTGGZUuZGAATCTTGATTCTGCJUiGATTTCGACATCTTA IGSGTNLDSARFRHL TCGCGGAAAAATIY3AATTTGTCACCTGTCAGTGTTCATG IAEKLNLSPVSVHGW TCATCGGCGAACATGGCGATTCAAGTGTTCCAATGTGGAG IIGEHGDSSVPMWSG TTAACGTGAGTGGGZUATGTCGATTCATCCAAGAATAG VNVSGKCLNSIHPRI GATATCCGGATGGACCAGAAGGTTGGGATAAAATACATACATAAACAAG GYPDGPEGWDKIHKQ TTGTCGATGGGGCATACGATGTAATAAGATTAAAAGGATACACAA VVDGAYDVIRLKGYT ACTGGGCTATTGGTCTGAGCTGTGCAGAATTGCTTGCA?iCTATTT NWAIGLSCAELLATI TACATCACAGACATAGAATCCATCCAGTTACTTGTTTTGTGAAGG LHHRHRIHPVTCFVK GCCGATATGGAATCACTGATGATGTATGCCTATCTCTACCTTGTG GRYGITDDVCLSLPC TTTTGZUXX'GTAATGGAGTCAACTCTATTGT~TGTTGATTTGA VLNCNGVNSIVNVDL CTGCTGAAGAAGAAGCAATTGCAATGACTATTG TAEEEAMIKKSAMTI CTGATGTG CAAAAAGGCCTGAAATGGTAAAATTTACCAGAATACT ADVQKGLKW* ACATAAAAAGCTTTCATTGTGCTCTTTCTCAAAGCGCTTTCAAGT CCCTTTAATGTCATGGCCGATATGCTAAAAGACCCAGACTTGT FIG. l.+DNA cata L_dh.

and deduced

amino acid sequence

45 90 14 135 29 180 44 225 59 270 74 315 89 360 104 405 119 450 134 495 149 540 164 585 179 630 194 675 209 720 224 765 239 810 254 855 269 900 284 945 299 990 314 1035 329 1080 338 1125 1168

of Styela pli-

2 of 11 and 3 of 15 clones and may have been PCR errors. Two of the silent differences (positions 923 and 971) may be the result of heterozygosity, as two bands of relatively equal intensity were seen at these positions on autoradiographs of directly sequenced (uncloned) independent PCR products (data not shown). The Styelu LDH protein that we inferred from the nucleotide sequence (assigning the start codon according to the properties of vertebrate translation start sites listed by Kozak 1996) is 338 amino acids long including the initial M; vertebrate LDHs, in comparison, range from 332 to 334 amino acids in length. The additional length of the tunicate sequence is in the N-terminal arm (AbadZapatero et al. 1987 and included references), a region that has previously been shown to also exhibit the greatest degree of sequence variability among vertebrate LDHs and is reduced or absent in bacterial sequences (Li et al. 1983). The only other region of length difference among chordate LDHs reported to date is near the carboxy terminus (fig. 2); at the penultimate position, all vertebrate LDH-Bs and the LDH-B-derived teleost fish

LDH-C (Quattro, Woods and Powers 1993) exhibit an extra amino acid that is missing in mammal LDH-Cs, lamprey LDH, and all vertebrate LDH-As. Styelu LDH, like that of the nematode and the less reliably alignable plant LDHs is also missing an amino acid at this position. The tunicate sequence, with its greater similarity to vertebrate sequences in this region, provides additional evidence that the absence of the amino acid is the primitive condition and that the amino acid was added in the common ancestor of LDH-Bs and teleost fish LDH-C. Interestingly, there are no reversals of this character evident in the vertebrate sequences examined to date. Summaries of the pairwise sequence identities of LDHs are shown in table 1. Styelu LDH is approximately equally similar to all vertebrate LDHs, with the highest similarities to certain LDH-As and the lowest to mammalian LDH-Cs. The lowest within-vertebrate similarity is lower than some of the similarities of vertebrate and Styela LDH. Compared to other groups outside the Vertebrata, Styela-to-vertebrate similarities are approximately the same as those between nematode and vertebrate LDH, and greater than those between plants and vertebrates, apicomplexans and vertebrates, and bacteria and vertebrates. Nonorthologous Cyanobacteria

LDHs of Apicomplexan

Protists

and

A neighbor-joining analysis of 54 LDH sequences is shown in figure 3. The tree has been rooted at the midpoint; it cannot be rooted in a manner that makes eukaryotic LDHs a monophyletic group because of the clustering of the cyanobacterial Synechocystis LDH with the Apicomplexan protist LDHs of Plasmodium and Toxoplasma and the location of this group on a branch LDH and all other bacterial uniting Bifidobacterium LDHs. The node uniting Synechocystis with Plasmodiurn and Toxoplasma is very strongly supported (100% bootstrap support). This result, along with the location of the midpoint root, suggested to us that these three LDHs might not be orthologous to the other LDHs in the tree. A search of GenBank using BLASTP revealed that the closest matches to these sequences were to the malate dehydrogenases (MDH, E.C. 1.1.1.37) of three eubacteria (Bacillus subtilis [Jin, de Jesus-Berries, and Sonenshein 19961, Bacillus sp. [Wynne et al. 19961, and Chlorojlexus aurantiacus [Synstad, Emmerhoff, and Sirevag 19961). The cytoplasmic and mitochondrial MDHs of eukaryotes and eubacterial MDHs have been shown to be related to L-LDHs in both sequence and, especially, three-dimensional structure (Birktoft et al. 1982). Sequence comparisons suggested that LDHs and MDHs represent two distinct groups whose members are more closely related among themselves than to members of the other group (Iwabe et al. 1989). Recently, an MDH from the archaebacterium HuZoarcuZa marismortui was cloned that exhibits greater sequence similarity to LDHs than to other MDHs (Cendrin et al. 1993). The MDHs of the three eubacteria mentioned above appear to exhibit a similar relationship to LDHs (Jin, de Jesus-Berrios and Sonenshein 1996; Synstad, Emmerhoff, and

Tunicate 10

Hsa Hsa Fhe Sac Pma Hsa Fhe Fhe Spl Cel Hvu Blo Taq Mhy Bst Pfa ssp

Hsa Hsa Fhe Sac Pma Hsa Fhe Fhe SPl Cel Hvu Blo Taq MhY Bst Pfa Ssp

A C A A B B C A

A C A A B B C

A

Hsa Hsa Fhe Sac Pma Hsa Fhe Fhe SPl Cel Hvu Blo Taq MhY Bst Pfa Ssp

A C A A

Hsa Hsa Fhe Sac Pma Hsa Fhe Fhe SPl Cel Hvu Blo Taq MhY Bst Pfa SSP

A C A A

B B C A

B B C

A

20

30

40

I

I

I

I

110

120

130

I

I

I

I

140

150

160

I

I

I

250

260

270

280

290

I

I

I

I

EEQTP----DDENS..... ..PVG.C... SQEPRS.... SKERDPP... ..A.VP.... SSAE.P.... SSPE.P.... DIPKSG.... PVENS..... ATP.SSAVPH TTVK...... -----..... IKLMK.... . MKNNG..... __m.****. IACQS.....

VTANSKLVII .S...RI..V .... ..V .VV .S.G....V. . ..G.R ..W ..I .VV ........ .R ..VV ..S..RI .VV .S...RI ..V I..G...C W ..K..D . ..V ICRDADM.V. DLEGARV ..V DLKDADFIV. DCRDAD..V. DL.GADV ..V A..G.DV.V.

210

220

230

240

I

I

I

I

I

VHPLSCHGWV . ..T.....I L..S.....I . .SS...... .NSA.....I I..S.....I I.AS.FN... I.SS.FN..I LS.V.V...I IA.S.....I .NAQDVQAYM .NVKNV.AYI .A.Q.V.AY. .S.N.VQAY. .A.QNV.AYI .C.RDVNAHI AC.SDINTL.

310

320

330

340

350

IKGLYGIKDV .E. :iIi::::::i V~:iiI:i.. .C........ V.DF....N. . . . . . . . . .H C..QH.VH.. .S.I...... V..M...EN. .S........ V.DMF..GE. MN.I...... A..M...G.. RH.1.. .TCF V..R...T.. S.N.FAL..N V..FH..N.. .VL AS.FH..S.G Q Tiii::. S. L.DFH..S.. EKG.FT..LF TPNE.VEE. TKNIMIAGAN LR.E..FHG. ENAILT..AY LD....ER.. .KK.LIC..L LE.Q..HS.. QS.IL.AA.Y LD.A..L...

-DVFLSVPCI .EL.. .I..V .E.. ..I..V . . . ...L..V . . . ...L..V .E....L... .E....L..V .E.Y..L..V . . .C..L..V . ..Y..L.W HE....L.AR .-ICM.. .TL .-.A..L.R. .-.NIG..W .-.YIG..AV .-1.GGT.W .-I..G...R

LGQNGISDLV ..R..V..V. ..NS.LT.VI .DNH...NI. ..SE..T.II .NAR.LTSVI .NGS.VGSV. .NSG.VGSV. .NC..VNSI. ..SA.LTHV. ..RG..LGVA .NRQ.VNNTI ..AR.VEATL ..A.. .EKII INR...RNI ..A..VEQVI ..CR.VE.IL

KVTLTSEEEA .IN.N..... HM..KP...K .MK.KPD..Q NQ..KK.... NQK.KDD.V. NM...AA.V. NM...D..V. N.D..A.... .QQ..EA.VQ EMD..EA.A. NTPVSDK.L. YPR.NE..RQ EIS.NDK.KE EIE.NDD.KN ELQ.N...-E.Q..P..K.

FIG. 2.-Alignment killifish, Sac-dogfish

I

of representatives shark, Pma-lamprey,

I

VWSGMNVAGV L...V..... . . ..V.I... ..... ..... . . ..V..... . . ..V..... . . ..A..... T il:::V::&i . . ..V..... I..SIS.G.M L.ESATIG.. . ..SAQ.G.. AY.NIKI..E . ..QAYIGVM LKRYIT.G.1 LPRYCT.S..

I

1277

I

I

I

180

190

200

--QNKITWG VGAVGMACAI SILMKDLADE LALVDVI--E DKLKGEMMDL QHGSLFLRTP ...C ...I .. T .......... ..L .......... ..A ..L .............. ..FS .S ..R..V .... ..M....S .. .V.L...C ...... ..M ....... ..A .... ..A...K .H V .M Y .H.A ::Hh:;:: ::b::::i:: :+:iR::::: .::::.v ...... :i:::::::: L:::::.K.A ..N ......... .G.S ........ ..L .................. ..Q .. ..R..V ...... E::::::; :: .LR..C ...... ..M ... .R ......... ..L...K .S ..R..V .... ..Q .... ..V T..LRE ........ ..V ... ..V ............ ..K .. ..PS.V .... ..M.....G M .VVL.G.CTD .V....V ..Q . ..Q..V ........ ..JZN I ..HG.V .... ..Q .... ..Y . ..QQN..N. .C....V ..A ........... ..LA.T.H C RRLT..S.I. A.N....I .Q T..TQN .... I....AL ..P . ..R..AL .. ..AAA..PR V -T.LA.I. A . . ..STL .F AAAQRGI.R. IV.E.IA ..K ERVEA.VL.M .. ..S.YP .V ::-V.VGI . . S.F..S.T .Y ALVLQGV.R. VV .. .LD ..R KLAQAHAE.I L.ATP.AH-. ..PI..ALI. A.N..NSFL Y AAMNQG..S. YGII.IN ..P .FAD.NAF.F EDA.AS.PF. ..GARVV.I. A.F..ASYV F ALMNQGI ... IV.I.AN ... S.AI.DA ..F N..KV.APK. . .KA..VL . . S.MI.GVM.T L.VQ.N.G-D VV.F.IV ..K NMPH.KAL.T S.TNVMAYSN ..-WQV .... A.N..RTL .Q RLVQQNV.-N VV.L.IV ..P GLPQ.IAL .. MAAQSVEEYD

FRYLMGERLG . . . . I..K.. . .H . . . .KFH . .. . . . . . . . . .. . .S . . . . . . . . .A.K.. .M.A.... ::F..ADK.. ..H.IA.K.N ..F.LS.K.N ..F.IA.H.D L.F.IAQQT. ..A.LAQH.L LQFAIAK.AK ..F.L..YFS LK.YISQK.N LKAFIAMK..

I

I

TAGARQQEG- ----ESRLNL VQRNVNIFKF IIPNVVKYSP . . . . .. . . . . . . . ..T..A. ..... A.M.1 . ..AI.H ... .................... .............. I ..... ............. DI..H .. .................... .................... .............. I ..... ..V ......V ...... .. ..V ................................ ......v ..c . ..g:::: . ..V ......... ..R .... D ........ H ...LI.RH .. ..... ..P ........ ..S. ......... H . ..QIA .... .... ..R ...... ..T..S. ..... E ... G . ..QL ..... ..ALYR K .V.P.AEH .. .... ..IP ..... ..T .... L . ..P..KP .... ..Q...E. .iT .. .L.A .M..L..VA. A..VA.RP ..... ..T..Q. LD..AQV.A D VV.KIL.AA. . ..RP.KP ..... ..T..E. .AD.IR.IRE .ALK.KESGF C...N.KP ..... ..T..D. .DK.IA..R S .VES.MASGF . ..FTKAP .K SDKEWN.DD. LPL.NK.MIE .GGHIK.NC. . ..LPRRP .... ..M..DD. LGK.A..VA Q GAREALR ...

KIVS-GKDYN .VT......S ... A.D ...S ......... S ...A.D . ..S .A.D ..S :;I.A.D.. ..A .A.D ..S :~NG.D .. . .S TVKA.DT ..S R.S-..T .AA S.DG.SD.PE VW.R.SGW.E IS .. .RYE.K VDIW.HG ..D CK..GSNT .D SKIIGTNE.E

LGEHGDSSVP I......... V........A I......... I......... . .. . . . . ..A . . . . ..T... . . . . ..T... I......... I........A V....... .A A......E.. V......E.L M...... .FV I.....TEL. V.A..NKM.L ..G...LML.

80

I

_-____---_ -MATLK---- -DQLIYNLLK .......... ..S.V ..... .E.. .EK.IE .......... .-MSTQ .... .EK..SHVM. .................... ..K..GH.AT ST .GK..HEMVP :::::::::: 1:: ............ .EK..APVAE .......... ..SSVL .... .QK..TP.AS ........... ..SVL .... .HK..TP.AC ........... ..D..GFD I MSE.FTEVCP .......... MAS.I ..... .-EVFAEIAA MHKASSLSEL GFDAGGASSG FFRPVADGCP .......... .----- .... .------MAE .......... .----- .... .--------.......... .----- .... .MKKFNKKGE .......... .----- .... .------MNA .......... .----- .... .__--____.......... .----- .... .-MN.LEYAP

LRRVHPVSTM

70

LDH

SLKTLHPDLG A....D.K.. ..Q..N.NM. . ..E...E.. G.QS.N..I. ..QE.N.EM. ..QK.N.EI. N.Q..N.NI. C.NSI..RI. T.HEIK..I. PAFKSLR.SH PMCDWT.LP. D.EAFAQARCFCAYSKLTPIRK.VESKP.QEFINN-PITE.I.P--

I

170

I

I

I

NCKLLIVSNP VDILTYVAWK ISGFPKNRVI GSGCNLDSAR D..I.V ........ ..IV ... ..L.VT ............. ..I..V .............. L....RH .... ..T ...... D.1I.V .... ..V ....... L..L.MH.I. .......... ..I..V .............. L..L..H ............. D.IIIV ......... ..T .. L..L..H ............. ..TI.V .... ..V....T .. L..L..H .... ..T ...... D.IIIV .... ..V....T .. L..L.MH .... ..T ...... SA1.V ..... ..LM ...... L.N..R ..... ..T ...... DTCI.V .... ..V....T .. L..L.RE ..F ...T ...... DAL..V ..V ....... L....AS ..T....S. .AIYMLIT .........A.H..Q. LT.L.E.QIF ..... ..T ...... EAV...AT .. ..VM.Q..Y R L..L.PE ..V . ..TI..T .. SGISI..A ... ..I.RAYR D A...SDQK ... ..TV..T .. QGLF.VAT ...... ..AT .. F..L.HE .... ..TI..T .. .AFIIV.T .. ..VMVQLLH Q H..V...KI. .L.GV..TS. .AI.IV.T .. L.VM..L ... VT.L.SQ ..M .MAGV .....

TDKDKEQWK..S...H... A.G.S.N... . . . . ..N... .P..G.D... ..N.S.N... ..G....... ..F.E.N... YPDGP.G.D. EKT.N.H.EA RSF.EAALE. H.PLDADKRE GRALTPDDRL GIDSS-NYEK GEEAQ-KDLE KL-ISDAELE ------.TIE

EVHKQVVESA NI....IQ.. .L..K..DG. KL..D..D.. S......D.. . ...M..... AT..A..D.. .T..M..D.. KI.....DG. .I..K..D.. GIRRA..GG. .I.QE.KNA. RIDEG.RRA. .LEYP.SRR. RIFVN.RDA. AIFDRT.NT. .LVERTRNGG

360

370

I

I

YNIKLK--G ..I....... . . . .. . . . . . .. . .. . . . . . .. . . . . .. . . . . . . . . . .. . . . . .. . . . . . . . . .. . . . . . .D..R..... ..I....... . ...G..... .KI.NG.... .RI.EG.... ..I.NR...R .QI.EK.... L.IVN.H..A A.IAA.LQT.

YTSWAIGLSV . . .. .. . . . . . . . . . ..M.. . . . . . . . . .. . . . . . .. . . . ..N....... ..N....F.. ..N....... ..N......C .. ........ . . . . . ..Y.. A.NY...M.G A.YYG..AGL A.FYG..AAI A.YYG.AMGL SPYV.PAAAI TAYY.PAS.A

300

I

ADLAESIMKN M..VG..L.. . ..V...V.. .. . ..T.... . .. ..T.L.. . ..I..ML.. . ..T...V.. . ..T..L.R. .E.LAT.LHH .KI.QG.FS. .S..A.LLRD V.II.AVLHD .R.TRA.LTD .KIVSN.I.D .RVTRA.LH. IEM...YL.D .VMV...LR.

I _--

RLKKSADTLW GIQKELQ-FLF....E... N...D.I... Q.V...E... .V....T.L. Q.Q...T... D...D.K... QVQ...E... NV....T... Q......... D...D.KDL. Q......... . . ..D.KDL. Q.QG..S... D...D.RDI. MI....M.IA DV..G.K.W. K.HN..KA.L N.NG1V.M. Q.RR..K... ENCQL.D.L. A..R..E..K ETAAQFG... A.RR..EI.K .AASA.G... KFA..VAIID K.YQDAIKNI .FHH..A..K SVLARAFTR. --.AKF.EiAI AET.RMKALA A.HL..FiAVR LNIDVALAMV

... ... ... ...

of major groups Spl-tunicate,

of LDH sequences. Abbreviations of taxa are as follows: Hsa-human, FheCel-nematode, Hvu-barley, Blo--B$dobacterium Zongum, Taq-Z’hermus aquaticus, Mhy-Mycoplasma hyopneumoniae, Bst-Bacillus stearothermophilus, Pfa-Plasmodium falciparum, Ssp-Synechocystis sp. A, B, and C refer to the different isozymes within a species. Dashes indicate alignment gaps and periods indicate matches to the sequence of human LDH-A.

Sirevgg 1996; Wynne et al. 1996). In addition, all of these LDH-like MDHs are tetrameric like LDH, rather than dimeric like other MDHs. Although the MDH activity of the cloned sequences from Haloarculu, BacilZus, and Chlorojkxus has been relatively well charac-

terized (Jin, de Jesus-Berries and Sonenshein 1996; Synstad, Emmerhoff, and Sirevag 1996; Wynne et al. 1996), the LDH activity of Plasmodium LDH has not been as conclusively demonstrated (Bzik, Fox, and Gonyer 1993), and the Toxoplasma and Synechocystis sequences

1278

Stock et al.

Table 1 Summary

of Pairwise Differences Pairwise

Among LDH Sequences

Comparisons

Range of Distance@

Bacteria to vertebrate .................. Apicomplexa to vertebrate. ............. Plant to vertebrate. .................... Nematode to vertebrate ................ Tunicate to vertebrate. ................. Within vertebrates. .................... Tunicate to vertebrate LDH-A. .......... Tunicate to lamprey ................... Tunicate to vertebrate LDH-B ........... Tunicate to mammal LDH-C ............ Tunicate to killifish LDH-C .............

0.59-0.73 0.68-0.73 0.46-0.54 0.35-0.46 0.36-0.44 SO.40 0.35-0.39 0.36 0.37-0.40 0.39-0.44 0.38

a Distances were calculated using the alignment in figure 2, with gaps in the pairwise comparisons deleted and all other positions considered. Distances are proportions of nonidentical residues.

have been identified as LDHs only by sequence similarity to Plasmodium LDH (Kaneko et al. 1995; Yang and Parmley 1995). Regardless of whether the “LDHs” of Plasmodium, Toxoplasma, and Synechocystis actually function as LDHs or MDHs, we suggest that they are not orthologous to the other LDHs in figure 3. Strong support for this hypothesis is provided by the presence LDH (Fig. 3) in addition in B. subtilis of a conventional to the LDH-like MDH described above. Phylogeny

of Eukaryotic

LDHs

The midpoint rooting of figure 3, which supports a monophyletic group of nonprotistan eukaryotic LDHs distinct from bacterial LDHs, is consistent with the results of simultaneous analysis of LDH and MDH (Iwabe et al. 1989). This monophyletic group of plant and animal LDHs was found in 100% of the bootstrap NJ trees. Plant LDHs also form a monophyletic group with 100% bootstrap support, although this is not surprising given that all of the species are members of the family Poaceae. The monophyly of animal LDHs was found in 99% of the bootstrap NJ tree. All of the above clusters were also found in parsimony analyses (not shown). Phylogenetic

Position

of Styela LDH

The phylogeny of the LDHs of the three animal phyla matches that of the organisms. In the tree in figure 3, nematode LDH clusters outside of urochordate (Styela) and chordate LDHs, but with only moderate bootstrap support (60%). Figure 3 shows significant support (97% of the bootstrap trees) for Styelu LDH being outside of a monophyletic group of all of the vertebrate LDHs. This result indicates that all of the gene duplications within the vertebrates examined occurred after the divergence of tunicates and vertebrates. In an attempt to provide increased resolution of the relationships of tunicate and vertebrate LDHs, we conducted a neighbor-joining analysis without the more distantly related and less reliably alignable bacterial and protistan LDHs and have rooted the resulting tree between plant and animal LDHs (fig. 4). Such an analysis shows higher support for the theory that the tunicate LDH is more closely related to vertebrate LDH than is nematode LDH (92% bootstrap support), but lower support for the

monophyly of vertebrate LDHs (87% of the bootstrap trees). We used maximum-parsimony analyses to evaluate the relative support for alternative hypotheses on the phylogenetic relationships of tunicate LDH (fig. 4 and table 2). It has been suggested based on immunochemical comparisons that tunicate LDH is most closely related to LDH-C and lamprey LDH (Baldwin, Mortimer, and Patak 1988). These authors assumed that mammal and fish LDH-Cs were orthologous, although this relationship was strongly contradicted by a subsequent phylogenetic analysis of a teleost LDH-C sequence (Quattro, Woods and Powers 1993). Not surprisingly, given this latter result, the forced grouping of Styela LDH, lamprey LDH, and mammal and fish LDH-Cs required a large number of extra steps (31 for equal weighting, 46 for PROTPARS weighting) relative to the most parsimonious trees and in the case of PROTPARS weighting was significantly worse as judged by the method of Templeton (1983). Relaxing the requirement that mammal LDH-C be a part of this cluster (Baldwin, Mortimer, and Patak’s [1988] immunochemical data were based on fish LDH-C) still results in trees significantly worse than unconstrained trees. Clustering of tunicate LDH with mammalian LDH-Cs could also be rejected, but a specific relationship with fish LDH-C could not. Additional phylogenetic positions of the tunicate LDH inside the vertebrate LDH clade (not necessarily suggested by any authors) were also evaluated with respect to the number of extra steps. The clustering of the tunicate LDH with gnathostome LDH-B and teleost LDH-C required the Styela fewest additional steps. Hypotheses clustering LDH with LDH-C remained among the least likely. Phylogeny

of the Vertebrate

LDHs

The phylogenetic relationships among vertebrate LDHs were investigated with plant and invertebrate outgroups as shown in figure 4. Parsimony and neighborjoining analyses gave similar topologies, although the bootstrap values were generally higher with neighborjoining. The values discussed below are from this latter analysis. A branch uniting all of the jawed vertebrate (gnathostome) LDH-Bs as well as teleost fish (killifish) LDH-C, Xenopus LDH-A, and Xenopus LDH-C was found in 90% of the bootstrap trees. The tree topology supports a number of gene duplications within this clade. Ninety-three percent of the bootstrap trees support a derivation of teleost LDH-C from teleost LDH-B after this group diverged from tetrapods, as initially proposed on immunochemical grounds (Whitt 1969; Marker-t, Shaklee, and Whitt 1975) and subsequently supported at the protein sequence level (Quattro, Woods, and Powers 1993). Two duplications are also indicated (with 100% bootstrap support) in the Xenopus lineage after it diverged from amniotes. One is likely to be the result of tetraploidy (Kobel and Du Pasquier 1986), while the other may be a gene duplication postulated by Wolff and Kobel (1982). Stock and Powers ( 1995) concluded from similar results that LDH-A and LDH-C have been incorrectly designated in Xenopus, with LDH-A probably remaining to be cloned. Aside from these likely dupli-

Tunicate

LDH

1279

_c Cattle LDH-A

45 51

Pig LDH-A i Rabbit LDH-A loo_ Mouse LDH-A 77_ ' Rat LDH-A - Human LDH-A 81 Chicken LDH-A Fox LDH-C Human LDH-C ‘M-“i;wL;FC -

Dogfish shark LDH-A Lamprey LDH Kiilifish LDH-A Killifish LDH-B Killifish LDH-C

,

100 58

Tunicate LDH Nematode LDH

100

100

(

8. longum LDH T. aquaticus LDH T. caldophilus LDH D. radiodurans LDH T. maritima LDH M. hyopneumoniae LDH M. genitalium LDH

87

P. acidilactici LDH 95

S. mutans LDH

FIG. 3.-Neighbor-joining tree of 54 LDH sequences. The tree is rooted at the midpoint of the longest path length. Amino acid distances were calculated with a gamma correction (a = 2). Alignment positions l-44 were omitted from consideration, and the pairwise deletion option of MEGA was used to treat alignment gaps. Numbers at nodes indicate the results of a bootstrap analysis with 1,000 replications. Bootstrap support within mammal LDH isozymes has been omitted for clarity. Numbers in a larger font indicate the support for the interrelationships among the major groups of eukaryotic LDHs.

cations, the phylogeny of the LDH-B genes matches the well-accepted organismal phylogeny of (teleost fish, (frogs, (birds, (mammals)))). Although LDH-Bs and their derivatives form a well-supported clade in figure 4, the sister group of this clade (lamprey LDH + gnathostome LDH-A + mammal LDH-C) is found in only 47% of the bootstrap trees. Within this latter group, there is strong support, however, for the theory that mammalian LDH-Cs are derived from within LDH-A. The strongest contradiction of the proposals that LDH-C is the earliest divergence among vertebrate LDHs (e.g., Li et al. 1983; Baldwin, Mortimer, and Patak 1988; Crawford, Constantino, and Powers 1989; Baldwin, Patak, and Mortimer 1992; Tsoi and Li 1994; Tsuji et al. 1994; Mannen et al. 1996) is pro-

vided by the node uniting dogfish shark LDH-A with mammal LDH-C and tetrapod LDH-A (96% bootstrap support). Further support for a derivation of mammalian LDH-Cs from LDH-A is provided by the node uniting them with chicken and mammal LDH-As (80% of the bootstrap trees). Mammalian LDH-Cs are specifically placed as the sister group of bird and mammal LDH-As (77% bootstrap support), a position that requires gene losses in a number of amniote groups, as discussed by Matson (1989). This grouping may be due to a rapid rate of evolution of LDH-C, which is obvious from inspection of the within-mammal divergences relative to those of LDH-A and LDH-B (fig. 4; Whitt 1984). A clustering of mammal LDH-C and mammal LDH-A, which does not require additional gene losses, is found

1280

Stock et al. Cattle LDH-A

and 48% bootstrap support) and one strongly supported one (96% bootstrap support). The additional gene duplications and losses required by the tree in figure 4 over a tree matching the organismic phylogeny have been discussed by Tsuji et al. (1994), Quattro et al. (1993, and Stock and Powers (1995).

Pig LDH-A

IL

rt 99

Rabbit LDH-A

Mouse LDH-A

60

Rat LDH-A

77 27

Human LDH-A

1

8C) I

-

Chicken LDH-A

I

,

I-

Fox LDH-C

-

Human LDH-C

97

Mouse LDH-C I

Rat LDH-C

Dogfish shark LDH-A I-

Killifish LDH-A

Xenopus LDH-A

88 60

~ 96

Duck LDH-B Chicken LDH-B

91

L Mouse LDH-B Tunicate LDH Nematode LDH

FIG. 4.-Neighbor-joining and parsimony analyses of 24 vertebrate LDH sequences with plant and invertebrate outgroups. The topology indicated was obtained from a neighbor-joining analysis conducted as described in figure 3. Numbers above the nodes indicate the results of a bootstrap neighbor-joining analysis with 1,000 replications. Numbers below the branches indicate the support obtained in a bootstrap parsimony analysis. Some sequences with close relatives were deleted from the parsimony analyses to reduce the amount of time required; these were cattle LDH-A, pig LDH-A, rabbit LDH-A, rat LDH-A, Xenopus LDH-B, duck LDH-B, pig LDH-B, and barley LDHB. Three most-parsimonious trees were found with 100 replications of a random addition sequence and TBR branch swapping (Swofford 1991). These differed from the topology shown by placing mammalian LDH-Cs within or as the sister group of mammalian LDH-As (the only aspect of the three trees in which they differed among each other) and by grouping killifish LDH-A with lamprey LDH. The bootstrap parsimony results indicated were obtained with 100 bootstrap replications, each with 10 replications of a random addition sequence and TBR branch swapping.

in the most parsimonious trees and in 48% of the bootstrap trees from a maximum-parsimony analysis (not shown). The phylogeny of gnathostome LDH-A and lamprey LDH (suggested to be orthologous to LDH-A by Stock and Whitt [1992]) in figure 4 does not match the generally accepted phylogeny of the organisms; moving the LDH-A of the killifish to a sister group relationship with tetrapod LDH-A would bring the two phylogenies into agreement. Such a position of the killifish LDH-A is contradicted by two weakly supported nodes (47%

Discussion Gene Duplications Leading to Multiple Vertebrate LDHs Occurred After the Divergence of Tunicates Vertebrates

and

Our analyses strongly support a sister group relationship between the LDH of Styela plicata and a clade consisting of all of the vertebrate LDHs that have been sequenced to date (those of lamprey and jawed vertebrates). If it is assumed that the locus we sequenced is the only one encoding LDH in the genome of Styela (see below), then this phylogenetic result suggests that all of the LDH duplications represented by the present collection of vertebrate sequences occurred after the divergence of vertebrates and tunicates. This pattern of duplication is consistent with the hypotheses of Markert, Shaklee, and Whitt (1975) and Fisher et al. (1980), which were formulated largely in the absence of sequence data. Our phylogenetic results strongly contradict the hypothesis of Baldwin, Mortimer, and Patak (1988) that tunicate LDH is more closely related to the LDH of lampreys and even more so to the LDH-C of mammals and teleost fish than it is to the LDH-A and LDH-B of jawed vertebrates. This hypothesis was based on immunochemical comparisons of the LDH of the tunicate Pyura stolonifera (distantly related to S. plicata) with the LDH-A, LDH-B, and LDH-C of a teleost fish, as well as on comparisons of amino acid compositions of tunicate and other vertebrate LDH isozymes. The lack of agreement of our phylogenetic conclusions based on sequence comparisons with earlier conclusions based on cruder immunochemical and amino acid composition analyses suggests that hypotheses on the relationships of echinoderm LDH (Baldwin, Patak, and Mortimer 1992) and hagfish LDHs (Baldwin, Mortimer, and Patak 1988 and included references) to the LDHs of lamprey and jawed vertebrates that are based on similar evidence are no longer persuasive. LDH Locus Number Invertebrate Groups

in Tunicates

and Other

Our PCR amplifications of Styela L.dh did not provide any evidence for the existence of more than one locus in this species. A number of the primers we used are coniplementary to regions with amino acid sequences conserved among multiple vertebrate paralogs. This suggests that if the Styela locus we amplified were orthologous to a specific type of vertebrate LDH, then the primers would be capable of amplifying Styela genes orthologous to other vertebrate genes. However, amplification of additional loci may have failed for a number of reasons, including restricted expression or unique substitutions along the Styela lineage. A more systematic determination of the number of loci present is pos-

Tunicate

Table 2 Lengths of Constrained

Trees in Parsimony

128 1

Analyses PROTPARSC

EQUAL WEIGHTS~ CONSTRAINTS Nonee..................................................

(tunicate, (tunicate, (tunicate, (tunicate, (tunicate, (tunicate, (tunicate, (tunicate,

LDH

killifish C, mammal C, lamprey) . . . . . . . . . . . . . . . . . . . killifish C, lamprey). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . killifish C) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . mammal C). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . all A except Xenopus, lamprey). . . . . . . . . . . . . . . . . . . . all A except Xenopus, mammal C, lamprey) . . . . . . . . . all B, all Xenopus). . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . all B, all Xenopus, killifish C) . . . . . . . . . . . . . . . . . . . . .

Length

Extra Steps

Length

928

0

1,103

959 948 936 935 936 933 939 931

31 20 8 7 8 5 11 3

1,149 1,128 1,119 1,120 1,118 1,113 1,122 1,111

Extra Stepsd

0 46” 25” 16 17” 15 10 19 8

a Constraints consisted of requiring the indicated group of sequences to be monophyletic with no other relationships specified inside or outside this group. See figure 4 for the sequences included in the analyses. b Lengths of trees calculated with all amino acid transformations weighted equally. c Lengths of trees calculated with amino acid transformations weighted according to the number of nonsynonymous nucleotide substitutions required. d Asterisks indicate topologies judged to be significantly worse than the most parsimonious tree by the test of Templeton (1983) as implemented in PROTPARS (Felsenstein 1993). ’ eUnconstrained trees placed the tunicate outside of all vertebrate LDHs.

sible with the characterization of LDH enzymatic activity; such investigations should detect additional loci that have diverged extensively in sequence but not in function. The most thorough study of this type in tunicates is that of Baldwin, Mortimer, and Patak (1988) on P. stoloniferu. Nondenaturing electrophoresis and histochemical staining of crude extracts of whole animals and a number of tissues revealed evidence for a single locus, as did analyses of purified LDH activity. Similar electrophoretic investigations of LDH activity suggest that echinoderms (Baldwin, Patak, and Mortimer 1992) and amphioxus (Fisher et al. 1980) also possess a single LDH locus. Assuming the generally accepted phylogeny (Peterson 1995) of (echinoderms, (tunicates, (amphioxus, (vertebrates)))), the simplest explanation of these locus numbers and our phylogenetic results is that the ancestral state of deuterostomes (a single LDH locus) has been retained by echinoderms, tunicates, and amphioxus, while LDH duplications have occurred within vertebrates after their divergence from amphioxus. This hypothesis was proposed previously by Fisher et al. (1980). ’ If, in spite of evidence to the contrary, an additional tunicate LDH locus did exist, the implications for LDH evolution in vertebrates would depend on the phylogenetic position of the new locus in the LDH tree. Clustering with the existing tunicate locus would imply a duplication in tunicates after their divergence from vertebrates and would not alter hypotheses of the timing of vertebrate gene duplications. Because of the phylogenetic position of the known tunicate locus, any other position of the new locus would require that one of the two types of loci was lost in all vertebrates. Placement of the new locus within the clade consisting of vertebrate LDHs would require the loss of at least one type of locus in the tunicate as well. In other words, even if an additional tunicate locus is discovered, it will not be possible to postulate that particular vertebrate duplications occurred earlier than the divergence of tunicates and vertebrates without also postulating a more complex pattern of gains and losses.

The Location

of the Root in the Tree of LDHs

We described above the evidence that the LDHs of Synechocystis, Plasmodium, and Toxoplasma are not orthologous to the other LDHs in the tree and may in fact be more closely related to tetrameric MDHs. With the exception of a weakly supported node placing Bijidobacterium LDH closer to plant and animal LDHs than are other bacterial LDHs, the tree in figure 3 (rooted at the midpoint) depicts a phylogeny of the remaining LDHs corresponding to the organismal phylogeny of (bacteria, (plants, (nematodes, (tunicates, (vertebrates))))). This provides support for a root separating monophyletic groups of bacterial and eukaryotic LDHs (joining an internal branch with 100% bootstrap support in fig. 3), as any other root position would require a greater number of gene duplication and loss events. The clustering of bacterial LDH outside of vertebrate LDHs has also been obtained in trees of LDHs rooted with MDH sequences (Iwabe et al. 1989; Cendrin et al. 1993; Wynne et al. 1996). LDH Evolution

from Bacteria

to Vertebrates

Determining whether large numbers of LDH gene duplications and losses have occurred outside of chordates requires more extensive collection of LDH sequences. Our phylogenetic results obtained with LDHs from a relatively small number of nonchordate taxa imply an evolutionary pattern of a single locus persisting in most lineages with occasional (and usually recent) gene duplications. Examples of gene duplications in nonvertebrate taxa include the appearance of LDH-A and LDH-B of Bacillus psychrosaccharolyticus after its divergence from the other Bacillus species in the tree and the LDH-A and LDH-B of barley, which resulted from a duplication after the divergence of barley from the other two members of the family Poaceae (fig. 3). Although the number of LDH loci in invertebrates has not been well characterized, Drosophila melanogaster is believed to have a single locus for L-LDH (Alahiotis et al. 1983), and the evidence for a single locus in an echinoderm (Baldwin, Patak, and Mortimer 1992) has been

1282

Stock et al.

discussed above. A number of insects and molluscs are believed to lack LDH activity altogether, which would be loss events according to our phylogenetic trees (e.g., fig. 3). However, the biggest remaining question in the evolution of invertebrate LDHs relates to the orthology of the loci encoding the proteins or enzyme activities that have been described (Long 1976; Siebenaller et al. 1983). All of the LDH sequences that we have analyzed and all known chordate LDHs are specific for the Lstereoisomer of lactate (Whitt 1984). A number of groups of invertebrates have D-LDHs, however, and the presence of D- or L-LDH appears to be mutually exclusive for any given taxon (Long 1976). Sequence analyses have shown that bacterial D-LDHs form a separate family from L-LDHs that is much more distantly related to the latter than are MDHs (Kochhar et al. 1992). No invertebrate D-LDH sequences are available to allow a determination of their relationship to L-LDH, although Siebenaller et al. (1983) suggested that the two types of LDH were not orthologous based on the sequence of a few short peptide fragments from a D-LDH of a horseshoe crab. If this hypothesis is correct, then L-LDH would have to have been lost and replaced by D-LDH at least four times in invertebrates (Long 1976). Relationships

Within

Vertebrate

LDHs

The relationships of vertebrate LDHs that we obtained in our analyses are generally consistent with those of Stock and Whitt (1992), Quattro, Woods, and Powers (1993), Quattro et al. (1995), and Stock and Powers (1995). There was strong support for the derivation of teleost LDH-C from LDH-B after the divergence of teleosts and tetrapods and a derivation of mammalian LDH-C from LDH-A. This contradicts other sequence analyses that suggested that LDH-C represents the earliest divergence among vertebrate LDHs (Li et al. 1983; Crawford, Constantino, and Powers 1989; Tsoi and Li 1994; Tsuji et al. 1994; Mannen et al. 1996). The results of these analyses differed from ours largely in the location of the root of the vertebrate LDH tree. We conducted a series of phylogenetic analyses with varying subsets of outgroup sequences to the vertebrate LDHs (not shown) and obtained a root in or between mammalian LDH-Cs and other LDHs only with highly divergent sequences as outgroups. The analyses of Tsoi and Li (1994) and Mannen et al. (1996), in which the position of LDH-C differed from ours with respect to topological relationships as well, were conducted with the unweighted pair-group with arithmetic mean (UPGMA) method, one that has been shown to be highly sensitive to unequal rates of evolution (Huelsenbeck and Hillis 1993). Our analyses also agreed with previous studies in providing strong support for recent duplications of LDH-B in Xenopus and the monophyly of a group consisting of LDH-B and its derivatives in teleosts and Xenopus. Although the lamprey LDH clusters with LDH-As as in the analyses of Stock and Whitt (1992), this result was not strongly supported, and at least one node strongly contradicted the orthology of killifish LDH-A with tetrapod and shark LDH-As. Such a result was also ob-

tained in other studies that included teleost LDH-A sequences (Tsoi and Li 1994; Tsuji et al. 1994; Quattro et al 1995). This phylogenetic result cannot be brought in line with the orthology of LDH-As by rerooting the tree. Whether or not it is artifactual may be addressed by analyses of the LDH-As of other groups of primitive bony fishes. The interrelationships of four major lineages of LDHs (shark LDH-A + tetrapod LDH-A + mammal LDH-C; teleost LDH-A, lamprey LDH, and jawed vertebrate LDH-Bs and derivatives) were not well resolved in our analyses, as indicated by low bootstrap support. The persistence of at least some of these low bootstrap values in the absence of an outgroup (analysis not shown) suggests that the sequence of more closely related outgroups (amphioxus is the only candidate invertebrate) will not by itself resolve all aspects of the phylogeny of vertebrate LDHs. The collection of additional vertebrate sequences may decrease or eliminate support for potentially artifactual groupings but is unlikely to result in their replacement with strongly supported alternatives. The resolution of the phylogeny of vertebrate LDHs may require analyses more sophisticated than the rather simple methods we have used, such as the assignment of different expected rate categories to different sites and the consideration of structural or functional information in weighting changes. Gene Duplications

in Early Vertebrate

Evolution

Regardless of the exact pattern of gene duplication leading to multiple LDH loci in vertebrates, our analyses suggest that all duplications occurred after the divergence of tunicates and vertebrates and that at least some of them occurred early in the diversification of vertebrates. This evolutionary pattern is the same as that suggested by Markert, Shaklee, and Whitt (1975) and Fisher et al. (1980). In the latter study, it was specifically proposed that the earliest LDH duplications occurred in jawless vertebrates after the divergence of vertebrates from cephalochordates (amphioxus). Ohno (1970) proposed that extensive gene duplications had occurred at two or more points in chordate phylogeny, based largely on evidence from genome size estimates. The first of these episodes of large-scale gene duplication was proposed to have occurred in the lineage leading to amphioxus and vertebrates after the divergence of tunicates, and the second in the fish or amphibian ancestors of amniotes. Ohno suggested that the latter duplication was due to polyploidy but that the first may have been due either to polyploidization or extensive tandem duplication. Fisher et al. (1980) interpreted the evolution of four isozyme systems (including LDH) as indicating one or more rounds of polyplodization in the early vertebrate or prevertebrate lineage. Holland et al. (1994) and Sharman and Holland (1996) surveyed the phylogenetic relationships and number of loci encoding genes involved in the regulation of development and also concluded that widespread gene duplications occurred at two periods in chordate evolution. These authors proposed a slightly different timing for the duplication episodes than Ohno (1970), however. The first phase was

Tunicate LDH

proposed to have occurred close to the origin of vertebrates after the divergence of this lineage from amphioxus, and the second near the origin of gnathostomes. Our results indicating that the duplication of LDH loci in vertebrates occurred after the divergence of tunicates is consistent with both the hypotheses of Ohno (1970) and of Holland et al. (1994) and Sharman and Holland (1996). A more precise understanding of the timing of the duplications leading to multiple LDH loci and their relationship to the duplications of other genes awaits the sequencing of the LDHs of amphioxus and hagfish. That the duplications leading to LDH-A and LDHB involved large genomic regions is suggested by the linkage relationship of these two genes in mammals; mouse and human Ldh-Bs are located on chromsomes 6 and 12p, respectively, and mouse and human Ldh-As are located on chromosomes 7 and llp (Debry and Seldin 1996). In both species, the chromosome segments containing Ldh-A and Ldh-B contain additional loci with paralogous relationships between the two regions, and it has been proposed that these chromosomal segments are ancestrally related, having arisen by a process such as polyploidization (Lundin 1993). A relatively recent origin of mammalian Ldh-C from Ldh-A, as we have suggested based on phylogenetic analyses of LDH genes, is consistent with the close linkage of these two loci (perhaps within 100 kb-Stubbs et al. 1994). Ldh-C and Ldh-A have also been shown to be linked in some fish species (Morizot 1990) despite evidence from sequence analyses that the former gene is closely related to Ldh-B. This latter result suggests that linkage arrangements in modern species may bear a complex relationship to gene duplication histories. Nevertheless, it would be interesting to compare phylogenies of chordate gene families containing paralogs linked to mammalian or fish LDH loci with the duplication history that we have proposed for the LDH gene family. Phylogenetic congruence between LDH and other linked gene families would provide additional evidence for our hypothesis.

BALDWIN, J., A. PATAK,

(Holothuria

1283

and K. MORTIMER.1992. Echinoderm

atra) lactate dehydrogenase: affinities with the

putative ancestral chordate enzyme. Biochem. Syst. Ecol. 20535-539. BIRKTOFT,J. J., R. T FERNLEY,R. A. BRADSHAW,and L. J. BANASZAK.1982. Amino acid sequence homology among the 2-hydroxy acid dehydrogenases: mitochondrial and cytoplasmic malate dehydrogenases form a homologous system with lactate dehydrogenase. Proc. Natl. Acad. Sci. USA 79:6166-6170. BJOURSON,A. J., and J. E. COOPER. 1992. Band-stab PCR: a simple technique for the purification of individual PCR products. Nucleic Acids Res. 20:4675. BZIK, D. J., B. A. Fox, and K. GONYER.1993. Expression of Plasmodium falciparum lactate dehydrogenase in Escherichia coli. Mol. Biochem. Parasitol. 59: 155-166. CENDRIN,E, J. CHROBOCZEK,G. ZACCAI, H. EISENBERG,and M. MEVARECH.1993. Cloning, sequencing, and expression in Escherichia coli of the gene coding for malate dehydrogenase of the extremely halophilic archaebacterium HaZoarcula marismortui. Biochemistry 32:4308-43 13. CRAWFORD,D. L., H. R. CONSTANTINO,and D. A. POWERS. 1989. Lactate dehydrogenase-B cDNA from the teleost Fundulus heteroclitus: evolutionary implications. Mol. Biol. Evol. 6:369-383. DEBRY,R. W., and M. E SELDIN.1996. Human/mouse homology relationships. Genomics 33:337-35 1. FELSENSTEIN,J. 1993. PHYLIP phylogeny inference package. Version 3.5~. University of Washington, Seattle. FISHER, S. E., J. B. SHAKLEE,S. D. FERRIS,and G. S. WHIT-T. 1980. Evolution of five multilocus isozyme systems in the chordates. Genetica 52/53:73-85. FROHMAN,M. A. 1990. RACE: rapid amplification of complementary DNA ends. Pp. 28-38 in M. A. INNIS,D. H. GELFAND,J. J. SNINSKY,and T J. WHITE, eds. PCR protocols: a guide to methods and applications. Academic Press, New York. HENDRIKS,W., J. W. M. MULDERS,M. A. BIBBY,C. SLINGSBY, H. BLOEMENDAL, and W. W. DE JONG. 1988. Duck lens Ecrystallin and lactate dehydrogenase B4 are identical: a single-copy gene product with two distinct functions. Proc. Natl. Acad. Sci. USA 85:7114-7118. HOLLAND,I? W. H. 1992. Homeobox genes in vertebrate evolution. BioEssays 14:267-273. Acknowledgments HOLLAND,I? W. H., J. GARCIA-FERNANDEZ, N. A. WILLIAMS, and A. SIDOW. 1994. Gene duplications and the origins of We wish to thank Patricia Schulte for the gift of vertebrate development. Development (Suppl.): 125-l 33. competent bacterial cells. This study was supported by HUELSENBECK, J. P, and D. M. HILLIS. 1993. Success of phyNSF grants BSR-87-17417 (to G.S.W.), BSR-87-18425 logenetic methods in the four-taxon case. Syst. Biol. 42: (to D.A.P.), BSR-88-15362 (to G.S.W. and D.W.S.), 247-264 BSR-90-22648 (to D.A.P.), and DEB 96-29572 (to IWABE, N., K.-I. KUMA, M. HASEGAWA,S. OSAWA, and T. D.A.P.), an NSF Predoctoral Fellowship in Genetics (to MIYATA. 1989. Evolutionary relationship of archaebacteria, D.W.S.), and an Alfred F? Sloan Postdoctoral Fellowship eubacteria, and eukaryotes inferred from phylogenetic trees of duplicated genes. Proc. Natl. Acad. Sci. USA 86:9355in Studies of Molecular Evolution (to J.M.Q.). 9359. JIN, S., M. DE JESCJS-BERR~OS, and A. L. SONENSHEIN. 1996. LITERATURE CITED A Bacillus subtilis malate dehydrogenase gene. J. Bacterial. 178:560-563. ABAD-ZAPATERO, C., J. I? GRIFFITH,J. L. SUSSMANN,and M. KANEKO,T., A. TANAKA,S. SATO, H. KOTANI,T. SAZUKA,N. G. ROSSMANN.1987. Refined crystal structure of dogfish MIYAJIMA,M. SUGIURA,and S. TABATA. 1995. Sequence M4 apo-lactate dehydrogenase. J. Mol. Biol. 198:445-467. analysis of the genome of the unicellular cyanobacterium ALAHIOTIS,S. N., A. ONOUFRIOU, M. FOTAKI, and M. PELESynechocystis sp. strain PCC6803. I. Sequence features in CANOS. 1983. Drosophila lactate dehydrogenase: developthe 1 Mb region from map positions 64% to 92% of the mental aspects. Biochem. Genet. 21:199-211. genome. DNA Res. 2:153-166. BALDWIN,J., K. MORTIMER,and A. PATAK. 1988. Do ascidians KOBEL, H. R. and L. Du PASQUIER.1986. Genetics of polypossess the ancestral subunit type of vertebrate lactate dehydrogenase? J. Exp. Zool. 246: 109-l 14. ploid Xenopus. Trends Genet. 2: 3 10-3 15.

1284

Stock et al.

KOCHHAR, S., I? E. HUNZIKER, I? LEONG-MORGENTHALER,and H. HJOTTINGER. 1992. Evolutionary relationship of NAD+dependent D-lactate dehydrogenase: comparison of primary structure of 2-hydroxy acid dehydrogenases. Biochem. Biophys. Res. Commun. 184:60-66. KOZAK, M. 1996. Interpreting cDNA sequences: some insights from studies on translation. Mamm. Genome 7563-574. KUMAR, S., K. TAMURA, and M. NEI. 1993. MEGA: molecular evolutionary genetics analysis. Version 1.Ol . Pennsylvania State University, University Park. LI, S. S.-L., W. M. FITCH, Y.-C. E. PAN, and E S. SHARIEF. 1983. Evolutionary relationships of vertebrate lactate dehydrogenase isozymes A4 (muscle), B, (heart), and C4 (testis). J. Biol. Chem. 258:7029-7032. LONG, G. L. 1976. The stereospecific distribution and evolutionary significance of invertebrate lactate dehydrogenases. Comp. Biochem. Physiol. 55B:77-83. LUNDIN, L. G. 1993. Evolution of the vertebrate genome as reflected in paralogous chromosomal regions in man and the house mouse. Genomics 16:1-19. MANNEN, H., S. C.-M. TSOI, D. B. PICKFORD, J. A. DONALD, L. J. GUILLETTE, and S. S.-L. LI. 1996. Sequences of the lizard cDNAs encoding lactate dehydrogenase (LDH) isozymes A (muscle) and B (heart). Gene 171:303-304. MARCHUK, D., M. DRUMM, A. SAULINO, and E S. COLLINS. 199 1. Construction of t-vectors: a rapid and general system for direct cloning of unmodified PCR products. Nucleic Acids Res. 19: 1154. MARKERT, C. L. 1984. Biochemistry and function of lactate dehydrogenase. Cell Biochem. Funct. 2:131-134. MARKERT, C. L., J. B. SHAKLEE, and G. S. WHITT. 1975. Evolution of a gene. Science 189: 102-l 14. MATSON, R. H. 1989. Distribution of the testis-specific LDHX among avian taxa with comments on the evolution of the LDH gene family. Syst. Zool. 38:106-l 15. MORIZOT, D. C. 1990. Use of fish gene maps to predict ancestral vertebrate genome organization. Pp. 207-234 in Z.-I. OGITA and C. L. MARKERT, eds. Isozymes: structure, function, and use in biology and medicine. Wiley-Liss, New York. OHNO, S. 1970. Evolution by gene duplication. Springer, New York. PETERSON, K. J. 1995. A phylogenetic test of the calcichordate scenario. Lethaia 28:25-38. QUATTRO, J. M., D. D. POLLOCK, M. POWELL, H. A. WOODS, and D. A. POWERS. 1995. Evolutionary relations among vertebrate muscle-type lactate dehydrogenases. Mol. Mar. Biol. Biotechnol. 4:224-23 1. QUATTRO, J. M., H. A. WOODS, and D. A. POWERS. 1993. Sequence analysis of teleost retina-specific lactate dehydrogenase C: evolutionary implications for the vertebrate lactate dehydrogenase gene family. Proc. Natl. Acad. Sci. USA 90: 242-246. SAMBROOK, J., E. E FRITSCH, and T. MANIATIS. 1989. Molecular cloning: a laboratory manual. 2nd edition. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. SHARMAN, A. C., and I? W. H. HOLLAND. 1996. Conservation, duplication, and divergence of developmental genes during chordate evolution. Neth. J. Zool. 46:47-67. SHEETS, M. D., S. C. OGG, and M. I? WICKENS. 1990. Point mutations in AAUAAA and the poly (A) addition site: effects on the accuracy and efficiency of cleavage and polyadenylation in vitro. Nucleic Acids Res. 18:5799-5805. SIEBENALLER, J. E, T. L. ORR, B. B. OLWIN, and S. S. TAYLOR. 1983. Comparison of the D-lactate stereospecific dehydrogenase of kmulus polyphemus with active-site regions of L-lactate dehydrogenases. Biochim. Biophys. Acta 749: 153-162.

STOCK, D. W., K. D. MOBERG, L. R. MAXSON, and G. S. analysis of the 18s ribosomal WHI’IT. 199 1. A phylogenetic

RNA sequence of the coelacanth Latimeria chalumnae. Environ. Biol. Fish. 32:99-l 17. STOCK, D. W., and D. A. POWERS. 1995. The cDNA sequence of the lactate dehydrogenase-A of the spiny dogfish (SquaZus acanthias): corrections to the amino acid sequence and an anlysis of the phylogeny of vertebrate lactate dehydrogenases. Mol. Mar. Biol. Biotechnol. 4:284-294. STOCK, D. W., and G. S. WHITT. 1992. Evolutionary implications of the cDNA sequence of the single lactate dehydrogenase of a lamprey. Proc. Natl. Acad. Sci. USA 89:17991803. STUBBS, L., E. M. RINCHIK, E. GOLDBERG, B. RUDY, M. A. HANDEL, and D. JOHNSON. 1994. Clustering of six human 11~15 gene homologs within a 500-kb interval of proximal mouse chromosome 7. Genomics 24:324-332. SWOFFORD,D. L. 1991. PAUP phylogenetic analysis using parsimony. Version 3.0s. Illinois Natural History Survey, Champaign. SYNSTAD, B., 0. EMMERHOFF, and R. SIREVAG. 1996. Malate dehydrogenase from the green gliding bacterium Chlorojlexus uuruntiucus is phylogenetically related to lactic dehydrogenases. Arch. Microbial. 165:346-353. TEMPLETON, A. R. 1983. Phylogenetic inference from restriction endonuclease cleavage site maps with particular reference to the evolution of humans and the apes. Evolution 371221-244. THOMPSON, J. D., D. HIGGINS, and T. J. GIBSON. 1994. CLUSTAL W improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positionspecific gap penalties and weight matrix choice. Nucleic Acids Res. 22:4673-4680. TSOI, S. C.-M., and S. S.-L. LI. 1994. The nucleotide and deduced amino-acid sequences of a cDNA encoding lactate dehydrogenase from Cuenorhubditis eleguns: the evolutionary relationships of lactate dehydrogenases from mammals, birds, amphibian, fish, nematode, plants, bacteria, mycoplasma, and plasmodium. Biochem. Biophys. Res. Commun. 205:558-564. TSUJI, S., M. A. QURESHI, E. W. Hou, W. M. FITCH, and S. S.L. LI. 1994. Evolutionary relationships of lactate dehydrogenases (LDHs) from mammals, birds, an amphibian, fish, barley, and bacteria: LDH cDNA sequences from Xenopus, pig and rat. Proc. Natl. Acad. Sci. USA 91:9392-9396. WHITT, G. S. 1969. Homology of lactate dehydrogenase genes: E gene function in the teleost nervous system. Science 166: 1156-l 158. 1984. Genetic, developmental and evolutionary aspects of the lactate dehydrogenase isozyme system. Cell Biochem. Funct. 2: 134-139. WOLFF, J., and H. R. KOBEL. 1982. Lactate dehydrogenase of Xenopus luevis luevis and Xenopus borealis depends on a multiple gene system. J. Exp. Zool. 223:203-210. WYNNE, S. A., D. J. NICHOLLS, M. D. SCAWEN, and T. K. SUNDARAM. 1996. Tetrameric malate dehydrogenase from a thermophilic Bacillus: cloning, sequence and overexpression of the gene encoding the enzyme and isolation and characterization of the recombinant enzyme. Biochem. J. 3171235-245. YANG, S., and S. E PARMLEY. 1995. A bradyzoite stage-specifically expressed gene of Toxoplasma gondii encodes a polypeptide homologous to lactate dehydrogenase. Mol. Biochem. Parasitol. 73:291-294. SHOZO YOKOYAMA,

Accepted

August

reviewing

29, 1997

editor