Structure of simian immunodeficiency virus regulatory genes.

1 downloads 0 Views 994KB Size Report
Mar 8, 1989 - the SIV tat gene is modulated by cell-specific factors. The .... +e. + >+. + -> +. Nucleotide Sequence Analysis of the cDNA Clones. We have.
Proc. Natl. Acad. Sci. USA Vol. 86, pp. 4813-4817, July 1989 Biochemistry

Structure of simian immunodeficiency virus regulatory genes (tat/rev/nef)

SANDRA COLOMBINI*t, SURESH K. ARYA*, MARVIN S. REITZ*, LINDA JAGODZINSKIt, BARBARA BEAVER*, AND FLOSSIE WONG-STAAL* *Laboratory of Tumor Cell Biology, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892; and tMolecular Biology Department, Biotech Research Laboratories, Rockville, MD 20850

Communicated by Hilary Koprowski, March 8, 1989

Three full-length cDNA clones were obtained ABSTRACT from cells infected with the simian immunodeficiency virus (SIV) isolated from captive macaques (SIVmAc). Nucleotide sequence analyses suggested that these represented mRNA for the SIV MAC genes tat, rev (formerly, art/trs), and nef (formerly, 3'of). The putative tat-specific clone was active in trans-activation of the SIV MAC long terminal repeat in COS-1 and Jurkat cells. In contrast, the human immunodeficiency virus 1 long terminal repeat was si nificantly trans-activated only in the COS-1 cells. This suggests that trans-activation by the SIV tat gene is modulated by cell-specific factors. The structure of all of the clones suggested an mRNA splicing pattern more complex than that described for human immunodeficiency virus 1.

MATERIALS AND METHODS cDNA Library Construction. RNA was obtained from SIV MAC-infected HUT 78 cell lines (generously provided by P. Kanki and M. Essex, Harvard School of Public Health, Boston) by the guanidine thiocyanate procedure (16). The poly(A)-selected RNA [oligo(dT)-cellulose, Collaborative Research] was copied using cloned reverse transcriptase (Life Sciences, Gainesville, FL) and the Okayama-Berg pCDV vector primer (Pharmacia) (14, 15). The cDNA was tailed, HindIII-digested and circularized utilizing oligo(dG)tailed PL1 linkers (Pharmacia). Finally, the RNA strand was replaced with ribonuclease H (Pharmacia) and DNA polymerase I (Boehringer Mannheim). Competent HB101 cells (Bethesda Research Laboratories) were transformed with a recovery of 5 X 105 colonies. Duplicate filters of the library were screened with a U3-specific SIV LTR probe and a 17-mer oligonucleotide probe derived from the sequence of the U5 region of the SIV LTR. Four clones hybridized with both probes and were selected for further testing. In addition, several clones containing large inserts that hybridized only with the U3 probe were also studied. DNA sequencing of the selected clones was performed by the chemical degradation method of Maxam and Gilbert (17). DNA Transfection. DNA transfection of the cDNA clones and an indicator plasmid pSVIICAT (11) was carried out by the DEAE-dextran procedure using COS-1 cells or Jurkat uninfected lymphocytes. The COS-1 cell line is derived from African green monkey kidney cells and constitutively expresses the simian virus 40 large tumor antigen (18), which supports the replication of plasmids containing the simian virus 40 origin of replication, present in the vector used here. Cytoplasmic extracts were prepared 48 hr after transfection by three cycles of freezing and thawing. Aliquots were assayed at various times for chloramphenicol acetyltransferase (CAT) activity as described (19).

The human T-lymphotropic viruses share, among other common properties, the ability to regulate their own expression, mediated by trans-acting viral regulatory proteins. Human immunodeficiency virus 1 (HIV-1) has at least three regulatory genes (tat, rev, and nef) operating at various stages of virus expression (for a review, see ref. 1). In this report, we have adopted the nomenclature proposed by Gallo et al. (2). rev and nef were previously referred to as trs/art and 3'orf, respectively. It has been suggested that the interaction of these genes may determine the latent and replicative phases of virus infection and, therefore, have important implications in the progress of the disease. Furthermore, these regulatory genes are potential targets of anti-viral agents in addition to those applicable to all retroviruses. Another group of primate retroviruses has been described (3, 4). The human isolates are grouped under the name HIV-2. Similar viruses have been isolated from captive macaques (5, 6) and are called simian immunodeficiency virus (SIV MAC). They show only moderate sequence homology with HIV-1 but have a very similar genomic organization (7-10). All of them possess a trans-activator gene (tat) able to greatly increase expression directed by the viral long terminal repeat (LTR) (10-13). We were interested in studying the trans-regulatory functions of this virus subgroup to determine if they have similar mechanisms for viral gene regulation and might, therefore, be subject to types of intervention similar to those designed for HIV-1. Furthermore, comparison of the analogous sequences of corresponding genes ofthe two distantly related virus subgroups should help define the functionally relevant domains. We, therefore, constructed a cDNA library from SIV MAC-infected cells using a mammalian expression vector (14, 15). In this study, we report the cloning of functional cDNA clones with transactivation activity and present the complete nucleotide sequence of one such clone.§

RESULTS Trans-Activation Activity of cDNA Clones. We assayed the trans-activation capacity of our cDNA clones by cotransfection with a plasmid (pSV11) containing the putative tat target sequences (TAR) of the SIV LTR upstream of CAT (11). In addition, cotransfection experiments using pC15CAT (13), containing the TAR of HIV-1 and/or pCV-1, a functional HIV-1 tat gene cDNA clone, were carried out in parallel. The results are shown in Fig. 1. Although clones P8 and P6 were inactive, clones P5 and P11 showed marked trans-activating Abbreviations: SIV, simian immunodeficiency virus; HIV, human immunodeficiency virus; LTR, long terminal repeat; nt, nucleotide(s); ORF, open reading frame; CAT, chloramphenicol acetyltransferase. tTo whom reprint requests should be addressed. §The sequence reported in this paper has been deposited in the GenBank data base (accession no. M23165).

The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact. 4813

Biochemistry: Colombini et al.

4814 A

Proc. Natl. Acad Sci. USA 86

(1989)

and P6 are all full-length transcripts (judged by the W-P8, tz zpresence of U5 and 5' R sequences) and represent spliced FZ U U0U 0-0 00 :U u U 0 0 Y3 , U, mRNA tK GI> U GI tively. > > 0 > > > > (0> >I:>U + -> + + >+ Nucleotide Sequence Analysis of the cDNA Clones. We have + + +e determined the structures of the three spliced cDNA clones I I+1+ mby obtaining the complete nucleotide sequence of clone P11 * * * * and partial sequences of clones P8 and P6. The structures of W W ~ these clones are presented in Fig. 2. Clone P11 is a complete cDNA clone, starting at the cap site and ending in a (dT-dA)23 tail. A comparison with the available genomic sequences of SIV/HIV-2 (7-10) suggested the presence of at least four I " " < 4

L,




C

-

^^

~~

* " * * W W

splice junctions in this clone. The first exon starts at the junction between U3 and R and utilizes a donor splice site at position +64. By comparison of * * the sequence of this first exon with the sequence of the HIV-1 LTR, this splice site should correspond to the end of the TAR-responsive sequence in the HIV-1 genome (19). The second exon, which contains the tRNALYS binding site and B at -the leader sequence, utilizesnta4733' (acceptor) splice site is that I "< at system (donor) [numbering 5' 42 position 209 and Z < 4z F > :> F ,F> 1> _ -J. sequences are GT and AG, respectively. This first intron of i:A Un> >a + Fn + + en + + I I+ 145 nt contains the AATAAA polyadenylylation signal. The zfact that the same splicing pattern is evident in cDNA clones P8 and P6 as well strongly suggests that this is a true splicing event rather than an artifact of cDNA cloning. A consensus donor splice site has not been identified in the same region in HIV-1 LTR but is present in all the HIV-2 provirus sem quenced so far. The third exon of P11 utilizes an acceptor at position 5163 and a donor at position 5240 and is derived from the vif region. Sequences of this exon are not present in clones P8 and P6. The fourth exon of clone P11 starts in a region of the genome that was deleted in a cloned provirus previously sequenced in our laboratory (8). However, other FIG. 1. Activation of' SIV and HIV-1 LTR linked to the CAT gene SIV and HIV-2 proviruses contain these sequences (7, cloned HIV-1 and cDNA ,clones. Tat, rev, and nefare, SIV by respectively, 9, 10, 20). Clone P8 contains a shorter exon in this position, the P11, P8, and P6 clon ies. Clone P5 is described in the text, the utilizing a 3' splice site 214 nt further downstream, but uses HIV-1 tat is the cDNA cl1 one PCVI (13). Approximately 2 x 106 cells the same 5' splice site at position 5722. Clone P6 does not were transfected by the IDEAE-dextran protocol and the assay was performed 48 hr later. (A,) COS-1 cells. (B) Jurkat cells. Each value contain any sequences derived from this region. The fifth was determined in triplic,ate. exon is common to all three cDNA clones and utilizes an acceptor donor site at nt 7937. function. Consistent vvith the results of others using virusInspection of the complete nucleotide sequence of clone infected cells (10, 11),] HIV-i tat trans-activated the SIV LTR P11 (Fig. 3) revealed three ORFs, each starting with a in Jurkat cells, but SIV'tat (clone P11, see Fig. 1A, lane5) had methionine codon. The positions of these three ORFs are very little activity with the HIV-1 LTR. In contrast, in COS-1 similar to those of the tat, rev, and nef coding regions in a cells (Fig. 1B), we fouind complete reciprocity between SIV functional tat cDNA clone (pCV-1) of HIV-1 (13). The first tat/LTR and HIV-1 tat t/LTR pairs. These results suggest that ORF (nt 424-810) has a coding capacity of 130 amino acids cell-specific factors m tight be important in determining the and has significant amino acid identity (31.7%) with the specificity of trans-acti ivation. Clone P5 is a4-kilobase partial HIV-1 tat amino acid sequence. The fact that clone P11, reverse transcript of go enomic mRNA. It contains open readwhich contains this ORF, has strong trans-activating activity, ing frames (ORFs) for the nonstructural genes tat, rev, and while clones P8 and P6, which lack the ORF, do not, suggests that this ORF indeed encodes the tat of SIV. nef. This cDNA clorie also encodes the gpl60 envelope protein, as demonstral ted by immunoprecipitation of lysates Comparison of the SIV and HIV-1 Regulatory Genes. A of transfected cells (dalta not shown). By contrast, clones P11, comparison of the predicted amino acid sequences of the SIV W

W ~

(00 ~

00

VO

i >

' C; °JCC°V

5

0

J

U>

>

uY

+I

115

vox

LTR

sIV

IIr

CLONE 11 (tat)

DO

CLONE 8 (rev)

Do

CLONE 6 (nef)

go

gag

rif C

OI

e.nv a v. env a VDr

I

o

LT LTR

11 17

U I

FIG. 2. Diagrammatic representation of the tatsiv, revsIv, nefsIv gene cDNA clones. The coding region present on the three ORFs is marked by a solid box.

Proc. Natl. Acad. Sci. USA 86 (1989)

Biochemistry: Colombini et al. R

30

60

1140 1110 TAGGAGAGGTGGAAGATGGATCCTCGCAATCCCTAGGAGGATTAGGCAAGGGCTTGAGCT L G e V E D G S S Q S L G G L G K G L S

U5-4 F-

1170 1200 CACGCTCTTGTGAGGGACAGAAATACAATCAGGGGCAGTATATGAATACTCCATGGAGAA S R S C E G Q K Y N Q G Q Y M N T P W R

TAR REGION

GAGTCGCTCTGCGGAGAGGCTGGCAGATTGAGCCCTGGGAG(

AO,

r4

U5

90

CTCTCCAGCACTAGCAGTCGCCGCCTGGTCAACTCGGTACTCGGTAATAAGAAGACCCTG - tRNA

" 3 --4 LEADER

GTCTGTTAGGACCCTTTI

180 150 SEQUENCE AAACCGAAGCAGGAAAATCCCTAGCAGATTGGC

210 240 GCCCGAACAGGGACTTGAAGGAGAGTGAGAGACTCCTGAGTACGGCTGAGTGAAGGCAGT 270 300 AAGGGCGGCAGGAACCAACCACGACGGAGTGCTCCTATAAAGGCGCGGGTCGGTACCAGA

1230

1260

ACCCAGCTGAAGAAAAGGAAAAATTAGCATACAGAAAACAAAATATGGATGATATAGATG N P A E E K E K L A Y R K Q N M D D I D

I-U3

1290

1320

AGGAGGATGATGACTTGGTAGGGGTATCAGTGAGGCCAAAAGTTCCCCTAAGAGCAATGA E E D D D L V G V S V R P K V P L R A M

1350 1380 CTTACAAATTGGCAATAGATATGTCTCATTTTATAAAAGAAAAGGGGGGACTGGAAGGGA T

Y

K

L

A

I

D

M

S

H

F

I

K

E

K

G

G

L

E

G

330 360 CGGCGTGAGGAGCGGGAGAGGAGGAGGCCTCCGGTTGCAGCGGGAGAAGTGAGAAGGGCC

1410 1440 TTTATTACAGTGCAAGAAGACATAGAATCTTAGACATGTACTTAGAAGGCATCATACCAG

390 p20 ATCAGGGGAGAACAACTGCTGTCTTGCTGCAAGTTCCCGAGAGCTCATAGGTACCAGACA

1470 1500 ATTGGCAGGATTACACCTCAGGACCAAGAATTAGATACCCAAAGACATTTGGCTGGCTAT D W Q D Y T S G P R I R Y P K T F G W L

450 480 TGGAGACACCCTTGAGGGAGCAGGAGAACTCATTAGAATCCTCCAACGAGCGCTCTTCAT E T P L R E Q E N S L E S S N E R S S

i--TAT M

510 540 GCATTTTAGAGGCGGATGCAACCACTCCAGAATCGGCCAACCTGGGGGAGGAAATCCTCT C I L E A D A T T P E S A N L G E E I L I-TRS M Q P L Q N R P T W G R K S S 570 600 CTCAACTATACCGCCCTCTTGAGGCGTGCTATAACACATGCTATTGTAAAAAGTGTTGCT S Q L Y R P L E A C Y N T C Y C K K C C L N Y T A L L R R A I T H A I V K S V A 630 660

ACCATTTCCAGTTTIGTTTTCTTAAAAAGGGATTGGGGATATGTTATGAGCAGTCACGAA Y

Q

C

H T

I

A

F S

C

F

L

F V

F

L

K K G L G I C Y E Q S K R D W G Y V M S S

R

K

R

R

R T P K K A K E EL R K R L

A

N

T

S

A

S

S

K

N

L

R E LI H L L H Q T N S 750 780 TACCCAACAGGACCCGGCACTGCCAACCAGAGAAGGCAAAGAAGGAGACGGTGGAGAAGG I P N R T R H C Q P E K A K K E T V E K Y P T G P G T A N Q R R Q R R R R W R R 810 840 CGGTGGCAACAGCTCCTGGCCTTGGCAGATAGAATATATTCATTTCCTGATCCGCCAACT A V A T A P G L G R TAT-I R W Q Q L L A L A D R I Y S F P D P P T 870 900 GATACGCCTCTTGACTTGGCTATTCAGCAACTGCAGAACCTTGCTATCGAGAGCATACCA D T P L D L A I Q Q L Q N L A I E S I P

930

960

GATCCTCCAACCAATACTCCAGAGGCTCTCTGCGACCCTACGAAGGATTCGAGAAGTCCT D

P

P

T

N

T

P

E

A

L

C

D

P

T

K

D

S

R

I

Y

Y

S

A

R

R

H

R

S

I

L

D

M

Y

L

E

G

I

I

P

1560 1530 GGAAATTAGTCCCTGTAAATGTATCAGATGAGGCACAGGAGGATGAGAGGCATTATTTAA K L V P V N V S D E A Q E D E R H Y L

W

1590 1620 TGCAGCCAGCTCAAACTTCCAAGTGGGATGACCCTTGGGGAGAGGTTCTAGCGTGGAAGT M Q P A Q T S K W D D P W G E V L A W K

1650 1680 TTGATCCAACTCTAGCCTACACTTATGAGGCATATGTTAGATACCCAGAAGAGTTTGGAA F

D

P

T

L

A

Y

T

Y

E

A

Y V R

Y

P

E

E

F

G

1710 1740 GCAAGTCAGGCCTGTCAGAGGAAGAGGTTAGAAGAAGGCTAACCGCAAGAGGCCTTCTTA S K S G L S E E E V R R R L T A R G L L

E

S 720 690 AGAGAAGAAGAACTCCGAAAAAGGCTAAGGCTAATACATCTrCTGCATCAAACAAACTCA

4815

1770

1800

ACATGGCTGACAAGAGGGAAACTCGCTGAGACAGCAGGGACTTTCCACAAGGGGATGTTA N M A D K R E T R 3'ORF 1830

1860

TGGGGAGGAGCCGGTCGGGAACACCCACTT'TCTTGATGTATAAATATCACTGCATTTCGC TATA BOX 1890 1920 U3- I-R TCCGTATTCAGTCGCTCTGCGGAGAGGCTGGCAGATTGAGCCCTGGGAGGTTCTCCAG

1950 1980 CACTAGCAGGTAGAGCCTGGGTGTTCCCTGCTAGACTCTCACCAGCACTTGGCCAGTGCT 2040 R _--4 2010 GGGCAGAGTGGCTCCACGCTTGCTTGCTTAAAGACCTCTTCAATAAAGCTGCCATTTTAG

P

1020 990 CAGGACTGAACTGACCTACCTACAATATGGGTGGAGCTATITCCATGAGGCGGTCCAAGC Q D TRS-4 -3'ORF H G G A I S H R R S K

2060 AAGTAMAAAAAAAAAAAAAA

1080 1050 CGACTGGAGATCTGCGACAGAAACTCTTGCGGGCGCGTGGGGAGACTTATGGGAGACTCT P T G D L R Q K L L R A R G E T Y G R L

FIG. 3. Nucleotide sequence of clone P11. The first exon of the clone corresponding to HIV-1 TAR-responsive sequence by nucleotide alignment is marked as are the others-regulatory elements and the splice sites. The derived amino acid sequence is presented using the single-letter amino acid code. TRS, rev; 3'ORF, nef.

and HIV-1 tat genes revealed some interesting features that may be useful in localizing of the functional domains of these genes (Fig. 4A). The SIV tat gene is larger by 44 amino acids. There is little homology between the two genes except for a central core sequence that includes two potential nucleic acid binding domains. The first is a cysteine-rich region that has been shown to be involved in metal binding and dimerization of the tat protein (22). A second highly conserved potential nucleic acid binding domain is located downstream and contains a hydrophilic, highly basic stretch of lysine and arginine residues. It has been shown (13) that a tat gene lacking sequences on the 3' side of this hydrophilic stretch still retained significant trans-activating potential. Consistent with the idea-that these 3' sequences are not essential is the low homology between HIV tat and SIV tat in this region. The second ORF of clone P11 (nt 498-968) encodes 157 amino acids as shown (Fig. 2). Clone P8 contains a truncated version (106 amino acids) of this ORF and uses an alternate downstream 3' splice junction. Although both ORFs are potentially equivalent to the rev gene of HIV-1, the shorter ORF of clone P8 more closely approximates the size of the HIV-1 rev. Alignment of the shorter orf with rev from the first methionine residue showed a high homology with the first half of the gene, which includes a stretch rich in arginine residues (Fig. 4B).

The third ORF, which is present in all cDNA clones, is derived from the 3' end of the provirus genome and corresponds to the nef gene of HIV-1. Clone P6 contains only this ORF and presumably is derived from the mRNA of this gene. Alignment with the HIV-1 nef gave only 30% amino acid identity, but the overall size and hydropathy of the predicted proteins are similar. Moreover, a possible core sequence is 70%o identical to the nef of HIV, as observed (8).

DISCUSSION We have obtained cDNA clones corresponding to the regulatory genes (tat, rev, and nef) of SIV. The tat cDNA clone was functional in trans-activation of a reporter gene (CAT) linked to the SIV LTR. A complete reciprocity in transactivation was observed between SIV and HIV-1 when the assay was carried out in COS-1 cells. However, in human T cells (Jurkat), the HIV-1 tat trans-activates the SIV LTR much more efficiently than is true with the reciprocal heterologous system. Analysis of the structures of these cDNA clones suggested some distinctive features of mRNA processing for SIV. First, an intron is found within the LTR. It has not been reported that HIV-1 splices out this intron. The TAR-responsive region for HIV-2 has been found to be longer than the HIV-1

4816

Biochemistry: Colombini et al.

Proc. Natl. Acad Sci. USA 86

(1989)

A HIV 1 BH-10

STLV III HIV 2 ROD

L E P N K H P G

M E P V D P R

II M E T P L R E i I 1 1I:

II II

Q E N S L E S S N E R S S C I L E A D A T T P E S A

II

I

M E T P L K AP E S S L K S C N E PF S R T S E Q D V A T Q E L A

P K T A C

SQ

HIV 1 BH-10

II

T N C Y C K K C C F H C Q V C

I I I 11 I I II IIII A C Y N T C Y C K K C C Y H C Q I I I: I II I I I I I

F C

STLV III

N L G E E I L S Q L Y R P L E

HIV 2 ROD

RQ G E E I L S Q L Y R P L E T C N N S C Y C K R C C Y H C Q M C

HIV 1 BH-10

F L T K G L G I S Y G

STLV

III

I I F L II

R K K R R Q R R R P P QG S Q T H Q V S

I I 1: 1 I I I K K G L G I C Y EQ S R K R R R T P K K E K K A N T S S A S N : I I I I III 1 1111111

KIG

R R R R T P K K TK T H PS PT PD K

HIV 2 ROD

F L N K G L G I C YER

HIV 1 BH-10

L S K Q P T SQR R G D P TG P K E

STLV III

A T A P G L G R ETVEKAl III II G R S I S T R T G D SQ P T K K Q K KT V EA T I E T D T

HIV 2 ROD

L I P N R T

II

K A K K

G P

B

SIV

M A G R S G D S D E E L I R T V R L I K L L YQS N P P P N P : I 1 1: 1I1 1:1 I : I I I MS S S H E R E E E L R K R L R L I H L L H Q T N S Y P T G

HIV 1 BH-10

M N E

HIV 2 ROD

E G T R QA R R N R R R R W R E R Q R Q I

HIV 2 ROD

I

R AD E E G L Q R K

I I

II I Q R R Q R R R R W R R R WQ I I I I I I I: II I I

II

I I I 1: 11 lI I I

Y P QG

L R L I R L L H Q T N

I1 I 1I

H

S I S E R I L G T

I:

SIV

P G T A N

HIV 1 BH-10

P G T A S

HIV 2 ROD

sIV

Y L G R S A E P V P L Q L P P L E R L T L DC N E DC G T S G : ~~~~~~~~~~~~~~~~~~~I I I F P D P P T DT P L D L A I Q Q L Q N L A I E S I P

HIV 1 BH-10

F

HIV 2 ROD

T Q G V G S P Q I L VE SP TV L E SG TKE

sIV

: I D P P T N T P E A L C D

HIV 1 BH-10

D P P

I I I I

QIR

Q L L I

:

II

R N R R R R WK Q R W RQ I L A L A D S I Y T

II

I

P D P P A D S P L D Q TI Q H L

I

II

T

I

H L

P

E SQ R L

Q

I

I

G L T I Q E L P

II P T K D S R S P Q D AE T

FIG. 4. Amino acid alignment for HIV-1 (strain HTLVIIIB), SIV, and HIV-2ROD tat (A) and rev (B) proteins. Gaps were introduced to optimize the alignment; vertical lines indicate amino acid identity; dots indicate conservative changed. The regions for nucleotide binding capacity are enclosed in boxes. HIV-1 sequences are from Ratner et al. (21) and HIV-2RoD sequences are from Guyader et al. (10). The single-letter amino acid code is used.

TAR and includes a stretch of 40 nt from the U5 region that are missing in HIV-1. These 40 nt are required for full trans-activation by SIV tat (23). These same nucleotides would be spliced out in the cDNAs that we analyzed. It has been suggested that tat may interact with either viral DNA or RNA to enhance viral gene expression transcriptionally or post-transcriptionally. The fact that part of the SIV TAR region is not present in the viral mRNA may indicate that at least one mode of tat action is mediated either at the DNA or at the RNA level before splicing occurs. Furthermore, the splicing of this intron destroys the stem-loop structure in TAR of the mRNA, which has been suggested to inhibit efficient translation (25). Consistent with this result is our

observation that SIV mRNAs are efficiently translated in cell-free systems (S.C., unpublished data). The second feature of SIV that differs from the previously accepted view of HIV-1 is that at least one form of the rev gene of SIV appears to be transcribed as a separate mRNA from that of tat, utilizing a methionine codon 214 nt downstream from the first methionine codon of its ORF. This laboratory has demonstrated (24) that rev of HIV-1 is indeed expressed from a distinct mRNA species. As shown (7, 8, 11), the structural comparison of the regulatory genes of the two viral subgroups, HIV-1 and SIV/HIV-2, shows clusters of high homology. The conserved regions likely represent important functional domains of these

Biochemistry: Colombini et al. gene products. This suggests that SIV and HIV-2 diverged from a common ancestor more recently than from HIV-1. 1. Wong-Staal, F. (1987) in Viral Carcinogenesis: Functional Aspects, eds. Kjeldgaard, N. 0. & Forchhammer, J. (Munksgaard, Copenhagen), pp. 55-60. 2. Gallo, R. C., Haseltine, W., Montagnier, L., Wong-Staal, F. & Yoshida, M. (1988) Nature (London) 333, 504. 3. Kanki, P. J., Barin, F., M'Boup, S., Allan, J. S., RometLemonne, J. L., Marlink, R., McLane, M. F., Lee, T. H., Arbeille, B., Denis, F. & Essex, M. (1986) Science 232, 238243. 4. Clavel, F., Guetard, D., Brun-Vezinet, F., Chamaret, S., Rey, M. A., Santos-Ferreira, M. O., Laurent, A. G., Dauguet, C., Katlama, C., Rouzioux, C., Klatzmann, D., Champalimaud, J. L. & Montagnier, L. (1986) Science 233, 343-346. 5. Daniel, M. D., Letvin, N. L., King, N. W., Kannagi, M., Sehgal, P. K., Hunt, R. D., Kanki, P. J., Essex, M. & Desrosiers, R. C. (1985) Science 228, 1201-1204. 6. Kanki, P. J., McLane, M. F., King, N. W., Letvin, N. L., Hunt, R. D., Sehgal, P., Daniel, M. D., Desrosiers, R. C. & Essex, M. (1985) Science 228, 1199-1201. 7. Hirsch, V., Riedel, N. & Mullins, J. I. (1987) Cell 49, 307-309. 8. Franchini, G., Gurgo, C., Guo, H. G., Gallo, R. C., Collalti, E., Fargnoli, K. A., Hall, L. F., Wong-Staal, F. & Reitz, M. S. (1987) Nature (London) 328, 539-543. 9. Chakrabarti, L., Guyader, M., Alizon, M., Daniel, M. D., Desrosiers, R. C., Tiollais, P. & Sonigo, P. (1987) Nature (London) 328, 543-547. 10. Guyader, M., Emerman, M., Sonigo, P., Clavel, F., Montagnier, L. & Alizon, M. (1987) Nature (London) 326, 662-669. 11. Arya, S. K., Beaver, B., Jagodzinski, L., Ensoli, B., Kanki, P. J., Albert, J., Fenyo, E., Biberfeld, G., Zagury, J. F., Laure, F., Essex, M., Norrby, E., Wong-Staal, F. & Gallo, R. C.

Proc. Natl. Acad. Sci. USA 86 (1989)

4817

(1987) Nature (London) 328, 548-550. 12. Clavel, F., Guyader, M., Guetard, D., Salle, M., Montagnier, L. & Alizon, M. (1986) Nature (London) 324, 691-695. 13. Arya, S. K., Josephs, S. F., Guo, C. & Wong-Staal, F. (1985) Science 229, 69-73. 14. Okayama, H. & Berg, P. (1983) Mol. Cell. Biol. 3, 280-282. 15. Okayama, H. & Berg, P. (1982) Mol. Cell. Biol. 2, 151-155. 16. Chirgwin, J. M., Przybyla, A. E., McDonald, R. J. & Rutter, W. J. (1979) Biochemistry 18, 5294-5299. 17. Maxam, A. M. & Gilbert, W. (1980) Methods Enzymol. 65, 499-560. 18. Gluzman, Y. (1981) Cell 23, 175-182. 19. Seigel, L., Ratner, L., Josephs, S. F., Derse, D., Feinberg, M. B., Reyes, G. R., O'Brien, S. J. & Wong-Staal, F. (1986) Virology 148, 226-231. 20. Hahn, B., Kong, L. I., Lee, S. W., Kuhar, P., Taylor, H. E., Arya, S. K. & Shaw, G. M. (1987) Nature (London) 330, 184186. 21. Ratner, L., Haseltine, W., Patarca, R., Livak, K. J., Starcich, B., Josephs, S. F., Doran, E. R., Rafalski, J. A., Whitehorn, E. A., Baumeister, K., Ivanoff, L., Petteway, S. R., Jr., Pearson, M. L., Lautenberger, J. A., Papas, T. S., Ghrayeb, J., Chang, N. T., Gallo, R. C. & Wong-Staal, F. (1985) Nature (London) 313, 277-284. 22. Frankel, A. D., Bredt, D. S. & Pabo, C. 0. (1988) Science 240, 70-73. 23. Emerman, M., Guyader, M., Montagnier, L., Baltimore, D. & Huesing, M. A. (1987) EMBO J. 6, 3755-3760. 24. Sadaie, M. R., Rappaport, J., Benter, T., Josephs, S. F., Willis, R. & Wong-Staal, F. (1988) Proc. Natl. Acad. Sci. USA 85, 9224-9228. 25. Parkin, N. T., Cohen, E. A., Darveau, A., Rosen, C., Haseltine, W. & Sonenberg, N. (1988) EMBO J. 7, 2831-2837.