Proprotein convertases in amphioxus: Predicted ... - Semantic Scholar

1 downloads 0 Views 1MB Size Report
is performed by proprotein convertases which recently have .... 2). The complete coding sequences for amphioxus SPC2 and. SPC3 contain 689 and 774 aa, ...
Proc. Natl. Acad. Sci. USA Vol. 92, pp. 3591-3595, April 1995 Evolution

Proprotein convertases in amphioxus: Predicted structure and expression of proteases SPC2 and SPC3 (precursor processing/subtilisin-related endoproteases/prohormone biosynthesis/protochordates)

ANTHONY A. OLIVA, JR.*, DONALD F. STEINERt,

AND

SHU JIN CHAN

Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biology, The University of Chicago, 5841 South Maryland Avenue, Chicago, IL 60637

Contributed by Donald F. Steiner, December 21, 1994

neuroendocrine cells) (6-9), and in vitro transfection assays (12) have shown that these enzymes can cleave proinsulin appropriately. Here we report the cDNA sequencest and expression of SPC2 and SPC3 from the protochordate amphioxus. We have previously shown that amphioxus expresses an insulin-like peptide (ILP) mRNA (13). The putative precursor, proILP, is like proinsulin in that it contains a C-peptide which is flanked by paired basic residues, but it is also similar to the pro-insulin-like growth factors in that proILP also contains a carboxyl-terminal E domain which is probably removed during maturation. Our goals are to characterize the proprotein convertases which are expressed in protochordates and more specifically identify the convertases involved in processing proILP to its active form.

SPC2 and SPC3 are two members of a family ABSTRACT of subtilisin-related proteases which play essential roles in the processing of prohormones into their mature forms in the pancreatic B cell and many other neuroendocrine cells. To investigate the phylogenetic origins and evolutionary functions of SPC2 and SPC3 we have identified and cloned cDNAs encoding these enzymes from amphioxus (Branchiostoma californiensis), a primitive chordate. The amino acid sequence of preproSPC2 contains 689 aa and is 71% identical to human SPC2. In contrast, amphioxus preproSPC3 consists of 774 aa and exhibits 55% identity to human SPC3. These results suggest that the primary structure of SPC2 has been more highly conserved during evolution than that of SPC3. To further investigate the function(s) of SPC2 and SPC3 in amphioxus, we have determined the regional expression of these genes by using a reverse transcriptase-linked polymerase chain reaction (RT-PCR) assay. Whole amphioxus was dissected longitudinally into four equal-length segments and RNA was extracted. Using RT-PCR to simultaneously amplify SPC2 and SPC3 DNA fragments, we found that the cranial region (section 1) expressed equal amounts of SPC2 and SPC3 mRNAs, whereas in the caudal region (section 4) the SPC2to-SPC3 ratio was 5:1. In the mid-body sections 2 and 3 the SPC2-to-SPC3 ratio was 1:5. By RT-PCR we also determined that amphioxus ILP, a homologue of mammalian insulin/insulin-like growth factor, was expressed predominately in section 3. These results suggest that the relative levels of SPC2 and SPC3 mRNAs are specifically regulated in various amphioxus tissues. Furthermore, the ubiquitous expression of these mRNAs in the organism indicates that they are involved in the processing of other precursor proteins in addition to proILP.

MATERIALS AND METHODS Materials. Oligonucleotides were synthesized on an Applied Biosystems model 380B DNA synthesizer. Radioisotopes were purchased from Amersham. Plasmid pGEM4Z was obtained from Promega. RNA and Reverse Transcriptase-Linked Polymerase Chain Reaction (RT-PCR) Assays. Live amphioxus (Branchiostoma californiensis) were purchased from Pacific Bio-Marine (Venice, CA). Total RNA was isolated by tissue homogenization in guanidine thiocyanate and centrifugation through a 5.7 M CsCl cushion as described (13). RNAwas quantitated by absorbance at 260 nm (A2w = 1 at 40 ,g/ml) and its integrity was verified by UV fluorescence on denaturing (0.66 M formaldehyde) 1% agarose gels stained with ethidium bromide at 1 p.g/ml. cDNA was synthesized with a Moloney murine leukemia virus (M-MLV) reverse transcriptase kit purchased from GIBCO/BRL. After 1-hr incubation at 37°C, the reverse transcriptase was inactivated by heating to 100°C for 3 min and an aliquot was removed for PCR. For the subtilisin-related proprotein convertases, cDNA fragments which contained the catalytic domain were amplified by using the following degenerate oligonucleotide primers: SQ-3 (sense), 5'-CAYGGNACNCGNTGYGC; SQ-4 (sense), 5'-CAYGGNACNAGRTGYGC; QS-1 (antisense), 5'-YTGCATRTCYCTCCANGT; and QS-2 (antisense), 5'-YTGCATRTCNCGCCANGT, in which R = G or A, Y = C or T, and N = G, A, T, or C. PCR was performed for 40 cycles each at 94°C for 1 min, 50°C for 1 min, and 72°C for 1 min. For RT-PCR to simultaneously amplify amphioxus SPC2 (480-bp) and SPC3 (334-bp) cDNA fragments, the following primers were used: SPC2-1 (sense), GGATGTTGGACCAAC-

In eukaryotic cells, proteins destined for export or the plasma membrane surface are often synthesized as larger precursor forms which are proteolytically processed during subsequent transport and maturation. The initial endoproteolytic cleavage is performed by proprotein convertases which recently have been identified as members of the subtilisin family of serine endoproteases. These enzymes recognize a motif of paired or multiple basic residues with a strong preference for arginine at the P1 site (1-3). In yeast a single subtilisin-related proprotein convertase, kexin, has been found (4). However, mammals express at least six members of this family, including SPC1 (furin) (5), SPC2 (PC2) (6, 7), SPC3 (PC3/PC1) (8, 9), SPC4 (PACE4) (10), SPC5 (PC4) (11), and SPC6 (PC5/PC6) (12, 13). These enzymes are related by a similar catalytic core domain but differ markedly in their carboxyl-terminal sequences. We have investigated the structures of SPC2 and SPC3 and the functional roles of these enzymes in the processing of proinsulin to insulin. Studies have shown that both SPC2 and SPC3 are expressed in islet B cells (as well as in other

Abbreviations: ILP, insulin-like peptide; RT-PCR, reverse transcriptase-linked polymerase chain reaction; RACE, rapid amplification of cDNA ends. *Present address: Division of Neuroscience, Baylor College of Medicine, One Baylor Plaza, Houston, TX 75226. tTo whom reprint requests should be addressed. tThe sequences reported in this paper have been deposited in the GenBank data base (accession nos. U22051 and U22052).

The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact.

3591

3592

Proc. Natl. Acad. Sci. USA 92 (1995)

Evolution: Oliva et at 1

3

2

Table 1. Amino acid sequence identity between amphioxus and human (prepro)SPC2 and -SPC3 % sequence identity Amphioxus SPC3 Amphioxus SPC2 vs. human SPC3 Domain vs. human SPC2 16 24 Pre 52 37 Pro 74 84 Catalytic P 59 75 21 61 C terminus 55 71 Whole protein

4 -1353 -1078 -872

l- 603

-310 FIG. 1. Amplification of subtilisin-related proprotein convertase catalytic domains by PCR. Amphioxus cDNA reverse transcribed from 0.1 ,ug of poly(A)+ RNA was subjected to 30 cycles of PCR using oligonucleotide primers SQ-3/QS-1 (lane 1), SQ-4/QS-1 (lane 2), SQ-3/QS-2 (lane 3), or SQ-4/QS-2 (lane 4). Primer sequences are listed in Materials and Methods. The reaction products were analyzed by electrophoresis in a 5% polyacrylamide gel and stained with ethidium bromide. Molecular weight standards (labeled in terms of bp) were 4X174 DNA digested with Hae III.

CGTTC; SPC2-2 (antisense), CAGTGCCAGCGCGAAAACC; SPC3-1 (sense), TCCATCTTTGCCTGGGCGTC; and SPC3-2 (antisense), CGATCAGGTGCTGTACGTCTC. The template contained 100 ng of amphioxus RNA reverse transcribed into cDNA. PCR was performed for 30 cycles each at 94°C for 1 min, 56°C for 1 min, and 72°C for 1 min in 100 ,ul. Isolation of poly(A)+ RNA and Northern blot analysis were performed essentially as described (14). cDNA Cloning. Cloned amphioxus SPC2 and SPC3 cDNAs were obtained using the RACE technique as described by Frohman et al. (15) and by screening a phage cDNA library prepared from whole amphioxus RNA (13). Other standard molecular biological techniques were as described by Sambrook et al. (16).

RESULTS AND DISCUSSION To identify subtilisin-related proprotein convertase genes expressed in amphioxus we amplified whole amphioxus cDNA by using degenerate oligonucleotide primer sets which corresponded to highly conserved amino acid sequences in the catalytic domain of these proteases. When primer set SQ-3/ QS-1 was used in PCR a 600-bp fragment was obtained, but no bands of the expected size were observed with the other primer combinations of SQ-3/QS-2, SQ-4/QS-1, or SQ-4/QS-2 (Fig.

1). However, restriction enzyme analysis showed that the 600-bp fragment was heterogeneous. For example, when the fragment was digested with Bgl I multiple bands in the range of 120-480 bp were observed (data not shown). The heterogeneous nature of the 600-bp fragment was confirmed by plasmid cloning into pGEM4Z. From individual clones we have obtained four distinct cDNA sequences, of which the deduced amino acid sequence from one clone showed the highest identity (80%) to human SPC2, while another clone exhibited the greatest identity (78%) to human SPC3. These sequences were expanded by using the rapid amplification of cDNA ends (RACE) technique (15) and by screening cDNA libraries (Fig. 2). The complete coding sequences for amphioxus SPC2 and SPC3 contain 689 and 774 aa, respectively, whereas there are 638 and 753 aa in the human (prepro)SPC2 and -SPC3 sequences (Figs. 3 and 4). As summarized in Table 1, the amino acid sequence similarities were highest in the catalytic domains. In addition, amphioxus SPC2 contained an Asp-for-Asn substitution at position 317 (corresponding to residue 310 in human SPC2), as is the case in all other known SPC2 homologues (see below). In subtilisin the side chain of this Asn residue has been demonstrated to stabilize the transition state during catalysis (17). To form a similar hydrogen bond with the carbonyl oxygen next to the scissle bond, however, the Asp side chain in SPC2 would have to be protonated, and this is consistent with the probability that SPC2 in all species is active only in an acidic compartment such as the dense-core secretory granule. A high percentage of identity for SPC2 and SPC3 is also found in the P domain, which consists of 190-192 aa located on the carboxyl side of the catalytic domain. Recently, transfection experiments have shown that when successively truncated forms of SPC1 (furin) were expressed, deletions within the P domain were correlated with loss of catalytic activity (18). Downstream from the P domain amphioxus SPC2 and

Amphioxus SPC2 S391

D174 H215

w w

lw

Catalytic

Pre Pro

P-Domain

* pAPC-2-550

pAPC-2-650 .-

pAPC-2 -1200

pAPC-2-850

10Cbp

Amphioxus SPC3 D174 H215 _

_w

FA

S389

----I

i

pAPC3-1800

pAPC3-600 0

pAPC3-l110

0

I

FIG. 2. Molecular cloning of amphioxus SPC2 and SPC3 cDNAs. The organization of amphioxus SPC2 and SPC3 cDNAs is shown schematically. Arrows indicate clones obtained by the RACE technique (15); brackets indicate clones isolated from a cDNA library.

Evolution: Oliva et

at

CAGCACTAATCCTCT(3LCCTCTTCCZ,"AGAAAGAAAAAAAcTcAACCT=I GAAG,---m,

CCACAAACCACCAAAGCv

M VK

Y

L

L V

L

A

T

I

L

V

W

Y

Sci. USA 92

Acad.

P

VG

G

A

C

S

N

*A

P

P

Y

T

N

F

D

A

0

V

3593

11 200

44

R

1

(1995)

1000

FLLR

W

r-.AAAACGAGrAGAGAGGATAT-GGTGAAGCC~-GGTAGGATGGTTCTCCTCC

ITAAGAAAAAAGCTGACTGTGGTCAACCCGAGC%T-GCTGAATT~TC V

Natl.

Proc.

GGTCGACCTGTTGCCTA ACRTTKGYGACGGGYLrGTACTC RGCEQC RDLCFRGCHCCCTGARTAGG

300

L

77 400

QQC

CGGACCAATCGPGVCG ACTGGGAAAGSTCTCA,TG,CGTCAAAQQAGA

FRACTCGRACG

ill

GGGCGCjCCCATGTCCGCGGCAGGGCCAG'rGCGGCGCAGCAGAGCArGGCTGGAGAACGACATGCGGGTACGAGC.-GCAG-TACAACAACAGGGTT-TCCGCC .K RI G YN EV ND N YRQ I D I ND PL FP KQ0WY L L NTG GTAGGAACGCGGGTATAACGAAGGAATGACrAATCCGTCAGATcGACATCAACGACCCGCTCTTCCCCAACAGrGGTAcCCT,GCTGAATACTGGCCA

144 600

0 177 700

211 800

0 244

CGGTT.CAACAGCCATGGTACCCGCGTGCAGGAGAGG-TGGTGGGTAAGATCAACAACGGTCT,GTGC,GGAGTGGGCGTCAGG-TATG--GCGCACGAGTGGCAGG

900

277 1000

~C'GACGGACGACGGCCGTACTGTGGACGGGCCGCGGGAGCTCACCGTGCAGGCAATGGCGGACGGrGTCAATAAGGGGCGT,GGAGGAAAAGGCAGCATC.¶ V

W

A

S

G

D

G

G

S

Q

D

D

C

N

C

D

G

A

Y

S

S

W

M

T

I

S

I

S

N

A

D

N

i

311 1100

344

ACSGTrGGGCATCCGGAGACGGCGGGAGTCAGGACGACTGTAACTGTGACGGATACGCCTCCTCGATGTGGACCATCTCCATCAACTCCGCCATCAATGA

1200

377

CGGCCGCACCGCCCTTACGACGAGTCCTGC-CCTCCACTCTCGCCTCCACCTTCAGTAACG CGCGACAGcCA-CCGAGCCGcGFCGGCCACCACG l S 'V A LA LQ AN PN L

DL Y GN C TLKH SG T SA AA P EA AG V

1300

411

1400

T WR

DM0H

L T VL T

SKR NQL

YD

PV

H EWR

RN G VGL

444

E

TGACTIGGCGGGACATGCAGCACCTGACGGTCCIGACGTCCAAGCGGAACCAGCTGTACGACCCGGTGCACGAGTGGCGGCGGAACGGAGTGGGGCTGGA

1500

MA EDWNTmlV PK R FHCT G F NH L FGFG V LD AG SM V K G-I-ICAACCATCTGTTCGGGTTCGGCGTGCT.GGACGCGGGCTCCATGGTGAAGATGGCCGAGGACGGAACACTGT"ACCCAAACGGTTCcACT&GCACAGGG

1600

T SM SDA K PI

P

VEG

KV VV KL TT

D ACE

RYLL

GQ EN FV

ACATCCATGTCTGACGCCAAGCCGATCCCAGTGGAGGGGAAAGTAGTGGTCAAGCTGACCACTGACGCGT.LGCGAGGGCCAGGAAAACTTTGTGCGCTATC

477

511 1700

544 1800

577

CACACrCCCCAATGACCGATCACTCCAGAGGAAGGGCr'TCGACCGCTGGCCCTTCATGACCACTCACACCTGGGGGGAAGACcCTAGAGGAGACTGGGTG L

E

V

G

F

Q

G

D

E

P

Q

E

G

D

L

L

E

W

T

L

R

L

H

G

T4Q

P

S

P

I

Y

V

D

EE

KE TTN

EEEE

EE

EN G

P

677

KKG KE GGSS -

-

-

-

-

--

-

-

-

611 2000

VTGAGTTGGTC TCGG LAGMGCA KGAGGC QCLTCGLGTGSACGCGCTCAGGKCE EEGCECCCACT644AA Q EK E KT EA

1900

-

A 113-13 A~"

68 9 2300

CA,TAAAAGAGGAGGCTGATCAGCTGTATACATCTCTGGGTATCTAAAAATACCCTGGTTGCATTGAGrCTcACTCTAACTTATcATGTTATAATATTccGC CAGAGTATTGTATAAAAAAGATATATAATTGTACTTATTGTATCTTGGGTATGTGA,AAGGAAGTTAGTTCATCTGTGGAGTTCTGTGACCGGTTAGCATCT ATATTGCGTTGTGGCAATGATTTGTGGAGrAGTCAAATTGTATTAGATCAAAAAAAAAAAAACCCCcAAAAATAGA ATGCACACGTGrATTCACAATCAGAGCGAAAAGrGA,TTTTACACACGCACACACACGCACATATAGACACACACAAACTCAACGTGA,TAATTGGATAAGA

2400 2500 2600 2700

TAAAGAAAAATGATATTTTTGTCGTGAGCTAAATACTGTAcTAAAGATGAAGTACTTTGACAccAGGAACcCTATAcTGTTTAGATGcGGCCGGCGAATT

2800

CAGCTTGGA

2809

Nucleotide and deduced amino acid sequence of amphioxus SPC2 cDNA. Putative cleavage site for the signal peptide is assigned to residue (arrow). The subtilisin-related catalytic domain is boxed and dots indicate residues involved in the catalytic mechanism. Arrowheads identify two potential glycosylation sites, and two potential signals (RDG) for binding to extracellular matrix proteins are overlined. An arrow also identifies the end of the P domain at residue 602. A carboxyl-terminal glutamic acid-rich sequence, also found in human and rodent SPC2, is underlined. FIG. 3.

30

carboxyl-terminal sequences which are not highly conserved, although the carboxyl termini of both human and amphioxus SPC2 contained a region enriched in Glu residues which is predicted to form an amphipathic helix (1). Overall, amphioxus SPC2 exhibited 71 % identity to human SPC2, whereas amphioxus SPC3 is 55% identical to human SPC3. Very recently, SPC2 and SPC3 sequences have been reported from a number of phylogenetically diverse species, including frogs (19), fish (20), molluscs (21-23), and hydra (24). Fig. 5 shows the amino acid identities in the catalytic and P domains of SPC2 and SPC3 from these species compared to the human homologues and plotted against the estimated time of evolutionary divergence. The results clearly show that SPC2 has evolved more slowly than SPC3. It is not known whether SPC3 contained

this

relatively

slow substitution rate observed for SPC2 has

a

possibility is that SPC2 may have a more ordered structure and/or restricted catalytic specificity. Northern blot analysis revealed a single 2.7-kb band for amphioxus SPC3 mRNA, while two bands, a major band at 4.0

functional basis, but

kb and

a

one

minor band at 7.0 kb,

were

observed for SPC2 mRNA

analysis indicate that the 7.0-kb SPC2 mRNA is due to extended transcription of the 3' untranslated region (data not shown). Although the SPC2 and SPC3 mRNAs could be easily detected on Northern blots, our attempts to perform in situ hybridization on thin (10- to 20-1km) saggital sections of whole mounted amphioxus were unsuccessful due to a high background. As an alternative, we have determined if there were

(Fig. 6). Preliminary

3'RACE

3594

Evolution: Oliva et at

Proc. Natl. Acad. Sci. USA 92 (1995)

GAATTCGCGGCCG;CCCCCCGl-.GGCATCCATCCTATCCAGGGTTGCTCTCCTGTGrGGITr-AGTCACGTCC-TCCGTGTTGGACTCT-TCGGCAGCTCCAAAG TCATC TCTTAGCTCTGTTTATATC ATCACAAGTGTATCACTTCTCGGACTGTGAGACCCACAACCT, GAAC=CGGTCGGGAACA I TSCATCTG;;

ACACGCTGAAACCTCGGTTGGAGACCCACATCTGGC', GCGGCGCGAATCGTGTTGAACAGTlTACGAG^AACGGGCGC G

M

R

F

Y

A W

L

V

L

I

A

A

V

F

CGCGGAACATGGCGTTAT TGCCTGCGTCC-TGCGATTGTGw T

N

T

W

A

I Y

E

V

G

G

P

D

R A

S

S

G*R

H

E

A

D

G

0 P

0

E

G

H

F

LCTCCTCCACGC-GCGTGCC-GAGGACGGAGACGkC AACCCCGCCACTT

D

L

L A

L

D

H

G

N

E

Y

L G Q

I

N

G

L

CACCAACCTGGGCGGTGGAATCTACGGAGGGCCCGACAGAGCTGACCTGTTAGCCCTAGACCATG&CTIACGAGAACTTGGGCAGATAGGGAACCTG E

D

H

Y

L

F

H K

R

D

V

P

H

R

R R G A

S

H

Q

H

T

K

R

D D E

L G

I Q

R

W

GAGGATCACTACCTCTTCAGACACAAGATGTGCCTCACCGGTCCCGCAGAGGGGCGCATCAGCACACCAGwGGCTGGGGGACATGAAAGGATACAAT V

O

A

O V G R

A

R

S

RIG

K

P

M

OQ R

G

R

Q

S

D

T

D

P

R

F

T

M

R

D

P

GGGTGGCTCAGCAGGTTGGGCGAGCCCGGAGrAAAGAGGACCCATGGGCCAGCAGCGCCGCCAATCAGACGACACCAGGCCCATGACCTTCCGTGACCC W E

Y

K

Q

W

Y

L

H

D

T

R

T

S

T

L

N

P

K

L

D

L

H

L

V

P

K

V W R

G

T

r

TT-ACTGAAACAGTGGTATC'GGCaCGACACAAGAACTTCGCAAACCS"GCCCAAGCTAGAC'-CGTACTTCCGGTCTGGAGAAGGCTCACG G

K G I V V A V L D D G I E K D H P D L V D N Y D P D A S Y D F N D GGAAAAGGATCGTCGTGGCCG . GTTGGACGACGGTCGiAAAAGGACCATCCAGACCTCGTGGAACTATGACCCCGiATGCAAGCTATGACTTCAACG

D

D

N

D

Q

P

R

P

E

Y

E

T

E

N

N

K

H

G

T

R

G

A

C

I

E

A

H

A A

A S

S

N

N

S

E

ACAACGATGACGvACCCGCAACCGGATATGAGGAAACGAACG- AAACAAGCAGGACCTGiTGCGGGAGATCGCCAGCAGCCAACAACTCGG; C

G V

I

G

F

A

R

N A

I

G

G

R

V

M

L

D

G

V

V

D

ml

A

E

V

I

F

G

N

GT-GCGiGCGTGGGATAGCTT. AATGCAAGATAGaGGTGTGOCGiCATGCT.GGACGGiAGiTGGTGACTGACeCCGTGGAGGCAG;:.-CCATCGGTCAA I

Q H

D

V

I

Y

S

S

A

P

G

w

D

N

D

G

T

K

E

V

G

E

P

L A R A A

K

E

F

G

K

ATCCAGCACGTGGACATCTACAGCGCCTCGTGGGACCCAACGACGACGGCAGACCGTGGAGG;GCC'-CCrGAAGCTLCGCTCGGGCGcCTTCAGAAGG V

E

R

G

R

G

G

K

G

S

-

I

A

A

W

G

S

N

G

G

S

N

G

D

C D C D G Y T S rGAC -.GG.GACCGC,C.ACACCAG

N

G_GTTAGGAGGTGGAGAAGGCA,.TGCCTGGGCGTCCGGCAAC-GGGGCAGTAACG GGCAACTGC

100 200 300 31 400 64 500 98 600 131 700

164 800 198 900 231 1000 264 1100

298 1200 331 1300

S I Y T V S I S S A S Q Q G G S P W Y G E K C A S T L A T A Y S S CTCCATCTACACCGTGTCCATCAGCAGTGCG-,C'-CAGCAGCGGCGC"TUCGC-.GGTACGGCGAGAAGTGCG mCCACGC-TGGCCGUACACG

364 1400

G E

A G V CGGGG

398 1500

L L V W L -S--S--N- -PTGC'. GGCGCTGGCTCTGGAGGAACC-CGAAC-:;~GC'-GGACGAGCTACAGCACCT,GATCvTGTGACkCMCGAGTACGAC -.'-1CTCC>GCACCC

431 1600

G W F Q N G A G L W V N S R F G Y G L L N A E A M V D M A L T W K CGGCTGGCCAGAATGGCGCCGCCTGTGGGTCAACTCCCGGTTCGGCTACGGCCTGCTGAACGCCGAGGtGATGGTGGACATGGCGCTGACGTGGAAG

464 1700

Y

K

D

I

Q K

S

S

T

7r

D

H

E

H

C

T

D

S

T

H

G

T

S

A A A ° L CGCAC

A

GGG7GACAAGCAAATAAGiCAGCkCGGCCTGCATCAC -AG. G-ACGGACACUCACGAATCAGCCGCGG

lw

I-T- W--R--D--V -Q--H- -L -I--

--L--A--L -A- -E--A-- N--P--N

L.

--T S--E- Y- -D--P

G

498

ACTGTACCGAGAAGACCAAGTGTGAAGTTAGGATTGAGAACTTCCAGCCGAGGGACCTGGGACGGAGAAGAGATCATCATAGAGCTGGAGACTGACG

1800

T

V

E

P

K

T

N

K

C

E

H V E

A

E

I

R

V

F

N

P

Q

R

D

L

G

N

E

G

I

E

I

I

L

E

E

T

D

GTTGCCGTGGCCAACCACGTGGAAGCTCTGGAGCACGTGCAGGTCAAAACGACGATCGACTACTCCCGCCGGGGGGACCTGAGGATAGWGCTTACGTC

531 1900

P S G T S T T L L D T R R Q D K S Q M G F Q D W P F M S T H N W G ACCGTCAGGAACCAGCACCACCCTCCTGGACACCCGCCGTCAGGACAAGTC-CAGATGGGATTCCAGGACT;GGCCCTTCATGTCCACTCACAACTGGGGA

2000

C

E

R

K

Q

G

P

K

G

Q

T

W

L

L E

E

I

T

H V

D

Q V K T T

D

S

K

H

A

I

D

N

E

Y

N

S

G

R

V

G

R

V

K

L

D

D

V

V

I

R

I

L

L

V

L

T

H

S

T*

G

GAGAAGCCACAGGCAGrGGAkCACTCACCATAGAGGACAAGTC-TGACCACGCGGAAAACAACGGTGTTG4-GAAGGATGTCGTGCTGATTCTGCACGGAA P

E

Q

P

A

Y

G

S

Q

G

R

Y

V

T

D

N

Y

R V

Q

G

D

N

R

E

V

I

A

S

K

A

A A

CCCCGGAACAGCCGGCCTACCAGTCCGGAGGG;&GGGTGTACACTrGACTACAACCGCGrGCAGGGCGJvCAGAAATGTGGAAATATCGGCCGCTAAAGCCGC

564

598

2100 631

2200

CCCAGCCCCCGCTGCGGATGCCGCCCCCGCCCGGGGGTCAGAGGAACGGATCCCCGGCTCTGGTCCCTTCGGCTCTGCTGCGAGCGTGGTTATCGAAGAG

664 2300

I P E K E T E F L V N W Q D G M N R Y E T D F N P V N S G P F E P S ATTCCTGAAAAAGAAACAGAGTTCTTGGTGAACTGGCAAGACGGAATGAATCGCTACGAGACTGAC= CAACCCCGTCAACTCAGGCCCGTTCGAGCCCT

2400

P

A

A

P

D

A

A

A

G

D

S

A

D

A

V

M

Y

R

A

P

D

G

E

A

S

E

D

E

L

R

R

IP

W

P

E

G

A

S

G

F

F

P

K

Y

G

L

S

A

R

0

A

S

V

V

M K S

Q

E

I

P

S

E

G

Q

CAGCTGATGCCGGATCAGACGTATACATGGATGCCGAAGACCTTCGTCCTTGGGaGGCTTTCAAGTACCTGCAACGGCAAATGAAATCACCGAGCGGCCA R

H

P

A

H

A

Y

P

K

N

S

S

Q

Y

I

R

D

P

W

V

S

Q

N

A

L

D

K

E

A

N

L

V

K

ACGTCATCCAGCGCATGCGTACAACAAGCCATCGTCACAGTACATCCGGGACCCGTGGGTATCCCAGAATGCCCTGGACAAAGAGGCGAACCTGGTGAAA Y

Y

L

Q

L

L

G

Y

E

S

X

TACTACCTACAGCTGCTGGGCTACGAATCATGAAAAAAAAACGTGGATGAAGAAAATAAAACGGAACACCAGAAACTGGAACGAAAAATAAACTGGTCTT AGAT-A

698 731 2500

764 2600

774 2700 2725

FIG. 4. Nucleotide and deduced amino acid sequence of amphioxus SPC3 cDNA. Putative cleavage site for the signal peptide is assigned to residue 19 (arrow). The subtilisin-related catalytic domain is boxed and dots indicate residues involved in the catalytic mechanism. Arrowheads identify two potential glycosylation sites and a potential signal (RDG) for binding to extracellular matrix proteins is overlined. An arrow also identifies the end of the P domain at residue 598. any regional differences in the expression of SPC2 and SPC3. Whole amphioxus were sliced into equal-length sections and total RNA was extracted from each section. To obtain the ratio of SPC2 to SPC3 expression we performed multiplexed RTPCR in which SPC2 and SPC3 DNA fragments were simultaneously amplified from the same RNA template. In calibration assays it was established that under the RT-PCR conditions, the amount of each DNA fragment obtained (as visualized by ethidium bromide staining) was roughly proportional to the amount of input RNA within the range of 20 to 2000 ng. Fig. 7 shows the results of a representative experiment in which 100-ng amounts of amphioxus RNA were subjected to RT-PCR. With whole amphioxus RNA as template, equally intense ethidium bromide-stained SPC2 and SPC3 DNA frag-

ments were obtained. This result indicated that whole amphioxus expressed approximately equal amounts of SPC2 and SPC3 mRNA. However, there were clear regional differences in the ratio of SPC2 to SPC3 expression. In the anterior (cranial) segment the SPC2-to-SPC3 ratio was 1:1, whereas in the posterior (caudal) segment the ratio was approximately 5:1 in favor of SPC2 (Fig. 7, lanes labeled 1 and 4). In contrast, the mid-body segments exhibited a reversed SPC2-to-SPC3 ratio of about 1:5. We also analyzed the expression of ILP mRNA in these segments by Northern blot and RT-PCR, and both analyses showed that ILP mRNA was predominately localized to segment 3 (data not shown). The strong conservation of primary sequence between mammalian and amphioxus SPC2 and SPC3 strongly suggests

Proc. Natl. Acad. Sci. USA 92 (1995)

EVOlUtiOn: OliVa et alb 550-

co

500-

)

3595

0

Mo Ra

'a

a) 450-

Ra \>

ci)

Xe

0

-i

I 1 1

2

1

3

1

4

1

0 c

400-

oAm

_ SPC2

0

D

Ly

350-

E

Am

z

_ SPC3

300-

Ap

250-

Hy

0

200

400 600 Time, Mya

800

1 000

FIG. 5. Evolutionary divergence of SPC2 and SPC3 sequences. Identity of amino acid sequences in the catalytic and P domains of SPC2 (0) and SPC3 (x) from various species compared with human SPC2 and SPC3 is plotted against the estimated elapsed time (Mya, millions of years) since the species diverged (25, 26). Species are as follows: Mo, mouse (7, 8); Ra, rat (25); Xe, Xenopus (19); Af, anglerfish SPC3 (20); Am, amphioxus; Ly, Lymnaea (21); Ap, Aplysia (22, 23); and Hy, hydra SPC3-like (24).

that these enzymes are capable of cleaving the C-peptide domain from proILP as they do in processing proinsulin. However, it should be noted that we have not yet demonstrated that these enzymes and ILP are coexpressed in the same cell. Indeed, the broad regional distribution and variable ratio of SPC2 to SPC3 expression suggest that these enzymes probably act on additional substrates in amphioxus. Possible substrates include other pancreatic and/or neuroendocrine hormone precursors such as proglucagon, prosomatostatin, and proopiomelanocortin, although none of these hormones has yet been identified in amphioxus. Whether SPC2 and SPC3 are necessary or sufficient to process proILP into a mature insulin-like peptide remains to be determined. An approach to the solution is to characterize the processing of proILP expressed in mammalian cells which contain SPC2 and SPC3. In particular, it is possible that the complete processing of proILP, particularly in the carboxylterminal extended sequence, will require an as-yetunidentified proprotein convertase capable of cleaving at single basic residues. In our analyses of SPC DNA fragments amplified by PCR, we have observed that amphioxus expresses

A

B

9.49 7.46 4.40 4.40 -

__ W

2.57

-

1.35 -

FIG. 6. Northern blot analysis of amphioxus SPC2 and SPC3 mRNAs. A 5-,ug sample of amphioxus poly(A)+ RNA was electrophoresed in a 1% agarose/0.66 M formaldehyde gel, blotted onto nitrocellulose, and hybridized with radiolabeled SPC2 cDNA (lane A) or SPC3 cDNA (lane B). RNA molecular weight markers (kb) were from BRL.

FIG. 7. Regional expression of SPC2 and SPC3 in amphioxus analyzed by RT-PCR. Whole amphioxus were sliced into four equal segments, as shown above (approximately life size); total RNA was isolated from each segment and subjected to RT-PCR amplification of SPC2 (480 bp) and SPC3 (334 bp) cDNA fragments. The reaction products were electrophoresed in a 1.5% agarose gel and stained with ethidium bromide.

at least two other subtilisin-related proprotein convertases in addition to SPC2 and SPC3. We thank Sean Martin for assistance in preparing the figures and Florence Rozenfeld for assistance in preparation of the manuscript. This work was supported by National Institutes of Health Grants DK 13914 and DK20595 and by the Howard Hughes Medical Institute. 1. Steiner, D. F., Smeekens, S. P., Ohagi, S. & Chan, S. J. (1992)J. Biol. Chem. 267, 23435-23438. 2. Smeekens, S. P. (1993) Bio/Technology 11, 182-186. 3. Van de Ven, W. J. M., Roebroek, J. M. & Van Duijnhoven, H. L. P. (1993) Crit. Rev. Oncogen. 4, 115-136. 4. Fuller, R. S., Sterne, R. E. & Thomer, J. (1988) Annu. Rev. Physiol. 50, 345-362. 5. Roebroek, A. J. M., Schalken, J. A., Leunissen, J. A. M., Onnekink, C., Bloemers, H. P. J. & Van de Ven, W. J. M. (1986) EMBO J. 5,2197-2202. 6. Smeekens, S. P. & Steiner, D. F. (1990) J. Bio. Chem. 265, 2997-3000. 7. Seidah, N. G., Gaspar, L., Mion, P., Marcinkiewicz, M., Mbikay, M. & Chretien, M. (1990) DNA Cell Biol. 9, 415-424. 8. Smeekens, S. P., Avruch, A. S., LaMendola, J., Chan, S. J. & Steiner, D. F. (1991) Proc. Natl. Acad. Sci. USA 88, 340-344. 9. Seidah, N. G., Marcinkiewicz, M. & Benjannet, S. (1991) MoI. Endocrinol. 5, 111-122. 10. Kiefer, M. C., Tucker, J. E., Joh, R., Landsberg, K. E., Saltman, D. & Barr, P. J. (1991) DNA Cell Biol. 10, 757-769. 11. Nakayama, K., Kim, W. S., Torii, S., Hosaka, M., Nakagawa, T., Ikemizu, J., Baba, T. & Murakami, K. (1992) J. Biol. Chem. 267, 5897-5900. 12. Smeekens, S. P., Albiges-Rizo, C., Carroll, R., Martin, S., Ohagi, S., Phillips, L. A., Benig, M., Gardner, P., Montag, A. G., Swift, H. H., Thomas, G. & Steiner, D. F. (1992) Proc. Natl. Acad. Sci. USA 89, 8822-8826. 13. Chan, S. J., Cao, Q.-P. & Steiner, D. F. (1990) Proc. Natl. Acad. Sci. USA 87, 9319-9323. 14. Davis, L. G., Dibner, M. D. & Battey, J. F. (1986) Basic Methods in Molecular Biology (Elsevier, New York), pp. 143-146. 15. Frohman, M. A., Dush, M. K. & Martin, G. R. (1988) Proc. Natl. Acad. Sci. USA 85, 8998-9002. 16. Sambrook, J., Fritsch, E. F. & Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Lab. Press, Plainview, NY), 2nd Ed. 17. Wells, J. A., Cunningham, B. C., Graycar, T. P. & Estell, D. A. (1986) Philos. Trans. R. Soc. London A 317, 415-423. 18. Hatsuzawa, K., Nagahama, M., Takahashi, S., Takada, K., Murakami, K. & Nakayama, K. (1992) J. Biol. Chem. 267, 16094-16099. 19. Braks, J. A. M., Culdemond, K. C. W., van Riel, M. C. H. M. & Martens, G. J. M. (1992) FEBS Lett. 305, 45-50. 20. Roth, W. W., Mackin, R. B. & Noe, B. D. (1993) Endocrinol. J. 1, 131-140. 21. Smit, A. B., Spijker, S. & Geraerts, W. P. M. (1992) FEBS Lett. 312, 213-218. 22. Ouimet, T., Mammarbachi, A., Cloutier, T., Seidah, N. G. & Castellucci, V. F. (1993) FEBS Lett. 330, 343-346. 23. Chun, J. Y., Korner, J., Kreiner, T., Scheller, R. H. & Axel, R. (1994) Neuron 12, 831-844. 24. Chan, S. J., Oliva, A. A., Jr., LaMendola, J., Grens, A., Bode, H. & Steiner, D. F. (1992) Proc. Natl. Acad. Sci. USA 89, 6678-6682. 25. Colbert, E. H. & Morales, M. (1991) Evolution of the Vertebrates: A History of the Backboned Animals Through Time (Wiley/Liss, New York). 26. Field, K. G., Olsen, G. J., Lane, D. J., Giovannoni, S. J., Ghiselin, M. T., Raff, E. C., Pace, N. R. & Raff, R. A. (1988) Science 239, 748-753.