The Aromatic Stacking Interactions Between Proteins ...

3 downloads 0 Views 474KB Size Report
hGSTA1-1, a lock-and-key motif is found at Phe52 and the .... in prokaryotic cold shock proteins (Csp) family [52]. Csp ..... while, β-amyloid peptide (Aβ) aggregation is the most acute ..... G.J. New crystal structures of human glutathione transferase A1-1 ... UDP-glucuronosyltransferase 2B7 (UGT2B7) and identification of.
Pre-Print Copy

The Aromatic Stacking Interactions Between Proteins and their Macromolecular Ligands Mohammad Mizanur Rahman1,2, Ziyad Tariq Muhseen1, Muhammad Junaid1 and Houjin Zhang1,* 1

Department of Biotechnology, College of Life Science and Technology, Huazhong University of Science and Technology, 1037 Luoyu Rd., Wuhan 430074, Hubei, China; 2Department of Genetic Engineering and Biotechnology, University of Rajshahi, Rajshahi-6205, Bangladesh Abstract: Aromatic stacking interactions arise from the attractive force between the π-electron clouds in the neighboring aromatic groups. The aromatic stacking is common between proteins and small molecules. The stacking interactions at the interfaces of proteins and other macromolecules are relatively rare. However it contributes to a significant portion of the stabilizing forces. In the proteinprotein complexes, aromatic interactions are involved in the protein oligomerization, such as dimer, trimer and tetramer formation. Also, aromatic residues can bind to nanoparticles through stacking interactions which offer them stronger affinity than other residues. These interactions play crucial roles in proteinnanoparticle conjugation. In the protein-nucleotide complexes, the specific recognitions are realized through stacking interactions between aromatic residues and the bases in the nucleotides. Many nucleoproteins use aromatic stacking to recognize binding site on DNA or RNA. Stacking interactions are involved in the process of mismatch repair, strand separation, deadenylation, degradation and RNA cap binding. They are proved to be important for the stability of complexes. The aromatic stacking is also the underlying reasons of many fatal diseases such as Alzheimer, cancer and cardiovascular diseases. The chemicals that can block the stacking interactions could have potential pharmaceutical values. In this review, we summarize recent finding regarding the functions of aromatic stacking interactions in the protein-macromolecule complexes. Our aim is to understand the mechanisms underlying the stacking-mediated complex formation and facilitate the development of drugs and other bio-products.

Keywords: Aromatic amino acid, carbon nanotube, π-π stacking, protein-nucleotide complexes, protein-nanoparticle interaction, protein-protein interaction. 1. INTRODUCTION Aromatic compounds contain cyclic aromatic rings that are unsaturated and planar [1]. The stability of these compounds results from the π -electrons situated above and below the plane of the aromatic ring. Together, these electrons form a π-electron cloud surrounding the ring. The flat face of an aromatic ring has a partial negative charge because of these π electrons. Aromaticity is a chemical term that refers to the characteristics associated with the cyclic and planar aromatic system. It can be regarded as a manifestation of cyclic delocalization and resonance. The well-studied model system of aromaticity is the benzene molecule [2]. The aromatic rings are present in the side-chains of three amino acids with 6-member ring: phenylalanine (F), tyrosine (Y), tryptophan (W). And aromaticity is also found in the 5member ring residue, such as histidine (H). These amino acids are called aromatic amino acids [3]. The aromatic rings of these residues interact with other aromatic systems by

*Address correspondence to this author at the Department of Biotechnology, Faculty of Life Science and Technology, Huazhong University of Science and Technology, Zipcode 430074, Wuhan, Hubei, China; Tel: +86 02787793085; Fax: +86 27 87793085; E-mail: [email protected]

attractive non-covalent force [4]. Such interactions are called aromatic stacking, π -stacking or aromatic-aromatic interactions. The common features of aromatic-aromatic interactions include (i) nonpolar interactions (ii) the distances of two phenyl rings fall between 4.5 Å to 7 Å, (iii) the dihedral angles range from 30° to 90° (iv) a typical non-bond energy of-1 to -2 kilocalories per mole [5]. Stacking between different aromatic systems is a prominent stabilizing force in the biomacromolecules and their related complexes [6]. Specifically, the aromatic residues in proteins play significant roles in the stability of interacting complex, in substrate binding as well as drug designs [7]. The main factors that affect the strength of aromatic stacking interactions are the geometry and proper orientations of interaction molecules [8]. A study of 2280 α-helices in 434 protein chains has observed that the geometry of the stacking system depends on the sequence between the interacting partners [9]. The study also demonstrated that His is the most conspicuous residue in the context of helix stability. Similarly, Samanta and coworkers analyzed the geometry of the aromatic side chains interactions involving Trp indole ring [10]. The Phe ring prefers either offset-stacked arrangement or perpendicular orientation with Trp by edge-to-face stacking, while Tyr prefers

face-to-edge or face-to-face fashion with Trp. Trp-Trp interaction prefers edge-to-face style. On the other hand, histidine preferentially stacks with Trp by parallel stacking. The energetically favorable stacking might come from the positively charged His imidazole ring under physiological conditions. As suggested by the mutagenesis assay, this charged His-Trp stacking can contribute to the protein stability by more than 4 kJmol-1 [11]. The aromatic stacking not only provides the stability and mechanical insight of proteins and protein/ligand complexes, but could also be implemented in the protein design process. The judicious placements of aromatic residues and the resulting geometric parameters could be useful tools in de novo protein design and might be useful for modeling protein interactions [12]. Additionally, aromatic residues can preferentially bind with nanoparticles to form protein-nanoparticle hybrid that has applications in many fields of science such as biomedicine, catalysis, and water treatment [13]. Based on ππ stacking interactions, some nanoparticles have already been developed such as carbon nanotubes (CNTs) [14], fullerene [15], graphene [16]. Furthermore, aromatic amino acids are key residues that interact with DNA or RNA, which provide recognition sites for the ligands to bind [17]. These aromatic residues play important roles in gene expression, damaged DNA repair, RNA cap binding [18]. The mutations on these aromatic residues are often found to be the underlying reasons for diseases caused by faulted protein-nucleotide recognition [19]. Although the π-π aromatic stacking in protein-ligand interactions has important physiological roles in organism, no review paper has covered this subject yet. Due to the abundance of stacking interactions between protein and small molecules, the current review will focus on the stacking interactions between proteins and macromolecular ligands. 2. AROMATIC INTERACTIONS PROTEIN COMPLEXES

IN

Similarly, stabilization of the dimer interface in UDPglucuronosyltransferase 2B7 is achieved by Phe187/Phe187 and Phe189/Phe189 aromatic stacking interactions along with two salt bridges (H183/E200) and two S-aromatic (face) interactions (F187/M198) [24]. The π -π stacking on the dimer interface is also observed in farnesyl pyrophosphate synthase (FPPs) and octaprenyl pyrophosphate synthase (OPPs). Crystal structures of Thermotoga maritima OPPs indicate Phe117 form a π -π stacking with Phe117 from the other chain in the dimer [25]. Likewise, in the structures of Escherichia coli FPPs, Phe-122 residues from the opposing chains stack on each other to connect the two protomers [26]. Also, the C-terminal domain (CTD) of HIV-1 integrase form dimer through stacking interaction between Trp243 on neighboring chain [27]. The structure of Trypanosoma brucei ornithine decarboxylase (TbODC) forms a twofold symmetric dimer. The TbODC monomer consists of two domains, namely, N-terminal domain and C-terminal domain. The two monomers complex with each other through aromatic interactions. The interface is formed by the interaction between N-terminal domain and C-terminal domain. A prominent feature at the dimer interface is the aromatic amino acid zipper (Fig. 1). This zipper structure plays a key role in dimerization as well as positioning pyridoxal 5’phosphate for catalysis [28]. Likewise, the phage λ cI repressor also forms dimer to distinguish DNA sequence specificity in operator [29]. The side chains of two symmetrically related Tyr88, although far from the DNA -binding site, stack with face-to-face at the dimer interface, which is vital for the dimer stability as well as the operator recognition.

PROTEIN-

It has been long known that the aromatic interactions contribute to the overall stability of protein molecules [20]. Recently, studies have demonstrated that the aromatic residues are also the stabilizing factors for the protein-protein complexes. The aromatic stacking interactions are formed between neighbouring proteins in the complexes and, therefore, provide the stabilizing forces. A good example of such interaction can be found in the glutathione transferases (GSTs) dimer interfaces [21]. On the surface of the GST from malaria vector Anopheles dirus, the Phe104 is exposed and forms a stacking interaction with Phe104 on the other protomer. Such interaction is strengthened by the nearby hydrophobic residues (Val107 and Leu103). These residues altogether form a lock-and-key motif on the dimer interface. This motif is not uncommon in other GST classes. In human GSTP1-1 (GST class Pi), the Try47 protrudes into the hydrophobic pocket formed by the Val90, Phe127, Leu130 from the neighbouring chain [22]. In the alpha-class GST hGSTA1-1, a lock-and-key motif is found at Phe52 and the residues surrounding it [23]. These motifs are proven to be important to the function of GSTs. The mutations on these residues diminish the enzyme activities, suggesting the importance of aromatic stacking in the function of GSTs.

Fig. (1). Representative diagram of aromatic stacking in dimer interfaces. The T. brucei ornithine decarboxylase (TbODC) dimer interface is stabilized by aromatic stacking between Tyr323Phe397, Tyr323'-Phe397', Tyr323-Tyr331', Tyr331-Tyr323'. The different monomers are colored as black or gray.

In addition, many signaling molecules contains modular domains which are responsible for protein-protein interactions [30]. The SH3 domains of epidermal growth factor receptor pathway substrate 8(Eps8) protein form an intertwined, domain-swapped dimer. The dimer interface is stabilized by aromatic stacking of Trp40 and Phe52 with their counterparts in the other monomer [31]. Also, the receptor tyrosine kinases are involved in lateral dimerization where aromatic residues are important in specific transmembrane

(TM) helix-helix interaction [32]. NMR structure revealed that the EphA2tm dimer is formed by intermolecular aromatic stacking between the side chains of two Phe556 and two Phe557 which plays key roles in TM helix packing [33]. Not only are the aromatic stacking interactions found in the dimers, they are also found in other protein complex forms, such as trimers and tetramers. The microsomal prostaglandin E synthase 1 (mPGES-1) is a homo-trimer with inter-helical interactions within each monomer formed by aromatic stacking [34]. A three-layered π - π stacking formed by helix 2 Phe82 (H2), helix 3 Phe103 and Phe106. The strong aromatic stacking is the main driving force for mPGES-1 subunit stability. Recently, Wang et al. found aromatic stacking at the interface of ketoreductase SiaM protein tetramer [35]. Phe123 and Tyr111 form a T-shaped aromatic stacking in the N- terminal interface, while Phe227 of one protomer form a parallel-displaced π-π stacking interaction with Phe227 of another protomer in the C-terminal interface (Fig. 2). In addition, Tyr235 inserts into the binding pocket in the opposite protomer and form a N-H/Pi interaction with an arginine residue nearby. Subsequent mutagenesis study confirmed the importance of aromatic stacking for maintaining the structural integrity and enzymatic activity of SiaM.

achieved through aromatic stacking between tryptophan and the aromatic residues in the antibody [37]. In addition, the aromatic stacking is found at the recognition sites of three human antibodies against influenza virus hemagglutinin (HA). Since the HA mediates the receptor binding and viral entry during influenza infection, such binding pattern could well serve as a guide for drug design [38]. AROMATIC STACKING 3. NANOPARTICLE COMPLEXES

IN

PROTEIN-

The aromatic stacking interactions play a significant role in the formation of protein-nanoparticle conjugate. The surface chemistry of CNTs and proteins includes several kinds of interactions such as π-π stacking, hydrophobic, surfactantlike and charge π interactions [39]. The aromatic residues of proteins are the main binding sites for CNTs. The side chains of these residues form strong π-π stacking with the surface of the CNT, thus, therefore, provides the main driving force for the protein/CNT binding [40]. The π−π stacking interactions of proteins with CNTs appear to be of substantial importance because of extended π conjugation on CNTs [41]. It has been reported that the adsorption of proteins into CNTs depends on the content of aromatic residues in the protein sequence. Proteins with many aromatic residues bind to CNTs easily through π -π stacking interactions [42]. Numerous studies indicate that aromatic residues bind to CNTs more sturdily than other amino acids. Among aromatic residues, Trp holds the highest affinity to CNTs, which is followed by Tyr and Phe [39, 43]. A study by Ge et al. demonstrated that human serum proteins interact with SWCNTs by a competitive binding pattern with different packing and adsorption capacity [44]. The competitive bindings of proteins-SWCNT significantly alter the pathways of cellular interaction. This alteration reduces cytotoxicity of protein-coated SWCNTs. It was also reported that the SWCNT can inhibit amyloid formation. The ring of single wall carbon nanotube (SWCNT) forms a parallel π stacking with the aromatic ring of F19 and indirectly inhibits the possible cross-strand stacking in Aβ assembly (Fig. 3) [45].

Fig. (2). The aromatic stacking in the interfaces of SiaM. The ketoreductase SiaM forms a tetramer in solution. Aromatic stacking interactions are found in both N-terminal and C-terminal interfaces. The Phe227 and Phe227' form a parallel aromatic stacking interaction (shown in box). The monomers are colored in black or gray. The aromatic residues are shown in sticks and the rest of the molecules are shown in ribbon.

A bioinformatics analysis of antibody/antigen complex in PDB database indicates that aromatic stacking is an important mechanism for antigen recognition. The aromatic residues in the antigens serve as the epitopes. The antibodies bind these epitopes with a deep cavity formed between L3 and H3 regions [36]. A good example can be found in the binding of PLY-5, a mouse monoclonal antibody, which recognizes the conserved undecapeptide tryptophan-rich loop (ECTGLAWEWWR) of bacterial cholesterol-dependent cytolysins (CDCs). It is predicted that the tryptophan-rich loop is exposed on the surface of CDCs and serve as the recognition site for antibody. The antibody/antigen binding is

4. AROMATIC INTERACTIONS NUCLEOTIDE COMPLEXES

IN

PROTEIN-

4.1. Protein-DNA Interactions Protein-DNA interactions are essential for all organisms to fulfill their functions [46]. Over the years, many efforts have been made to understand the binding of protein and DNA at atomic level. In the B-form DNA structure, the bases are shielded by the DNA backbones and not accessible from the outside. The DNA-protein aromatic stacking is often found in the places where the normal DNA structure is altered. Cleavage of the N-glycosidic bond is the most common lesion under physiological conditions and leaves a site without a base [47]. A recent study shows that, in KlenTaq, the Tyr671 protrudes into the DNA lesion through aromatic stacking with the base and guide the incoming NTP to repair the abasic site. In human alkyladenine DNA glycosylase, the key tyrosine residues can be substituted with tryptophan to initiate the base excision repair pathway, indicating the aromaticity is the key for its function [48]. Furthermore, the side

Fig. (3). The schematic representation of proposed inhibitory mechanism of Aβ (16-22) nucleation by SWCNT. The random coil amyloid chains aggregate through aromatic stacking (shown in box) and form beta-sheet-rich oligomers. SWCNT may block the self-assembly process by stacking on the aromatic residues and lead to the formation of disordered coil aggregate.

chains of aromatic residues of base flipping enzymes can form an aromatic stacking with flipped base in DNA [49]. The E. coli AlkA protein flips a 1-azaribose abasic nucleotide out of DNA by stacking of Trp272 side chain with alkylated bases [50]. Another study also suggested that Trp272 is important for substrate recognition and repair system of base-excision DNA repair proteins [51]. These results suggest prominent roles of aromatic stacking in damaged DNA recognition. A notable example of aromatic stacking is also observed in prokaryotic cold shock proteins (Csp) family [52]. Csp proteins stimulate the transcription of cold shock inducible genes and often initiate translation at low temperature by destabilizing non-productive secondary structures of nucleic acids [53]. Six highly conserved aromatic residues (Phe15, Phe17, Phe27, Phe30, Trp8, and His29) are found in a patch and they form the nucleic acid binding site [54]. Substitutions of these aromatic residues to non-aromatic residues abolish DNA binding affinity of Bacillus subtilis CspB [55]. E. coli CspA is a β-sheet protein of only 69 amino acids and it possesses a continuous nonpolar patch. It has been demonstrated that the aromatic cluster is required not only for protein function but also for protein stability [56]. Phadtare et al. observed that the mutations on the aromatic patch (Trp10, Phe17, Phe19, Phe30, His32, and Phe33) of E. coli CspE have no effect on RNA binding. Instead, the aromatic residues (Phe17, Phe30 and His32) are critical for DNA melting activity, cold acclimation of cells and transcription antiter-

mination [57]. A recent study also found the importance of aromatic stacking in protein-oligonucleotide interaction by analyzing crystal structure of Csp from Bacillus caldolyticus (Bc-Csp) with dT6 oligonucleotide [58]. The oligonucleotide-Bc-Csp interaction is dominated by aromatic stacking between the side chains of Trp8- T6, Phe17- T5 and Phe27T4 residues. Such aromatic stacking plays prominent roles for Bc-Csp’s function as RNA chaperones. Aromatic interactions also have a key role in DNA strand separation. Dda, a phage T4 helicase, plays a pivotal role in strand separation [59]. It was suggested that helicase translocation along single-stranded DNA (ssDNA) and DNA strand separation is coupled. X-ray crystallography of the DdassDNA binary complex reveals a domain called “pin”. Structural analysis of “pin” shows a Phe residue at active site that mediates a transient base-stacking interaction for the separation of dsDNA [60]. Besides, it has also been suggested that, Dda tightly couples translocation of helicase to strand separation site. The prominent roles of aromatic residues in strand separation have also been observed in a study by Kim JW et al. [61]. X-ray crystal structure of the HCV nonstructural protein 3 (NS3) helicase with (dU)8 oligonucleotide shows that Trp501 is involved in a base stacking interaction crucial for helicase activity. Mutations of the Trp501 to nonaromatic amino acids lead to a damage in RNA unwinding activity, although for DNA the activity remains unchanged. Molecular analysis of the enzyme-substrate complex reveals a π-facial hydrogen bond interaction between the Trp aro-

matic ring and ribose 2’-OH. Furthermore, this site was mutated to other aromatic residues. Interestingly, they found that the mutations fully restore the damaged RNA unwinding activity, indicating the crucial role of the aromatic ring for the RNA helicase function. Moreover, Yoshikawa and coworkers observed a specific role of aromatic residues in the recognition of Holliday junction in E. coli [62]. E. coli RuvC resolvase is an endonuclease that recognizes and cleaves Holliday junctions formed during homologous recombination and repair. Structural analysis by X-ray crystallography suggests that a Phe residue (Phe69) faces the catalytic center of the enzyme. In the interaction of RuvC with nucleotides, the aromatic ring of Phe69 contributes to the DNA binding by stacking with a nucleotide base, which leads to the disruption of base paring in the Holliday junction. Substitutions of Phe69 with other amino acids abolish the ability to cleave Holliday junction. The aromatic stacking is involved in the process of DNA mismatch repair. Several studies have shown close contact between aromatic residues and bases of mismatched DNA in E. coli [63]. The bacterial MutS protein binds with mismatch DNA. Two subunits namely, subunits A and B of the MutS homodimer, have distinctive roles in the binding. Drotschmann K and coworkers demonstrated that Phe337 and Glu339 of MutS are responsible for mismatched basespecific interactions [64]. But residues of both subunits asymmetrically contact the DNA backbone surrounding the mismatched base. This result is confirmed by a recent study that a Phe is stacked against thymine in G:T mismatch-bound MutS [65]. Similarly, a short patch repair protein Vsr from E. coli recognizes a TG mismatched base pair in damaged DNA [66]. Crystal structure of Vsr complexed with DNA reveals that Phe67, Trp68, and Trp86 intercalate between the TG mismatch and the adjacent AT base pair [67]. The side chain of Phe67 is stacked with the central AT base pair in the recognition sequence while Trp68 side chain stacks with the mispaired thymidine. Trp86 stacked with the two sequential sugar rings of the strand opposite the cleaved strand. In ssDNA, the bases are exposed and may form aromatic stacking with binding proteins. It was shown that, in E. coli single-strand DNA binding protein (EcoSSB), Trp54 stacks on the nucleotide bases [68]. Similar stacking interactions between single-strand DNA and its binding proteins are also found in eukaryotes [69]. In the ssDNAbinding domain of human replication protein A (RPA), a series of Phe residues (F238, F269, F386) are involved in the aromatic stacking with DNA bases. The heterotrimeric (ssDNA)-binding protein, RPA, contains four ssDNA binding domains within its subunits [70]. Two conserved aromatic residues of each domain contribute in high affinity binding of RPA with ssDNA. A rare case of stacking interaction can be found in the ssDNA-antibody complex [71]. Anti-DNA antibodies that bind to hairpin-forming DNA ligands are particularly important in the pathogenesis of autoimmune disease systemic lupus erythematosus (SLE) [72]. A recombinant anti-ssDNA Fab (DNA-1) binds with oligonucleotides by inserting thymine bases between Tyr residues (Fig. 4). The critical Tyr residues involved in the aromatic stacking interactions are located at positions 100 and 101 [73].

Fig. (4). An unusual aromatic stacking in anti-ssDNA Faboligonucleotide complex. Two aromatic residues, Tyr 100 and Tyr 101, sandwich thymine base and form a continuous stacking, which is thought to contribute to the stability of the complex. The sandwich stacking is shown in circle. The ssDNA is shown in black and the Fab is in gray.

4.2. Protein-RNA Interactions Protein-RNA complexes have important roles in many biological processes, including mRNA stabilization, translation, and ribosome formation [74]. It is well established that DNA and RNA behave differently with amino acids [75]. Most protein-DNA interactions involve phosphate atoms, while protein-RNA interactions frequently utilize ribose atoms and bases. As a result, aromatic residues take part in the ligand recognition of many RNA-recognizing proteins through stacking interactions [76]. These RNA-binding proteins have RNA recognition motif (RRM) which is also called RNA nucleoprotein (RNP) domain. The RRM is a ubiquitous RNA binding domain of 90-100 amino residues that form an antiparallel β-sheet held by a pair of alpha helices [77]. Different RRMs modify the basic structure with high specificity to recognize the diverse single-stranded RNA targets. Birney et al. found three highly conserved aromatic residues in RRM that interact with RNA [78]. Structural studies of many RRM-RNA complexes have demonstrated that aromatic residues of RRM form stacking interaction with nucleobase for base recognition and specific binding [79]. U1A is a spliceosome protein of U1 snRNP that contains two RRMs. In these two RRMs, only the Nterminal RRM domain can binds with RNA. U1A binds with stem-loop 2 of U1 snRNP and splice most eukaryotic premRNA. Two highly conserved aromatic residues in RRM, Tyr 13 and Phe56, are found in U1A. Several studies observed that Tyr 13 stacks with base C5 while Phe56 stacks with A6 base in the untranslated region (UTR) (Fig. 5) [79a, 80]. Nolan and coworkers substituted Phe56 with other aromatic residues, Leu, and Ala to assess the contribution of Phe56 to the stability of the U1A N-terminal RRM and stemloop 2 complex [81]. Replacements of Phe56 with other aromatic residues do not destabilize the complex. However, substitution of Phe56 with Ala and Leu resulted in a binding energy loss of 5.5 and 4 kcal/mol respectively. These results strongly suggest the importance of an aromatic residue at

position 56 for high affinity binding of U1A to RNA. Recently, Shiels et al. observed that substitution of Phe56 with Tyr or Trp increased the rate of complex dissociation, while substitution with His decreased the rate of association of the complex, implying an alteration in the initial complex formation [82]. Furthermore, substitution of Phe56 with Leu reduced U1A ability to correctly recognize stem-loop 2. These experiments suggest that Phe56 contributes to high affinity binding by stacking with base A6. Besides, Phe56 also participate in energetically coupled interactions networks that enables the conserved aromatic residue to play a significant role in recognizing target site.

Fig. (5). Stacking in the human U1A spliceosomal protein-RNA complex. The N-terminal RRM domain interacts with RNA by aromatic stacking. Two highly conserved aromatic residues, Tyr13 and Phe56, stack with base C5 and A6 respectively (shown in box), which plays an important role in the recognition of RNA molecules. In the box, the RNA molecule is colored in gray and the protein molecule is in black.

L5 is an example of ribosomal protein binding to eukaryotic 5S rRNA. The L5 is in complex with 5S rRNA and then stored in the cytoplasm in Xenopus. 5S rRNA binds to either the related zinc finger protein, p43 or transcription factor IIIA (TFIIIA) during the primary phases of oogenesis [83]. The L5 contains many aromatic residues, including 19 tyrosines that interact with RNA [84]. A study by DiNitto and coworkers showed the role of numerous conserved aromatic residues in the binding of L5 protein to 5S rRNA [85]. By comparative sequence analysis, some conserved aromatic residues, mostly tyrosines, were identified. Substitutions of Tyr86, Tyr99, or Tyr226 with alanine abolish RNA-binding activity. However, substitution with Phe at Tyr86 and Tyr226 does not change binding affinity. These results indicate that intermolecular aromatic π -stacking involving two Tyr residues, Tyr86 and Tyr226, are important for specific binding and the L5-5S rRNA complex formation. In addition, several aromatic stacking interactions are also found in the structures of other RNA-RNP complexes such as MS2 viral coat protein-operator RNA [86], anticodon interactions with tRNA synthetases [87], ribosomal protein L30autoregulatory RNA element [88] and λ N protein-boxB RNA etc [89].

Aromatic stacking is also participated in the mRNA deadenylation and degradation process by coupling TIS11d protein to the 3′untranslated region (3′UTR) class II AU-rich element (ARE). Aromatic residues in the tandem zinc finger (TZF) domain of the TIS11d recognize ARE and further stabilize the complex [90]. The TIS11d proteins contain a highly conserved TZF domain with several aromatic residues and two CCCH zinc finger motifs including a (R/K)YKTEL motif for the mRNA-binding activity. Substitution of the conserved aromatic residues abolishes the TZF domain’s RNA-binding activity [91]. Hudson et al. demonstrated that the interface between the TZF domain and the 5′UAUUUAUU-3′ ARE motif is dominated by several aromatic π stacking along with some hydrogen-bonding interactions [92]. The side chains of Tyr170 of finger 1 form a coplanar stacking with the bases of U8 and U9. Another aromatic amino acid, Phe176 intercalates between U6 and A7 bases. Furthermore, Tyr208 of finger 2 intercalates between U4 and U5 with its aromatic side ring while Phe214 stacked between U2 and A3. The intercalated aromatic residues are crucial for high-affinity binding of TZF with ARE. The continuous stacking of aromatic bases is usually found in the helical nucleotide structures. However, the sandwiched stacking between residues and bases, like those in the TZF/RNA complexes, can be found in other proteinnucleotide complexes. The mammalian RNA-binding protein, Musashi1 (Msi1), regulates the target mRNAs translation by unique stacking [93]. The N-terminal regions of mouse Msi1 and Msi2 contain two RNA binding domains (RBD1 and RBD2) [94]. The complex structure of Msi1 RBD1: r (GUAGU) shows that the adenine residues at third and guanine at the fourth positions are stacked on the conserved Phe23 and Phe65, respectively. Also an unusual sandwiched stacking is formed by stacking between Phe96, the third adenine and Phe23. Additionally, the nuclear cap-binding complex (CBC) specifically recognizes and binds to mRNA 5’ cap. This binding is crucial for a vast majority of mRNA metabolic events such as pre-mRNA splicing and U snRNA transport. It is also proved to be important for the protection of mRNA from degradation. The CBC-RNA interaction is mediated by several hydrogen bonds and the aromatic stacking of the Tyr side chains with the first two bases of the capped mRNA [95]. In addition, Worch et al. described the diverse role of Tyr in the human nuclear cap binding complex [96]. It appears that the Tyr residues stabilize the cap binding through stacking interactions. Likewise, in Trypanosoma brucei cap 2' RNA methyltransferase, two aromatic amino acids, Tyr18 and Tyr187 stack with the guanine ring of the cap [97]. The removal of these aromatic residues abolishes cap-specific RNA-binding activity. 5. AROMATIC STACKING AND PATHOGENESIS OF DISEASES The aromatic stacking is a contributing factor for many severe diseases. The nucleotide excision repair (NER) is an important step in DNA repair process, and it is found in almost all eukaryotes [98]. Thus, any defect in the NER system can cause serious disorders in human body such as Cockayne syndrome, Xeroderma pigmentosum and trichothiodystrophy [99]. Several proteins can carry out in

Fig. (6). The 3D representation of the XPF homodimer. The monomers are colored in black or gray. The aromatic stacking interactions are formed between the Phe168 and Phe119 of one monomer and Phe19 and Phe68 from the second monomer of XPF homodimer (shown in circle).

vitro NER for damaged DNA, many of these acts through a dimeric complex, such as XPF-XPF homodimer, XPFERCC1 and archeal XPF homodimer. The available NMR structure of XPF homodimer presents a twofold symmetry. Due to the presence of several aromatic residues (Phe19, Phe68, Phe119, Phe168), the π-π interactions are formed on the interfaces of XPF-XPF homodimer which contribute toward the stability of this complex (Fig. 6) [100]. Meanwhile, β-amyloid peptide (Aβ) aggregation is the most acute pathological hallmark in Alzheimer’s disease [101]. The basic components of the amyloid are thin fibrils of a protein termed Aβ. Several studies have demonstrated that phenylalanine residues play a crucial role in the formation and selfassembly of amyloid fibril [102]. The stacking interactions between these residues are proven to be vital in Alzheimer’s disease [103]. Because of the importance of aromatic residues in fibril formation, researchers have attempted to block the self-assembly process of Aβ by breaking the stacking interactions between each subunit. The chemicals that can block π-stacking interactions may be the potential drugs for the control of Alzheimer’s disease [104]. Protein misfolding is one of the main reasons for neurodegenerative diseases, such as Alzheimer’s disease [105]. It is well established that protein folding is controlled by several inter- and intramolecular noncovalent interactions such as aromatic stacking. Aromatic residues stabilize local structure by stacking between near sequences while tertiary structure is formed by aromatic stacking between far sequences [106]. Research has been conducted to understand the atomic details of stacking interactions in misfolded proteins.

tion features/elements or structural elements (MoRFs/ MoREs) [109]. Recently, it has been documented that aromatic residues occur at lower frequencies in IDPs [110]. However, Mohan et al. suggests the presence of at least one aromatic amino acid in most MoRFs region [111]. A recent study of 77 protein-protein complexes of IDPs shows that 40% of the complexes contain at least two aromatic residues in stacking state [112]. These aromatic amino acids link binding and function of IDPs.

On the other hand, intrinsically disordered proteins (IDPs) are associated with many human diseases, including cancer, cardiovascular disease and neurodegenerative diseases [107]. IDPs lack well-defined structures of proteins and their conformations can fluctuate over time. Besides, they have the binding sites which can recognize multiple partners through adoption of different conformations [108]. These common binding sites are called molecular recogni-

ABBREVIATIONS

CONCLUSION The aromatic stacking interactions are common and diverse in nature. It is one of the main forces for the stabilization in the complexes of proteins and macromolecular ligands. In protein-protein complexes, the aromatic stacking can be found in the interfaces of different oligomers, such as dimer, trimer and tetramer. In protein-nucleotide complexes, the stacking interactions not only play a role in the recognition of binding site but also crucial in stabilizing the complex. Recently, in the emerging field of nanotechnology, the aromatic stacking has proven its value in the formation of protein-nanoparticle conjugate. Due to the importance of aromatic stacking, it is often involved in the pathogenesis of a variety of diseases, such as Alzheimer’s disease and cancer. Therefore, the study of aromatic stacking interactions in protein-ligand complexes will help researchers to understand the mechanisms behind many diseases and assist the design of new medicines.



= β-amyloid peptide

CBC

= Cap-binding complex

Csp

= Cold shock proteins

Eps8

= Epidermal growth factor receptor pathway substrate 8

F (Phe)

= Phenylalanine

FPPs

= Farnesyl pyrophosphate synthase

Glu

= Glutamic acid

GSTs

= Glutathione transferases

H (His)

= Histidine

IDPs

= Intrinsically disordered proteins

MoRFs

= Molecular recognition features

mPGES-1 = Microsomal prostaglandin E synthase 1 NER

= Nucleotide excision repair

OPPs

= Octaprenyl pyrophosphate synthase

RRM

= RNA recognition motif

RRM

= RNA recognition motif (RRM)

SWCNT

= Single wall carbon nanotube

TZF

= Tandem zinc finger

W (Trp)

= Tryptophan

Y (Tyr)

= Tyrosine

[8]

[9] [10]

[11] [12]

[13]

[14]

CONFLICT OF INTEREST

[15]

The authors confirm that this article content has no conflict of interest.

[16]

ACKNOWLEDGEMENTS This work is financially supported by National Key Basic Research Program of China (No. 2013CB933900).

[17]

REFERENCES [1] [2]

[3]

[4]

[5] [6]

[7]

Schleyer, P.v.R. Introduction: Aromaticity. Chemical Rev., 2001, 101(5), 1115-1118. (a) Balaban, A.T.; Schleyer, P.; Rzepa, H.S. Crocker, not Armit and Robinson, begat the six aromatic electrons. Chem Rev., 2005, 105(10), 3436-3447; (b) Schleyer, P.v.R. Introduction: DelocalizationPi and Sigma. Chemical Rev., 2005, 105(10), 3433-3435. Mrozek, A.; Karolak-Wojciechowska, J.; Kieć-Kononowicz, K. Five-membered heterocycles. Part III. Aromaticity of 1,3-imidazole in 5+n hetero-bicyclic molecules. J. Mol. Struct., 2003, 655(3), 397-403. Sinnokrot, M.O.; Valeev, E.F.; Sherrill, C.D. Estimates of the ab initio limit for pi-pi interactions: the benzene dimer. J. Am. Chem. Soc., 2002, 124(36), 10887-10893. Burley, S.K.; Petsko, G.A. Aromatic-aromatic interaction: a mechanism of protein structure stabilization. Science, 1985, 229(4708), 23-28. Riley, K.E.; Hobza, P. On the importance and origin of aromatic interactions in chemistry and biodisciplines. Acc. Chem. Res., 2013, 46(4), 927-936. (a) Zhang, Z.; Chen, H.; Bai, H.; Lai, L. Molecular dynamics simulations on the oligomer-formation process of the GNNQQNY peptide from yeast prion protein Sup35. Biophys. J., 2007, 93(5), 14841492; (b) Melet, A.; Assrir, N.; Jean, P.; Pilar Lopez-Garcia, M.; Marques-Soares, C.; Jaouen, M.; Dansette, P.M.; Sari, M.A.; Mansuy, D. Substrate selectivity of human cytochrome P450 2C9: importance of residues 476, 365, and 114 in recognition of diclofenac and sulfaphenazole and in mechanism-based inactivation by tienilic acid. Arch. Biochem. Biophys., 2003, 409(1), 80-91; (c) Chen, J.; Seebohm, G.; Sanguinetti, M.C. Position of aromatic residues in the S6 domain, not inactivation, dictates cisapride sensitivity of HERG and eag potassium channels. Proc. Nat. Acad. Sci. U. S. A., 2002, 99(19), 12461-12466.

[18]

[19]

[20]

[21]

Hunter, C.A.; Singh, J.; Thornton, J.M. Pi-pi interactions: the geometry and energetics of phenylalanine-phenylalanine interactions in proteins. J. Mol. Biol., 1991, 218(4), 837-846. Bhattacharyya, R.; Samanta, U.; Chakrabarti, P. Aromatic-aromatic interactions in and around alpha-helices. Protein Eng., 2002, 15(2), 91-100. Samanta, U.; Pal, D.; Chakrabarti, P. Packing of aromatic rings against tryptophan residues in proteins. Acta Crystallogr. D Biol. Crystallogr., 1999, 55(Pt 8), 1421-1427. Loewenthal, R.; Sancho, J.; Fersht, A.R. Histidine-aromatic interactions in barnase. Elevation of histidine pKa and contribution to protein stability. J. Mol. Biol., 1992, 224(3), 759-770. Hill, R.B.; DeGrado, W.F. Solution Structure of α2D, a Nativelike de Novo Designed Protein. J. Am. Chem. Soc., 1998, 120(6), 11381145. (a) Lynch, I.; Salvati, A.; Dawson, K.A. Protein-nanoparticle interactions: What does the cell see? Nature Nanotechnol., 2009, 4(9), 546-547; (b) Huang, R.; Carney, R.P.; Stellacci, F.; Lau, B.L. Protein-nanoparticle interactions: the effects of surface compositional and structural heterogeneity are scale dependent. Nanoscale, 2013, 5(15), 6928-6935; (c) Okoli, C.; Sengottaiyan, S.; Arul Murugan, N.; Pavankumar, A.R.; Agren, H.; Kuttuva Rajarao, G. In silico modeling and experimental evidence of coagulant protein interaction with precursors for nanoparticle functionalization. J. Biomol. Struct. & Dynam., 2013, 31(10), 1182-1190. Ghule, A.V.; Kathir, K.M.; Suresh Kumar, T.K.; Tzing, S.-H.; Chang, J.-Y.; Yu, C.; Ling, Y.-C. Carbon nanotubes prevent 2,2,2 trifluoroethanol induced aggregation of protein. Carbon, 2007, 45(7), 1586-1589. Xie, L.; Luo, Y.; Lin, D.; Xi, W.; Yang, X.; Wei, G. The molecular mechanism of fullerene-inhibited aggregation of Alzheimer's betaamyloid peptide fragment. Nanoscale, 2014, 6(16), 9752-9762. Kim, S.N.; Kuang, Z.; Slocik, J.M.; Jones, S.E.; Cui, Y.; Farmer, B.L.; McAlpine, M.C.; Naik, R.R. Preferential binding of peptides to graphene edges and planes. J. Am. Chem. Soc., 2011, 133(37), 14480-14483. (a) Kranz, J.K.; Lu, J.; Hall, K.B. Contribution of the tyrosines to the structure and function of the human U1A N-terminal RNA binding domain. Protein Sci.,1996, 5(8), 1567-1583; (b) Chevaillier, P.; Chirat, F.; Sautiere, P. The amino acid sequence of the ram spermatidal protein 3--a transition protein TP3 or TP4? Eur. J. Biochem., / FEBS, 1998, 258(2), 460-464; (c) Brun, F.; Toulme, J.J.; Helene, C. Interactions of aromatic residues of proteins with nucleic acids. Fluorescence studies of the binding of oligopeptides containing tryptophan and tyrosine residues to polynucleotides. Biochemistry, 1975, 14(3), 558-563; (d) Durand, M.; Maurizot, J.C.; Borazan, H.N.; Helene, C. Interaction of aromatic residues of proteins with nucleic acids. Circular dichroism studies of the binding of oligopeptides to poly(adenylic acid). Biochemistry, 1975, 14(3), 563-570. Hu, G.; Gershon, P.D.; Hodel, A.E.; Quiocho, F.A. mRNA cap recognition: dominant role of enhanced stacking interactions between methylated bases and protein aromatic side chains. Proc. Nat. Acad. Sci. U. S. A., 1999, 96(13), 7149-7154. (a) Lam, W.C.; Maki, A.H.; Casas-Finet, J.R.; Erickson, J.W.; Sowder, R.C., 2nd; Henderson, L.E. Evidence for stacking interactions between 5-mercurated polyuridylic acid and HIV-1 p7 nucleocapsid protein obtained by phosphorescence and optically detected magnetic resonance (ODMR). FEBS Lett., 1993, 328(1-2), 45-48; (b) Mely, Y.; Piemont, E.; Sorinas-Jimeno, M.; de Rocquigny, H.; Jullian, N.; Morellet, N.; Roques, B.P.; Gerard, D. Structural and dynamic characterization of the aromatic amino acids of the human immunodeficiency virus type I nucleocapsid protein zinc fingers and their involvement in heterologous tRNA(Phe) binding: a steady-state and time-resolved fluorescence study. Biophys. J., 1993, 65(4), 1513-1522. (a) Burley, S.K.; Petsko, G.A. Weakly polar interactions in proteins. Advances in protein chemistry, 1988, 39, 125-189; (b) Samanta, U.; Pal, D.; Chakrabarti, P. Environment of tryptophan side chains in proteins. Proteins, 2000, 38(3), 288-300. (a) Mannervik, B.; Jensson, H. Binary combinations of four protein subunits with different catalytic specificities explain the relationship between six basic glutathione S-transferases in rat liver cytosol. J. Biol. Chem., 1982, 257(17), 9909-9912; (b) Mannervik, B.; Awasthi, Y.C.; Board, P.G.; Hayes, J.D.; Di Ilio, C.; Ketterer, B.; Listowsky, I.; Morgenstern, R.; Muramatsu, M.; Pearson, W.R.; et

[22]

[23]

[24]

[25]

[26]

[27] [28]

[29]

[30] [31]

[32]

[33]

[34]

[35]

[36] [37]

al. Nomenclature for human glutathione transferases. Biochem. J., 1992, 282(Pt 1), 305-306; (c) Wongsantichon, J.; Ketterman, A.J. An intersubunit lock-and-key 'clasp' motif in the dimer interface of Delta class glutathione transferase. Biochem. J., 2006, 394(Pt 1), 135-144. Reinemer, P.; Dirr, H.W.; Ladenstein, R.; Huber, R.; Lo Bello, M.; Federici, G.; Parker, M.W. Three-dimensional structure of class pi glutathione S-transferase from human placenta in complex with Shexylglutathione at 2.8 A resolution. J. Mol. Biol., 1992, 227(1), 214-226. Grahn, E.; Novotny, M.; Jakobsson, E.; Gustafsson, A.; Grehn, L.; Olin, B.; Madsen, D.; Wahlberg, M.; Mannervik, B.; Kleywegt, G.J. New crystal structures of human glutathione transferase A1-1 shed light on glutathione binding and the conformation of the Cterminal helix. Acta Crystallogr. D Biol. Crystallogr., 2006, 62(Pt 2), 197-207. Lewis, B.C.; Mackenzie, P.I.; Miners, J.O. Homodimerization of UDP-glucuronosyltransferase 2B7 (UGT2B7) and identification of a putative dimerization domain by protein homology modeling. Biochem. Pharmacol., 2011, 82(12), 2016-2023. Guo, R.T.; Kuo, C.J.; Chou, C.C.; Ko, T.P.; Shr, H.L.; Liang, P.H.; Wang, A.H. Crystal structure of octaprenyl pyrophosphate synthase from hyperthermophilic Thermotoga maritima and mechanism of product chain length determination. J. Biol. Chem., 2004, 279(6), 4903-4912. Hosfield, D.J.; Zhang, Y.; Dougan, D.R.; Broun, A.; Tari, L.W.; Swanson, R.V.; Finn, J. Structural basis for bisphosphonatemediated inhibition of isoprenoid biosynthesis. J. Biol. Chem., 2004, 279(10), 8526-8529. Sangeetha, B.; Muthukumaran, R.; Amutha, R. Structural dynamics of native and V260E mutant C-terminal domain of HIV-1 integrase. J. Comput. Aided Mol. Des., 2015, 29(4), 371-385. Grishin, N.V.; Osterman, A.L.; Brooks, H.B.; Phillips, M.A.; Goldsmith, E.J. X-ray structure of ornithine decarboxylase from Trypanosoma brucei: the native structure and the structure in complex with alpha-difluoromethylornithine. Biochemistry, 1999, 38(46), 15174-15184. Huang, Y.T.; Rusinova, E.; Ross, J.B.; Senear, D.F. An aromatic stacking interaction between subunits helps mediate DNA sequence specificity: operator site discrimination by phage lambda cI repressor. J. Mol. Biol., 1997, 267(2), 403-417. Bork, P.; Schultz, J.; Ponting, C.P. Cytoplasmic signalling domains: the next generation. Trends Biochem. Sci., 1997, 22(8), 296298. Kishan, K.V.; Newcomer, M.E.; Rhodes, T.H.; Guilliot, S.D. Effect of pH and salt bridges on structural assembly: molecular structures of the monomer and intertwined dimer of the Eps8 SH3 domain. Protein Sci., 2001, 10(5), 1046-1055. Mineev, K.S.; Khabibullina, N.F.; Lyukmanova, E.N.; Dolgikh, D.A.; Kirpichnikov, M.P.; Arseniev, A.S. Spatial structure and dimer--monomer equilibrium of the ErbB3 transmembrane domain in DPC micelles. Biochim. Biophys. Acta, 2011, 1808(8), 2081-2088. Bocharov, E.V.; Mayzel, M.L.; Volynsky, P.E.; Mineev, K.S.; Tkach, E.N.; Ermolyuk, Y.S.; Schulga, A.A.; Efremov, R.G.; Arseniev, A.S. Left-handed dimer of EphA2 transmembrane domain: Helix packing diversity among receptor tyrosine kinases. Biophys. J., 2010, 98(5), 881-889. Xing, L.; Kurumbail, R.G.; Frazier, R.B.; Davies, M.S.; Fujiwara, H.; Weinberg, R.A.; Gierse, J.K.; Caspers, N.; Carter, J.S.; McDonald, J.J.; Moore, W.M.; Vazquez, M.L. Homo-timeric structural model of human microsomal prostaglandin E synthase-1 and characterization of its substrate/inhibitor binding interactions. J. Comput. Aided Mol. Des., 2009, 23(1), 13-24. Wang, H.; Zhang, H.; Zou, Y.; Mi, Y.; Lin, S.; Xie, Z.; Yan, Y.; Zhang, H. Structural insight into the tetramerization of an iterative ketoreductase siam through aromatic residues in the interfaces. PloS one, 2014, 9(6), e97996. Arzhanik, V.; Svistunova, D.; Koliasnikov, O.; Egorov, A.M. Interaction of antibodies with aromatic ligands: the role of pistacking. J. Bioinform. Comput. Biol., 2010, 8(3), 471-483. Gonzalez-Menendez, P.; Garcia-Ocana, M.; de los Toyos, J.R. A deeper analysis of the epitope/paratope of PLY-5, a mouse monoclonal antibody which recognises the conserved undecapeptide tryptophan-rich loop (ECTGLAWEWWR) of bacterial cholesteroldependent cytolysins. Biochem. Biophys. Res. Commun., 2013, 430(1), 14-19.

[38]

[39]

[40]

[41] [42]

[43]

[44]

[45] [46]

[47]

[48]

[49] [50]

[51] [52]

[53] [54]

[55]

[56]

[57]

Xu, R.; Krause, J.C.; McBride, R.; Paulson, J.C.; Crowe, J.E., Jr.; Wilson, I.A. A recurring motif for antibody recognition of the receptor-binding site of influenza hemagglutinin. Nat. Struct. Mol. Biol., 2013, 20(3), 363-370. Calvaresi, M.; Zerbetto, F. The devil and holy water: protein and carbon nanotube hybrids. Acc. Chem. Res., 2013, 46(11), 24542463. Zuo, G.; Kang, S.G.; Xiu, P.; Zhao, Y.; Zhou, R. Interactions between proteins and carbon-based nanoparticles: exploring the origin of nanotoxicity at the molecular level. Small, 2013, 9(9-10), 1546-1556. Umadevi, D.; Panigrahi, S.; Sastry, G.N. Noncovalent interaction of carbon nanostructures. Acc. Chem. Res., 2014, 47(8), 2574-2581. Wang, S.; Humphreys, E.S.; Chung, S.Y.; Delduco, D.F.; Lustig, S.R.; Wang, H.; Parker, K.N.; Rizzo, N.W.; Subramoney, S.; Chiang, Y.M.; Jagota, A. Peptides with selective affinity for carbon nanotubes. Nat Mater., 2003, 2(3), 196-200. Xie, H.; Becraft, E.J.; Baughman, R.H.; Dalton, A.B.; Dieckmann, G.R. Ranking the affinity of aromatic residues for carbon nanotubes by using designed surfactant peptides. J Pept Sci. 2008, 14(2), 139-151. Ge, C.; Du, J.; Zhao, L.; Wang, L.; Liu, Y.; Li, D.; Yang, Y.; Zhou, R.; Zhao, Y.; Chai, Z.; Chen, C. Binding of blood proteins to carbon nanotubes reduces cytotoxicity. Proc. Natl. Acad. Sci. U. S. A., 2011, 108(41), 16968-16973. Li, H.; Luo, Y.; Derreumaux, P.; Wei, G. Carbon nanotube inhibits the formation of beta-sheet-rich oligomers of the Alzheimer's amyloid-beta(16-22) peptide. Biophys. J., 2011, 101(9), 2267-2276. (a) Gosavi, R.A.; Moon, A.F.; Kunkel, T.A.; Pedersen, L.C.; Bebenek, K. The catalytic cycle for ribonucleotide incorporation by human DNA Pol lambda. Nucleic Acids Res., 2012, 40(15), 75187527; (b) Jain, S.; Mandal, R.S.; Anand, S.; Maiti, S.; Ramachandran, S. Probing the amino acids critical for protein oligomerisation and protein-nucleotide interaction in Mycobacterium tuberculosis PII protein through integration of computational and experimental approaches. Biochim. Biophys. Acta, 2013, 1834(12), 2736-2749. Obeid, S.; Welte, W.; Diederichs, K.; Marx, A. Amino acid templating mechanisms in selection of nucleotides opposite abasic sites by a family a DNA polymerase. J. Biol. Chem., 2012, 287(17), 14099-14108. Hendershot, J.M.; Wolfe, A.E.; O'Brien, P.J. Substitution of active site tyrosines with tryptophan alters the free energy for nucleotide flipping by human alkyladenine DNA glycosylase. Biochemistry, 2011, 50(11), 1864-1874. Bischerour, J.; Chalmers, R. Base flipping in tn10 transposition: an active flip and capture mechanism. PloS one, 2009, 4(7), e6201. Hollis, T.; Ichikawa, Y.; Ellenberger, T. DNA bending and a flipout mechanism for base excision by the helix-hairpin-helix DNA glycosylase, Escherichia coli AlkA. EMBO J., 2000, 19(4), 758766. Fujii, S.; Yamagata, Y. Structural bases for substrate recognition and repair system of base-excision DNA repair proteins. Nucleic Acids Symp. Ser., 2000, (44), 57-58. Schindelin, H.; Jiang, W.; Inouye, M.; Heinemann, U. Crystal structure of CspA, the major cold shock protein of Escherichia coli. Proc. Natl. Acad. Sci. U. S. A., 1994, 91(11), 5119-5123. Jiang, W.; Hou, Y.; Inouye, M. CspA, the major cold-shock protein of Escherichia coli, is an RNA chaperone. J. Biol. Chem, 1997, 272(1), 196-202. Max, K.E.A.; Zeeb, M.; Bienert, R.; Balbach, J.; Heinemann, U. Trich DNA Single Strands Bind to a Preformed Site on the Bacterial Cold Shock Protein Bs-CspB. J. Mol. Biol., 2006, 360(3), 702-714. Schroder, K.; Graumann, P.; Schnuchel, A.; Holak, T.A.; Marahiel, M.A. Mutational analysis of the putative nucleic acid-binding surface of the cold-shock domain, CspB, revealed an essential role of aromatic and basic residues in binding of single-stranded DNA containing the Y-box motif. Mol. Microbiol., 1995, 16(4), 699-708. Hillier, B.J.; Rodriguez, H.M.; Gregoret, L.M. Coupling protein stability and protein function in Escherichia coli CspA. Fold. Des., 1998, 3(2), 87-93. Phadtare, S.; Tyagi, S.; Inouye, M.; Severinov, K. Three amino acids in Escherichia coli CspE surface-exposed aromatic patch are critical for nucleic acid melting activity leading to transcription antitermination and cold acclimation of cells. J. Biol. Chem, 2002, 277(48), 46706-46711.

[58]

[59]

[60]

[61]

[62]

[63]

[64]

[65]

[66] [67]

[68]

[69]

[70]

[71] [72]

Max, K.E.; Zeeb, M.; Bienert, R.; Balbach, J.; Heinemann, U. Common mode of DNA binding to cold shock domains. Crystal structure of hexathymidine bound to the domain-swapped form of a major cold shock protein from Bacillus caldolyticus. FEBS J., 2007, 274(5), 1265-1279. Nanduri, B.; Byrd, A.K.; Eoff, R.L.; Tackett, A.J.; Raney, K.D. Pre-steady-state DNA unwinding by bacteriophage T4 Dda helicase reveals a monomeric molecular motor. Proc. Natl. Acad. Sci. U. S. A., 2002, 99(23), 14722-14727. He, X.; Byrd, A.K.; Yun, M.K.; Pemble, C.W.T.; Harrison, D.; Yeruva, L.; Dahl, C.; Kreuzer, K.N.; Raney, K.D.; White, S.W. The T4 phage SF1B helicase Dda is structurally optimized to perform DNA strand separation. Structure (London, England : 1993), 2012, 20(7), 1189-1200. Kim, J.W.; Seo, M.Y.; Shelat, A.; Kim, C.S.; Kwon, T.W.; Lu, H.H.; Moustakas, D.T.; Sun, J.; Han, J.H. Structurally conserved amino Acid w501 is required for RNA helicase activity but is not essential for DNA helicase activity of hepatitis C virus NS3 protein. J. Virol., 2003, 77(1), 571-582. Yoshikawa, M.; Iwasaki, H.; Shinagawa, H. Evidence that phenylalanine 69 in Escherichia coli RuvC resolvase forms a stacking interaction during binding and destabilization of a Holliday junction DNA substrate. J. Biol. Chem., 2001, 276(13), 10432-10436. (a) Dufner, P.; Marra, G.; Raschle, M.; Jiricny, J. Mismatch recognition and DNA-dependent stimulation of the ATPase activity of hMutSalpha is abolished by a single mutation in the hMSH6 subunit. J. Biol. Chem., 2000, 275(47), 36550-36555; (b) Hughes, M.J.; Jiricny, J. The purification of a human mismatch-binding protein and identification of its associated ATPase and helicase activities. J. Biol. Chem., 1992, 267(33), 23876-23882; (c) Jiricny, J.; Hughes, M.; Corman, N.; Rudkin, B.B. A human 200-kDa protein binds selectively to DNA fragments containing G.T mismatches. Proc. Natl. Acad. Sci. U. S. A., 1988, 85(23), 8860-8864; (d) Yamamoto, A.; Schofield, M.J.; Biswas, I.; Hsieh, P. Requirement for Phe36 for DNA binding and mismatch repair by Escherichia coli MutS protein. Nucleic Acids Res., 2000, 28(18), 3564-3569. Drotschmann, K.; Yang, W.; Brownewell, F.E.; Kool, E.T.; Kunkel, T.A. Asymmetric recognition of DNA local distortion. Structure-based functional studies of eukaryotic Msh2-Msh6. J. Biol. Chem., 2001, 276(49), 46225-46229. Cristovao, M.; Sisamakis, E.; Hingorani, M.M.; Marx, A.D.; Jung, C.P.; Rothwell, P.J.; Seidel, C.A.; Friedhoff, P. Single-molecule multiparameter fluorescence spectroscopy reveals directional MutS binding to mismatched bases in DNA. Nucleic Acids Res., 2012, 40(12), 5448-5464. Tsutakawa, S.E.; Jingami, H.; Morikawa, K. Recognition of a TG mismatch: the crystal structure of very short patch repair endonuclease in complex with a DNA duplex. Cell, 1999, 99(6), 615-623. Morikawa, K.; Shirakawa, M. Three-dimensional structural views of damaged-DNA recognition: T4 endonuclease V, E. coli Vsr protein, and human nucleotide excision repair factor XPA. Mut. Res., 2000, 460(3-4), 257-275. Khamis, M.I.; Casas-Finet, J.R.; Maki, A.H.; Murphy, J.B.; Chase, J.W. Role of tryptophan 54 in the binding of E. coli single-stranded DNA-binding protein to single-stranded polynucleotides. FEBS Lett., 1987, 211(2), 155-159. (a) Doneanu, C.E.; Gafken, P.R.; Bennett, S.E.; Barofsky, D.F. Mass spectrometry of UV-cross-linked protein-nucleic acid complexes: identification of amino acid residues in the single-stranded DNA-binding domain of human replication protein A. Anal. Chem., 2004, 76(19), 5667-5676; (b) Gagliardi, A.; Mullin, N.P.; Ying Tan, Z.; Colby, D.; Kousa, A.I.; Halbritter, F.; Weiss, J.T.; Felker, A.; Bezstarosti, K.; Favaro, R.; Demmers, J.; Nicolis, S.K.; Tomlinson, S.R.; Poot, R.A.; Chambers, I. A direct physical interaction between Nanog and Sox2 regulates embryonic stem cell selfrenewal. EMBO J., 2013, 32(16), 2231-2247. Bastin-Shanower, S.A.; Brill, S.J. Functional analysis of the four DNA binding domains of replication protein A. The role of RPA2 in ssDNA binding. J. Biol. Chem., 2001, 276(39), 36446-36453. Ou, Z.; Bottoms, C.A.; Henzl, M.T.; Tanner, J.J. Impact of DNA hairpin folding energetics on antibody-ssDNA association. J. Mol. Biol., 2007, 374(4), 1029-1040. Blatt, N.B.; Glick, G.D. Anti-DNA autoantibodies and systemic lupus erythematosus. Pharmacol Ther., 1999, 83(2), 125-139.

[73]

[74]

[75]

[76]

[77]

[78]

[79]

[80]

[81]

[82]

[83] [84]

[85] [86]

[87] [88]

[89]

Tanner, J.J.; Komissarov, A.A.; Deutscher, S.L. Crystal structure of an antigen-binding fragment bound to single-stranded DNA. J. Mol. Biol., 2001, 314(4), 807-822. (a) Sutch, B.T.; Chambers, E.J.; Bayramyan, M.Z.; Gallaher, T.K.; Haworth, I.S. Similarity of protein-RNA interfaces based on motif analysis. J. Chem. Inf. Model, 2009, 49(9), 2139-2146; (b) Koh, Y.Y.; Wang, Y.; Qiu, C.; Opperman, L.; Gross, L.; Tanaka Hall, T.M.; Wickens, M. Stacking interactions in PUF-RNA complexes. RNA (New York, N.Y.), 2011, 17(4), 718-727; (c) Grossoehme, N.E.; Li, L.; Keane, S.C.; Liu, P.; Dann, C.E., 3rd; Leibowitz, J.L.; Giedroc, D.P. Coronavirus N protein N-terminal domain (NTD) specifically binds the transcriptional regulatory sequence (TRS) and melts TRS-cTRS RNA duplexes. J. Mol. Biol., 2009, 394(3), 544-557. Lejeune, D.; Delsaux, N.; Charloteaux, B.; Thomas, A.; Brasseur, R. Protein-nucleic acid recognition: statistical analysis of atomic interactions and influence of DNA structure. Proteins, 2005, 61(2), 258-271. (a) Worch, R.; Stolarski, R. Stacking efficiency and flexibility analysis of aromatic amino acids in cap-binding proteins. Proteins, 2008, 71(4), 2026-2037; (b) Hu, G.; Oguro, A.; Li, C.; Gershon, P.D.; Quiocho, F.A. The "cap-binding slot" of an mRNA capbinding protein: quantitative effects of aromatic side chain choice in the double-stacking sandwich with cap. Biochemistry, 2002, 41(24), 7677-7687; (c) Hsu, P.C.; Hodel, M.R.; Thomas, J.W.; Taylor, L.J.; Hagedorn, C.H.; Hodel, A.E. Structural requirements for the specific recognition of an m7G mRNA cap. Biochemistry, 2000, 39(45), 13730-13736. Burd, C.G.; Dreyfuss, G. Conserved structures and diversity of functions of RNA-binding proteins. Science (New York, N.Y.), 1994, 265(5172), 615-621. Birney, E.; Kumar, S.; Krainer, A.R. Analysis of the RNArecognition motif and RS and RGG domains: conservation in metazoan pre-mRNA splicing factors. Nucleic Acids Res., 1993, 21(25), 5803-5816. (a) Oubridge, C.; Ito, N.; Evans, P.R.; Teo, C.H.; Nagai, K. Crystal structure at 1.92 A resolution of the RNA-binding domain of the U1A spliceosomal protein complexed with an RNA hairpin. Nature, 1994, 372(6505), 432-438; (b) Allain, F.H.; Howe, P.W.; Neuhaus, D.; Varani, G. Structural basis of the RNA-binding specificity of human U1A protein. EMBO J., 1997, 16(18), 5764-5772. Varani, L.; Gunderson, S.I.; Mattaj, I.W.; Kay, L.E.; Neuhaus, D.; Varani, G. The NMR structure of the 38 kDa U1A protein - PIE RNA complex reveals the basis of cooperativity in regulation of polyadenylation by human U1A protein. Nat. Struct. Biol., 2000, 7(4), 329-335. Nolan, S.J.; Shiels, J.C.; Tuite, J.B.; Cecere, K.L.; Baranger, A.M.J. Recognition of an essential adenine at a protein-RNA interface: comparison of the contributions of hydrogen bonds and a stacking interaction. J. Am. Chem. Soc., 1999, 121, 8951-8952. Shiels, J.C.; Tuite, J.B.; Nolan, S.J.; Baranger, A.M. Investigation of a conserved stacking interaction in target site recognition by the U1A protein. Nucleic Acids Res., 2002, 30(2), 550-558. Joho, K.E.; Darby, M.K.; Crawford, E.T.; Brown, D.D. A finger protein structurally similar to TFIIIA that binds exclusively to 5S RNA in Xenopus. Cell, 1990, 61(2), 293-300. Scripture, J.B.; Huber, P.W. Binding site for Xenopus ribosomal protein L5 and accompanying structural changes in 5S rRNA. Biochemistry, 2011, 50(18), 3827-3839. DiNitto, J.P.; Huber, P.W. A role for aromatic amino acids in the binding of Xenopus ribosomal protein L5 to 5S rRNA. Biochemistry, 2001, 40(42), 12645-12653. Valegard, K.; Murray, J.B.; Stockley, P.G.; Stonehouse, N.J.; Liljas, L. Crystal structure of an RNA bacteriophage coat proteinoperator complex. Nature, 1994, 371(6498), 623-626. Cavarelli, J.; Rees, B.; Ruff, M.; Thierry, J.C.; Moras, D. Yeast tRNA(Asp) recognition by its cognate class II aminoacyl-tRNA synthetase. Nature, 1993, 362(6416), 181-184. Mao, H.; White, S.A.; Williamson, J.R. A novel loop-loop recognition motif in the yeast ribosomal protein L30 autoregulatory RNA complex. Nat. Struct. Biol., 1999, 6(12), 1139-1147. Legault, P.; Li, J.; Mogridge, J.; Kay, L.E.; Greenblatt, J. NMR structure of the bacteriophage lambda N peptide/boxB RNA complex: recognition of a GNRA fold by an arginine-rich motif. Cell, 1998, 93(2), 289-299.

[90]

[91]

[92]

[93]

[94]

[95]

[96]

[97]

[98] [99]

[100]

[101]

Morgan, B.R.; Massi, F. A computational study of RNA binding and specificity in the tandem zinc finger domain of TIS11d. Protein Sci., 2010, 19(6), 1222-1234. Lai, W.S.; Kennington, E.A.; Blackshear, P.J. Interactions of CCCH zinc finger proteins with mRNA: non-binding tristetraprolin mutants exert an inhibitory effect on degradation of AU-rich element-containing mRNAs. J. Biol. Chem., 2002, 277(11), 96069613. Hudson, B.P.; Martinez-Yamout, M.A.; Dyson, H.J.; Wright, P.E. Recognition of the mRNA AU-rich element by the zinc finger domain of TIS11d. Nat. Struct. Mol. Biol., 2004, 11(3), 257-264. Ohyama, T.; Nagata, T.; Tsuda, K.; Kobayashi, N.; Imai, T.; Okano, H.; Yamazaki, T.; Katahira, M. Structure of Musashi1 in a complex with target RNA: the role of aromatic stacking interactions. Nucleic Acids Res., 2012, 40(7), 3218-3231. Kawahara, H.; Imai, T.; Imataka, H.; Tsujimoto, M.; Matsumoto, K.; Okano, H. Neural RNA-binding protein Musashi1 inhibits translation initiation by competing with eIF4G for PABP. J. Cell Biol., 2008, 181(4), 639-653. Worch, R.; Stepinski, J.; Niedzwiecka, A.; Jankowska-Anyszka, M.; Mazza, C.; Cusack, S.; Stolarski, R.; Darzynkiewicz, E. Novel way of capping mRNA trimer and studies of its interaction with human nuclear cap-binding complex. Nucleosides Nucleotides Nucleic Acids, 2005, 24(5-7), 1131-1134. Worch, R.; Jankowska-Anyszka, M.; Niedzwiecka, A.; Stepinski, J.; Mazza, C.; Darzynkiewicz, E.; Cusack, S.; Stolarski, R. Diverse role of three tyrosines in binding of the RNA 5' cap to the human nuclear cap binding complex. J. Mol. Biol., 2009, 385(2), 618-627. Hall, M.P.; Ho, C.K. Functional characterization of a 48 kDa Trypanosoma brucei cap 2 RNA methyltransferase. Nucleic Acids Res., 2006, 34(19), 5594-5602. (a) Sancar, A. DNA excision repair. Annual review of biochemistry, 1996, 65, 43-81; (b) Wood, R. D. DNA repair in eukaryotes. Annu. Rev. Biochem., 1996, 65, 135-167. (a) de Laat, W.L.; Jaspers, N.G.; Hoeijmakers, J.H. Molecular mechanism of nucleotide excision repair. Genes Development, 1999, 13(7), 768-785; (b) Mitchell, J.R.; Hoeijmakers, J.H.; Niedernhofer, L.J. Divide and conquer: nucleotide excision repair battles cancer and ageing. Curr. Opin. Cell Biol., 2003, 15(2), 232240; (c) Andressoo, J.O.; Hoeijmakers, J.H. Transcription-coupled repair and premature ageing. Mut. Res., 2005, 577(1-2), 179-194; (d) Lans, H.; Hoeijmakers, J.H. Cell biology: ageing nucleus gets out of shape. Nature, 2006, 440(7080), 32-34. Das, D.; Tripsianes, K.; Jaspers, N.G.; Hoeijmakers, J.H.; Kaptein, R.; Boelens, R.; Folkers, G.E. The HhH domain of the human DNA repair protein XPF forms stable homodimers. Proteins, 2008, 70(4), 1551-1563. (a) Hardy, J.; Allsop, D. Amyloid deposition as the central event in the aetiology of Alzheimer's disease. Trends Pharmacol. Sci., 1991, 12, 383-388; (b) Bemporad, F.; Taddei, N.; Stefani, M.; Chiti, F. Assessing the role of aromatic residues in the amyloid aggregation

[102]

[103]

[104] [105]

[106]

[107]

[108]

[109]

[110]

[111] [112]

of human muscle acylphosphatase. Protein Sci., 2006, 15(4), 862870. (a) Wu, C.; Lei, H.; Duan, Y. The role of Phe in the formation of well-ordered oligomers of amyloidogenic hexapeptide (NFGAIL) observed in molecular dynamics simulations with explicit solvent. Biophys. J., 2005, 88(4), 2897-2906; (b) Profit, A.A.; Felsen, V.; Chinwong, J.; Mojica, E.R.; Desamero, R.Z. Evidence of pistacking interactions in the self-assembly of hIAPP(22-29). Proteins, 2013, 81(4), 690-703. (a) Findeis, M.A.; Musso, G.M.; Arico-Muendel, C.C.; Benjamin, H.W.; Hundal, A.M.; Lee, J.J.; Chin, J.; Kelley, M.; Wakefield, J.; Hayward, N.J.; Molineaux, S.M. Modified-peptide inhibitors of amyloid beta-peptide polymerization. Biochemistry, 1999, 38(21), 6791-6800; (b) Soto, C.; Sigurdsson, E.M.; Morelli, L.; Kumar, R.A.; Castano, E.M.; Frangione, B. Beta-sheet breaker peptides inhibit fibrillogenesis in a rat brain model of amyloidosis: implications for Alzheimer's therapy. Nat. Med., 1998, 4(7), 822-826. Rao, P.P.; Mohamed, T.; Osman, W. Investigating the binding interactions of galantamine with beta-amyloid peptide. Bioorg. Med. Chem. Lett., 2013, 23(1), 239-243. (a) Chiti, F.; Dobson, C.M. Protein misfolding, functional amyloid, and human disease. Annu. Rev. Biochem., 2006, 75, 333-366; (b) Hammarstrom, P.; Wiseman, R.L.; Powers, E.T.; Kelly, J.W. Prevention of transthyretin amyloid disease by changing protein misfolding energetics. Science (New York, N.Y.), 2003, 299(5607), 713-716. Thomas, A.; Meurisse, R.; Charloteaux, B.; Brasseur, R. Aromatic side-chain interactions in proteins. I. Main structural features. Proteins, 2002, 48(4), 628-634. (a) Uversky, V.N.; Oldfield, C.J.; Dunker, A.K. Intrinsically disordered proteins in human diseases: introducing the D2 concept. Annu. Rev. Biophys., 2008, 37, 215-246; (b) Gray, D.A.; Woulfe, J. Structural disorder and the loss of RNA homeostasis in aging and neurodegenerative disease. Front. Genet., 2013, 4, 149. Cumberworth, A.; Lamour, G.; Babu, M.M.; Gsponer, J. Promiscuity as a functional trait: intrinsically disordered regions as central players of interactomes. Biochem. J., 2013, 454(3), 361-369. Disfani, F.M.; Hsu, W.L.; Mizianty, M.J.; Oldfield, C.J.; Xue, B.; Dunker, A.K.; Uversky, V.N.; Kurgan, L. MoRFpred, a computational tool for sequence-based prediction and characterization of short disorder-to-order transitioning binding regions in proteins. Bioinformatics (Oxford, England), 2012, 28(12), i75-83. Brown, C.J.; Johnson, A.K.; Daughdrill, G.W. Comparing models of evolution for ordered and disordered proteins. Mol. Biol. Evol., 2010, 27(3), 609-621. Mohan, A.; Oldfield, C.J.; Radivojac, P.; Vacic, V.; Cortese, M.S.; Dunker, A.K.; Uversky, V.N. Analysis of molecular recognition features (MoRFs). J. Mol. Biol., 2006, 362(5), 1043-1059. Espinoza-Fonseca, L.M. Aromatic residues link binding and function of intrinsically disordered proteins. Mol. BioSyst., 2012, 8(1), 237-246.