Serum albumin (SA) is the most abundant plasma protein in mammals. SA is a multifunctional protein with extraordinary ligand binding capacity, making it a transporter molecule for a diverse range of metabolites, drugs, nutrients, metals and other molecules. Due to its ligand binding properties, albumins have wide clinical, pharmaceutical, and biochemical applications. Albumins are also allergenic, and exhibit a high degree of cross-reactivity due to significant sequence and structure similarity of SAs from different organisms. Here we present crystal structures of albumins from cattle (BSA), horse (ESA) and rabbit (RSA) serums. The structural data are correlated with the results of immunological studies of SAs. We also analyze the conservation or divergence of structures and sequences of SAs in the context of their potential allergenicity and cross-reactivity. In addition, we identified a previously uncharacterized ligand binding site in the structure of RSA, and calcium binding sites in the structure of BSA, which is the first serum albumin structure to contain metal ions.
Clustered regularly interspaced short palindromic repeats (CRISPRs) together with the associated CAS proteins protect microbial cells from invasion by foreign genetic elements using presently unknown molecular mechanisms. All CRISPR systems contain proteins of the CAS2 family, suggesting that these uncharacterized proteins play a central role in this process. Here we show that the CAS2 proteins represent a novel family of endoribonucleases. Six purified CAS2 proteins from diverse organisms cleaved single-stranded RNAs preferentially within U-rich regions. A representative CAS2 enzyme, SSO1404 from Sulfolobus solfataricus, cleaved the phosphodiester linkage on the 3-side and generated 5-phosphate-and 3-hydroxyl-terminated oligonucleotides. The crystal structure of SSO1404 was solved at 1.6 Å resolution revealing the first ribonuclease with a ferredoxin-like fold. Mutagenesis of SSO1404 identified six residues (Tyr-9, Asp-10, Arg-17, Arg-19, Arg-31, and Phe-37) that are important for enzymatic activity and suggested that Asp-10 might be the principal catalytic residue. Thus, CAS2 proteins are sequence-specific endoribonucleases, and we propose that their role in the CRISPR-mediated anti-phage defense might involve degradation of phage or cellular mRNAs.
The low reproducibility of published experimental results in many scientific disciplines has recently garnered negative attention in scientific journals and the general media. Public transparency, including the availability of `raw' experimental data, will help to address growing concerns regarding scientific integrity. Macromolecular X-ray crystallography has led the way in requiring the public dissemination of atomic coordinates and a wealth of experimental data, making the field one of the most reproducible in the biological sciences. However, there remains no mandate for public disclosure of the original diffraction data. The Integrated Resource for Reproducibility in Macromolecular Crystallography (IRRMC) has been developed to archive raw data from diffraction experiments and, equally importantly, to provide related metadata. Currently, the database of our resource contains data from 2920 macromolecular diffraction experiments (5767 data sets), accounting for around 3% of all depositions in the Protein Data Bank (PDB), with their corresponding partially curated metadata. IRRMC utilizes distributed storage implemented using a federated architecture of many independent storage servers, which provides both scalability and sustainability. The resource, which is accessibleviathe web portal at http://www.proteindiffraction.org, can be searched using various criteria. All data are available for unrestricted access and download. The resource serves as a proof of concept and demonstrates the feasibility of archiving raw diffraction data and associated metadata from X-ray crystallographic studies of biological macromolecules. The goal is to expand this resource and include data sets that failed to yield X-ray structures in order to facilitate collaborative efforts that will improve protein structure-determination methods and to ensure the availability of `orphan' data left behind for various reasons by individual investigators and/or extinct structural genomics projects.
HD-domain phosphohydrolases have nucleotidase and phosphodiesterase activities and play important roles in the metabolism of nucleotides and in signaling. We present three 2.1-A-resolution crystal structures (one in the free state and two complexed with natural substrates) of an HD-domain phosphohydrolase, the Escherichia coli 5'-nucleotidase YfbR. The free-state structure of YfbR contains a large cavity accommodating the metal-coordinating HD motif (H33, H68, D69, and D137) and other conserved residues (R18, E72, and D77). Alanine scanning mutagenesis confirms that these residues are important for activity. Two structures of the catalytically inactive mutant E72A complexed with Co(2+) and either thymidine-5'-monophosphate or 2'-deoxyriboadenosine-5'-monophosphate disclose the novel binding mode of deoxyribonucleotides in the active site. Residue R18 stabilizes the phosphate on the Co(2+), and residue D77 forms a strong hydrogen bond critical for binding the ribose. The indole side chain of W19 is located close to the 2'-carbon atom of the deoxyribose moiety and is proposed to act as the selectivity switch for deoxyribonucleotide, which is supported by comparison to YfdR, another 5'-nucleotidase in E. coli. The nucleotide bases of both deoxyriboadenosine-5'-monophosphate and thymidine-5'-monophosphate make no specific hydrogen bonds with the protein, explaining the lack of nucleotide base selectivity. The YfbR E72A substrate complex structures also suggest a plausible single-step nucleophilic substitution mechanism. This is the first proposed molecular mechanism for an HD-domain phosphohydrolase based directly on substrate-bound crystal structures.
The Protein Structure Initiative’s Structural Biology Knowledgebase (SBKB, URL: http://sbkb.org) is an open web resource designed to turn the products of the structural genomics and structural biology efforts into knowledge that can be used by the biological community to understand living systems and disease. Here we will present examples on how to use the SBKB to enable biological research. For example, a protein sequence or Protein Data Bank (PDB) structure ID search will provide a list of related protein structures in the PDB, associated biological descriptions (annotations), homology models, structural genomics protein target status, experimental protocols, and the ability to order available DNA clones from the PSI:Biology-Materials Repository. A text search will find publication and technology reports resulting from the PSI’s high-throughput research efforts. Web tools that aid in research, including a system that accepts protein structure requests from the community, will also be described. Created in collaboration with the Nature Publishing Group, the Structural Biology Knowledgebase monthly update also provides a research library, editorials about new research advances, news, and an events calendar to present a broader view of structural genomics and structural biology.
A nonredundant set of 9081 protein crystal structures in the Protein Data Bank was used to examine the solvent content, the number of polypeptide chains, and the oligomeric states of proteins in crystals as a function of crystal symmetry (as classified by crystal systems and space groups). It was found that there is a correlation between solvent content and crystal symmetry. Surprisingly, proteins crystallizing in lower symmetry systems have lower solvent content compared to those crystallizing in higher symmetry systems. Nevertheless, there is no universal correlation between solvent content and preferences of macromolecules to crystallize in certain space groups. Crystal symmetry as a function of oligomeric state was examined, where trimers, tetramers, and hexamers were found to prefer to crystallize in systems where the oligomer symmetry could be incorporated in the crystal symmetry. Our analysis also shows that the frequency distribution within the enantiomorphous pairs of space groups does not differ significantly, in contrast to previous reports.Keywords: solvent content; Matthews coefficient; protein crystals; oligomerization; space group frequency Supplemental material: see www.proteinscience.orgWater plays an important role in the structure of biomolecules and often influences protein function. Water molecules not only affect protein folding, but also mediate biological processes such as enzymatic reactions and molecular recognition. Information about the fraction of water (solvent) plays a significant role in the X-ray structure determination process. First, knowledge of the solvent content helps to determine the number of molecules in the asymmetric unit (Matthews 1968), which is crucial in early stages of crystal structure determination. Second, an approximate value of solvent content is needed for significant phase improvement by solvent flattening methods (Wang 1985;Leslie 1987;Abrahams and Leslie 1996), which is necessary to resolve the inherent phase ambiguity in single anomalous diffraction (SAD) experiments. For both SAD and MAD (multiwavelength anomalous diffraction) (Hendrickson 1991;Hendrickson et al. 1990), phase improvement by solvent flattening is critical for low resolution data (Kirillova et al. 2007), especially when non-crystallographic symmetry cannot be applied.Matthews (1968) observed that the solvent content in protein crystals ranged from 27% to 65%, with an average of 43%. He also showed that the quantity V M (the Matthews coefficient, defined as the ratio of the volume of the asymmetric unit to the molecular weight of all
Introduction X-ray crystallography plays an important role in structure-based drug design (SBDD), and accurate analysis of crystal structures of target macromolecules and macromolecule–ligand complexes is critical at all stages. However, whereas there has been significant progress in improving methods of structural biology, particularly in X-ray crystallography, corresponding progress in the development of computational methods (such as in silico high-throughput screening) is still on the horizon. Crystal structures can be overinterpreted and thus bias hypotheses and follow-up experiments. As in any experimental science, the models of macromolecular structures derived from X-ray diffraction data have their limitations, which need to be critically evaluated and well understood for structure-based drug discovery. Areas covered This review describes how the validity, accuracy and precision of a protein or nucleic acid structure determined by X-ray crystallography can be evaluated from three different perspectives: i) the nature of the diffraction experiment; ii) the interpretation of an electron density map; and iii) the interpretation of the structural model in terms of function and mechanism. The strategies to optimally exploit a macromolecular structure are also discussed in the context of ‘Big Data’ analysis, biochemical experimental design and structure-based drug discovery. Expert opinion Although X-ray crystallography is one of the most detailed ‘microscopes’ available today for examining macromolecular structures, the authors would like to re-emphasize that such structures are only simplified models of the target macromolecules. The authors also wish to reinforce the idea that a structure should not be thought of as a set of precise coordinates but rather as a framework for generating hypotheses to be explored. Numerous biochemical and biophysical experiments, including new diffraction experiments, can and should be performed to verify or falsify these hypotheses. X-ray crystallography will find its future application in drug discovery by the development of specific tools that would allow realistic interpretation of the outcome coordinates and/or support testing of these hypotheses.
While three dimensional structures have long been used to search for new drug targets, only a fraction of new drugs coming to the market has been developed with the use of a structure-based drug discovery approach. However, the recent years have brought not only an avalanche of new macromolecular structures, but also significant advances in the protein structure determination methodology only now making their way into structure-based drug discovery. In this paper, we review recent developments resulting from the Structural Genomics (SG) programs, focusing on the methods and results most likely to improve our understanding of the molecular foundation of human diseases. SG programs have been around for almost a decade, and in that time, have contributed a significant part of the structural coverage of both the genomes of pathogens causing infectious diseases and structurally uncharacterized biological processes in general. Perhaps most importantly, SG programs have developed new methodology at all steps of the structure determination process, not only to determine new structures highly efficiently, but also to screen protein/ligand interactions. We describe the methodologies, experience and technologies developed by SG, which range from improvements to cloning protocols to improved procedures for crystallographic structure solution that may be applied in "traditional" structural biology laboratories particularly those performing drug discovery. We also discuss the conditions that must be met to convert the present high-throughput structure determination pipeline into a high-output structure-based drug discovery system.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.