Sever acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a single-stranded RNA (ssRNA) virus, responsible for severe acute respiratory disease (COVID-19). A large number of natural compounds are under trial for screening compounds, possessing potential inhibitory effect against the viral infection. Keeping in view the importance of marine compounds in antiviral activity, we investigated the potency of some marine natural products to target SARS-CoV-2 main protease (M pro) (PDB ID 6MO3). The crystallographic structure of M pro in an apo form was retrieved from Protein Data Bank and marine compounds from PubChem. These structures were prepared for docking and the complex with good docking score was subjected to molecular dynamic (MD) simulations for a period of 100 ns. To measure the stability, flexibility, and average distance between the target and compounds, root mean square deviations (RMSD), root mean square fluctuation (RMSF), and the distance matrix were calculated. Among five marine compounds, C-1 (PubChem CID 11170714) exhibited good activity, interacting with the active site and surrounding residues, forming many hydrogen and hydrophobic interactions. The C-1 also attained a stable dynamic behavior, and the average distance between compound and target remains constant. In conclusion, marine natural compounds may be used as a potential inhibitor against SARS-CoV-2 for better management of COVID-19.
Gram-negative bacteria use various secretion systems to deliver their secreted effectors. Among them, type IV secretion system exists widely in a variety of bacterial species, and secretes type IV secreted effectors (T4SEs), which play vital roles in host-pathogen interactions. However, experimental approaches to identify T4SEs are time- and resource-consuming. In the present study, we aim to develop an in silico stacked ensemble method to predict whether a protein is an effector of type IV secretion system or not based on its sequence information. The protein sequences were encoded by the feature of position specific scoring matrix (PSSM)-composition by summing rows that correspond to the same amino acid residues in PSSM profiles. Based on the PSSM-composition features, we develop a stacked ensemble model PredT4SE-Stack to predict T4SEs, which utilized an ensemble of base-classifiers implemented by various machine learning algorithms, such as support vector machine, gradient boosting machine, and extremely randomized trees, to generate outputs for the meta-classifier in the classification system. Our results demonstrated that the framework of PredT4SE-Stack was a feasible and effective way to accurately identify T4SEs based on protein sequence information. The datasets and source code of PredT4SE-Stack are freely available at http://xbioinfo.sjtu.edu.cn/PredT4SE_Stack/index.php.
The emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and its new variants reported in different countries have posed a serious threat to human health and social fabrics worldwide. In addition, these new variants hindered the efforts of vaccines and other therapeutic developments. In this review article, we explained the emergence of new variants of SARS-CoV-2, their transmission risk, mortality rate, and, more importantly, the impact of each new variant on the efficacy of the developed vaccines reported in different literature and findings. The literature reported that with the emergence of new variants, the efficacy of different vaccines is declined, hospitalization is increased while the risk of reinfection also increased. The reports concluded that the emergence of a variant that entirely evades the immune response triggered by the vaccine is improbable. The emergence of new variants and reports of re-infections are creating a more distressing situation and therefore demands further investigation to formulate an effective therapeutic strategy.
Identifying drug-target interactions (DTIs) is an important step for drug discovery and drug repositioning. To reduce the experimental cost, a large number of computational approaches have been proposed for this task. The machine learning-based models, especially binary classification models, have been developed to predict whether a drug-target pair interacts or not. However, there is still much room for improvement in the performance of current methods. Multi-label learning can overcome some difficulties caused by single-label learning in order to improve the predictive performance. The key challenge faced by multi-label learning is the exponential-sized output space, and considering label correlations can help to overcome this challenge. In this paper, we facilitate multi-label classification by introducing community detection methods for DTI prediction, named DTI-MLCD. Moreover, we updated the gold standard data set by adding 15,000 more positive DTI samples in comparison to the data set, which has widely been used by most of previously published DTI prediction methods since 2008. The proposed DTI-MLCD is applied to both data sets, demonstrating its superiority over other machine learning methods and several existing methods. The data sets and source code of this study are freely available at https://github.com/a96123155/DTI-MLCD.
One of the main problems with the joint use of multiple drugs is that it may cause adverse drug interactions and side effects that damage the body. Therefore, it is important to predict potential drug interactions. However, most of the available prediction methods can only predict whether two drugs interact or not, whereas few methods can predict interaction events between two drugs. Accurately predicting interaction events of two drugs is more useful for researchers to study the mechanism of the interaction of two drugs. In the present study, we propose a novel method, MDF-SA-DDI, which predicts drug–drug interaction (DDI) events based on multi-source drug fusion, multi-source feature fusion and transformer self-attention mechanism. MDF-SA-DDI is mainly composed of two parts: multi-source drug fusion and multi-source feature fusion. First, we combine two drugs in four different ways and input the combined drug feature representation into four different drug fusion networks (Siamese network, convolutional neural network and two auto-encoders) to obtain the latent feature vectors of the drug pairs, in which the two auto-encoders have the same structure, and their main difference is the number of neurons in the input layer of the two auto-encoders. Then, we use transformer blocks that include self-attention mechanism to perform latent feature fusion. We conducted experiments on three different tasks with two datasets. On the small dataset, the area under the precision–recall-curve (AUPR) and F1 scores of our method on task 1 reached 0.9737 and 0.8878, respectively, which were better than the state-of-the-art method. On the large dataset, the AUPR and F1 scores of our method on task 1 reached 0.9773 and 0.9117, respectively. In task 2 and task 3 of two datasets, our method also achieved the same or better performance as the state-of-the-art method. More importantly, the case studies on five DDI events are conducted and achieved satisfactory performance. The source codes and data are available at https://github.com/ShenggengLin/MDF-SA-DDI.
The human papillomavirus (HPV) 58 is considered to be the second most predominant genotype in cervical cancer incidents in China. HPV type-restriction, non-targeted delivery, and the highcost of existing vaccines necessitate continuing research on the HPV vaccine. We aimed to explore the papillomaviral proteome in order to identify potential candidates for a chimeric vaccine against cervix papilloma using computational immunology and structural vaccinology approaches. Two overlapped epitope segments (23–36) and (29–42) from the N-terminal region of the HPV58 minor capsid protein L2 are selected as capable of inducing both cellular and humoral immunity. In total, 318 amino acid lengths of the vaccine construct SGD58 contain adjuvants (Flagellin and RS09), two Th epitopes, and linkers. SGD58 is a stable protein that is soluble, antigenic, and non-allergenic. Homology modeling and the structural refinement of the best models of SGD58 and TLR5 found 96.8% and 93.9% favored regions in Rampage, respectively. The docking results demonstrated a HADDOCK score of −62.5 ± 7.6, the binding energy (−30 kcal/mol) and 44 interacting amino acid residues between SGD58-TLR5 complex. The docked complex are stable in 100 ns of simulation. The coding sequences of SGD58 also show elevated gene expression in Escherichia coli with 1.0 codon adaptation index and 59.92% glycine-cysteine content. We conclude that SGD58 may prompt the creation a vaccine against cervix papilloma.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.