Research

AI-Driven Drug Discovery

Transcriptional response-based drug discovery

Traditional drug discovery approaches identify a target for a disease and then search for a compound that binds to that target. In this approach, compound structure is treated as the most important feature, on the assumption that structurally similar compounds bind to the same target. Structural analogs of drugs known to bind the target are therefore selected as drug candidates. However, compounds that are not structural analogs may still achieve the desired response. A drug discovery method based on drug response, which can complement structure-based methods, is needed. We implemented a Siamese neural network called ReSimNet that takes two chemical compounds as input and predicts their CMap score, which we use to measure the transcriptional response similarity of the two compounds. ReSimNet learns an embedding vector for each chemical compound in a transcriptional response space. It is trained to minimize the difference between the cosine similarity of the two compounds' embedding vectors and their CMap score. ReSimNet can find pairs of compounds that are similar in response even though they may have dissimilar structures... Read More (1) Read More (2)
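The training objective described above can be sketched in a few lines. This is a minimal illustration, not the ReSimNet implementation: the embedding vectors are hypothetical stand-ins for the output of the shared encoder, the squared error is an assumed choice for "the difference", and the target score is assumed to lie on the same scale as cosine similarity.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def pair_loss(emb_a, emb_b, cmap_score):
    """Squared error between the embedding similarity and the target score.

    Squared error is an assumption made for this sketch; the text only says
    that the difference between the two quantities is minimized.
    """
    return (cosine_similarity(emb_a, emb_b) - cmap_score) ** 2


# Hypothetical embeddings of two compounds in the response space.
emb_a = [1.0, 0.0]
emb_b = [0.0, 1.0]

# Orthogonal embeddings have cosine similarity 0, so a target of 0.5
# leaves a residual of 0.5 and a squared error of 0.25.
loss = pair_loss(emb_a, emb_b, 0.5)
```

In a Siamese setup, both compounds pass through the same encoder with shared weights, so lowering this loss moves compounds with similar transcriptional responses closer together in the embedding space regardless of their structural similarity.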

First Place in Multi-Targeting Drug DREAM Challenge, 2018 (Janssen Pharmaceutica ranked 4th): Link, Press

Pain target identification

While hundreds of genes have been associated with pain, many of the molecular mechanisms of pain remain unknown. As a result, current analgesics are limited to a few clinically validated targets. Here, we trained a machine learning (ML) ensemble model to predict new targets for 17 categories of pain. The model uses features from transcriptomics, proteomics, and gene ontology to prioritize targets for modulating pain. We focused on identifying novel G-protein-coupled receptors (GPCRs), ion channels, and protein kinases because these proteins represent the most successful drug target families. The model predicts hundreds of novel targets for pain; for example, GPR132 and GPR109B are highly ranked GPCRs for rheumatoid arthritis. Overall, gene-pain association predictions cluster into three groups that are enriched for cytokine, calcium, and GABA-related cell signaling pathways. These predictions can serve as a foundation for future experimental exploration to advance the development of safer and more effective analgesics... Read More

This paper was highlighted by NIH on Aug 2, 2021: Link

Drug Combination Synergy Prediction for Personalized Medicine

Drug combination therapy, considered an alternative to single-drug therapy, can potentially reduce resistance and toxicity and achieve synergistic efficacy. Because drug combination therapies are widely used in the clinic for hypertension, asthma, and AIDS, they have also been proposed for the treatment of cancer. However, it is difficult to select and experimentally evaluate effective combinations: the number of possible cancer drug combinations is extremely large, and the effectiveness of a combination varies with the genetic variation of cancer patients. A computational approach that prioritizes the best drug combinations given the genetic information of a cancer patient is necessary to reduce the search space. We propose an in-silico method for personalized drug combination therapy discovery. We predict the synergy between two drugs and a cell line using genomic information, drug targets, and pharmacological information. We calculate and predict the synergy scores of 583 drug combinations for 31 cancer cell lines. For 15 of the 31 cell lines, the model ranks the most synergistic combination first among approximately 100 candidate drug combinations. For 28 of the 31 cell lines, the model places the most synergistic combination in the top 10 of approximately 100 candidates. Finally, we analyze the results, generate synergistic rules using the features, and validate the rules through a literature survey... Read More (1) Read More (2)
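The top-1 and top-10 figures above correspond to a per-cell-line hit rate, which can be sketched as follows. The cell line names, drug names, and scores here are hypothetical toy values chosen only to make the calculation easy to follow; they are not taken from the study.

```python
def top_k_hit(predicted, observed, k):
    """Return True if the combination with the highest observed synergy
    is among the k combinations with the highest predicted synergy."""
    best_observed = max(observed, key=observed.get)
    ranked = sorted(predicted, key=predicted.get, reverse=True)
    return best_observed in ranked[:k]


def hit_rate(per_cell_line, k):
    """Fraction of cell lines whose most synergistic combination is in the top k."""
    hits = sum(top_k_hit(pred, obs, k) for pred, obs in per_cell_line.values())
    return hits / len(per_cell_line)


# Hypothetical data: cell line -> (predicted scores, observed scores).
toy = {
    "cell_line_A": (
        {"d1+d2": 0.9, "d1+d3": 0.2, "d2+d3": 0.5},
        {"d1+d2": 30.0, "d1+d3": 5.0, "d2+d3": 12.0},
    ),
    "cell_line_B": (
        {"d1+d2": 0.1, "d1+d3": 0.8, "d2+d3": 0.6},
        {"d1+d2": 4.0, "d1+d3": 9.0, "d2+d3": 21.0},
    ),
}

top1 = hit_rate(toy, k=1)  # only cell_line_A ranks its best combination first
top2 = hit_rate(toy, k=2)  # both cell lines have it within the top two
```

Applied to the reported results, this measure would give 15/31 at k=1 and 28/31 at k=10.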

Second Place in AstraZeneca-Sanger Drug Combination Prediction DREAM Challenge, 2016 (Stanford University ranked 6th, MIT ranked 11th): Link, Press

Biomedical Data Analysis

Transforming Jupyter Notebooks into data-driven web-based applications

Jupyter Notebooks have transformed the communication of data analysis pipelines by facilitating a modular structure that brings together code, markdown text, and interactive visualizations. Here, we extended Jupyter Notebooks to broaden their accessibility with Appyters. Appyters turn Jupyter Notebooks into fully functional standalone web-based bioinformatics applications. Appyters present to users an entry form enabling them to upload their data and set various parameters for a multitude of data analysis workflows. Once the form is filled, the Appyter executes the corresponding notebook in the cloud, producing the output without requiring the user to interact directly with the code. Appyters were used to create many bioinformatics web-based reusable workflows, including applications to build customized machine learning pipelines, analyze omics data, and produce publishable figures. These Appyters are served in the Appyters Catalog at https://appyters.maayanlab.cloud. In summary, Appyters enable the rapid development of interactive web-based bioinformatics applications. Read More Link to Appyter Catalog

Bulk RNA-seq Analysis Appyter

As more researchers generate their own RNA-seq datasets, the demand for data analysis pipelines is increasing. However, building a pipeline is not easy for those who lack a computational background. We therefore developed a bulk RNA-seq analysis Appyter that enables you to analyze and visualize your datasets. The Appyter takes two input files: metadata and expression data. The metadata contains the class labels of the samples, and the expression data contains the expression level of each gene in each sample. The Appyter provides basic data preprocessing and visualization methods as well as differentially expressed gene analysis methods. Link to Bulk RNA-seq Analysis Appyter
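To make the two inputs concrete, the sketch below pairs a toy expression table with a toy metadata table, normalizes counts, and compares two classes. It is an illustration under stated assumptions, not the Appyter's code: the gene names, sample names, class labels, and counts are hypothetical, and a plain log2 fold change stands in for the differential expression methods the Appyter offers.

```python
import math
from statistics import mean


def log2_cpm(expression):
    """Convert raw counts to log2(counts per million + 1) for each sample."""
    samples = list(next(iter(expression.values())))
    library_size = {s: sum(counts[s] for counts in expression.values()) for s in samples}
    return {
        gene: {s: math.log2(counts[s] / library_size[s] * 1e6 + 1) for s in samples}
        for gene, counts in expression.items()
    }


def log2_fold_change(normalized, metadata, case, control):
    """Difference of mean log2 expression between two sample classes, per gene."""
    result = {}
    for gene, values in normalized.items():
        case_mean = mean(v for s, v in values.items() if metadata[s] == case)
        control_mean = mean(v for s, v in values.items() if metadata[s] == control)
        result[gene] = case_mean - control_mean
    return result


# Hypothetical expression data: gene -> {sample: raw read count}.
expression = {
    "GENE1": {"s1": 100, "s2": 100, "s3": 400, "s4": 400},
    "GENE2": {"s1": 900, "s2": 900, "s3": 600, "s4": 600},
}

# Hypothetical metadata: sample -> class label.
metadata = {"s1": "control", "s2": "control", "s3": "treated", "s4": "treated"}

lfc = log2_fold_change(log2_cpm(expression), metadata, "treated", "control")
# GENE1 is roughly 4-fold higher in treated samples, GENE2 is lower.
```

A real analysis would add a statistical test and multiple-testing correction on top of the fold change; those steps are omitted here to keep the sketch short.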

Single Cell RNA-seq Analysis Appyter

We developed a single-cell RNA-seq (scRNA-seq) analysis pipeline. This pipeline is a web-based platform that enables users to analyze and visualize scRNA-seq datasets with an array of algorithms and data visualization methods. The pipeline includes quality control, library size analysis, normalization, plotting of the most highly expressed genes, plotting of samples, MAGIC normalization, clustering, enrichment analysis, trajectory inference, and cell type prediction. Link to scRNA-seq Analysis Appyter
