TransCDR: a deep learning model for enhancing the generalizability of cancer drug response prediction through transfer learning and multimodal data fusion for drug representation

AI-generated keywords: Precision Medicine

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Precision medicine relies on accurate drug response prediction for personalized treatment strategies
  • Challenges in predicting cancer drug responses include limited data modalities, suboptimal fusion algorithms, and poor generalizability to novel drugs or cell lines
  • TransCDR is a novel approach that uses transfer learning and self-attention mechanism to predict drug responses
  • TransCDR excels in evaluating generalization of CDR prediction models to new compound scaffolds and cell line clusters
  • Key factors influencing drug response prediction are Extended Connectivity Fingerprint and genetic mutations
  • TransCDR outperforms state-of-the-art models and shows strong predictive capabilities on external testing sets like CCLE
  • The model can be used to investigate biological mechanisms underlying drug response through Gene Set Enrichment Analysis
  • Availability of source code and data on GitHub allows for further exploration and application of TransCDR
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Xiaoqiong Xia, Chaoyu Zhu, Yuqi Shan, Fan Zhong, Lei Liu

arXiv: 2311.12040v1 - DOI (q-bio.QM)
8 figures

Abstract: Accurate and robust drug response prediction is of utmost importance in precision medicine. Although many models have been developed to utilize the representations of drugs and cancer cell lines for predicting cancer drug responses (CDR), their performances can be improved by addressing issues such as insufficient data modality, suboptimal fusion algorithms, and poor generalizability for novel drugs or cell lines. We introduce TransCDR, which uses transfer learning to learn drug representations and fuses multi-modality features of drugs and cell lines by a self-attention mechanism, to predict the IC50 values or sensitive states of drugs on cell lines. We are the first to systematically evaluate the generalization of the CDR prediction model to novel (i.e., never-before-seen) compound scaffolds and cell line clusters. TransCDR shows better generalizability than 8 state-of-the-art models. TransCDR outperforms its 5 variants that train drug encoders (i.e., RNN and AttentiveFP) from scratch under various scenarios. The most critical contributors among multiple drug notations and omics profiles are Extended Connectivity Fingerprint and genetic mutation. Additionally, the attention-based fusion module further enhances the predictive performance of TransCDR. TransCDR, trained on the GDSC dataset, demonstrates strong predictive performance on the external testing set CCLE. It is also utilized to predict missing CDRs on GDSC. Moreover, we investigate the biological mechanisms underlying drug response by classifying 7,675 patients from TCGA into drug-sensitive or drug-resistant groups, followed by a Gene Set Enrichment Analysis. TransCDR emerges as a potent tool with significant potential in drug response prediction. The source code and data can be accessed at https://github.com/XiaoqiongXia/TransCDR.

Submitted to arXiv on 17 Nov. 2023

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2311.12040v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

, , , , In the field of precision medicine, accurate and robust drug response prediction is crucial for personalized treatment strategies. Various models have been developed to predict cancer drug responses (CDR) by leveraging drug and cancer cell line representations. However, challenges such as limited data modalities, suboptimal fusion algorithms, and poor generalizability to novel drugs or cell lines still exist. To address these issues, a novel approach called TransCDR has been introduced. TransCDR utilizes transfer learning to acquire drug representations and integrates multi-modality features of drugs and cell lines through a self-attention mechanism to predict IC50 values or sensitive states of drugs on cell lines. One key aspect that sets TransCDR apart is its ability to systematically evaluate the generalization of CDR prediction models to never-before-seen compound scaffolds and cell line clusters. In comparative evaluations against 8 state-of-the-art models, TransCDR demonstrates superior generalizability. Furthermore, it outperforms five variants that train drug encoders from scratch (such as RNN and AttentiveFP) across different scenarios. The most influential factors identified among multiple drug notations and omics profiles are Extended Connectivity Fingerprint and genetic mutations. The incorporation of an attention-based fusion module further enhances the predictive performance of TransCDR. Trained on the GDSC dataset, TransCDR exhibits strong predictive capabilities on external testing sets like CCLE and is also effective in predicting missing CDRs within GDSC. Additionally, the model is employed to investigate the biological mechanisms underlying drug response by categorizing 7,675 patients from TCGA into drug-sensitive or drug-resistant groups followed by Gene Set Enrichment Analysis. Overall, TransCDR emerges as a powerful tool with significant potential in advancing drug response prediction in precision medicine. The availability of source code and data on GitHub allows for further exploration and application of this innovative approach in the field.
Created on 24 Mar. 2026

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.