About me

My Ph.D. research has focused on developing computational frameworks to explore tumor-immune interactions and to model immunotherapy resistance and response. For example, which gene could synergize with immunotherapy? How predictive is a biomarker for immunotherapy? Will a patient benefit from immunotherapy? To answer these questions, I build computational frameworks to analyze the massive scale of public datasets. Besides, I am also interested in data visualization. This drives me to string plotting functions frequently used in my research and to build visualization packages. Selected software are listed in the following contents.

Web Apps

TIDE: large-scale public data reuse to model immunotherapy response and resistance

View Website doi: 10.1186/s13073-020-0721-z

We integrated large-scale omics data and biomarkers on published ICB trials, non-immunotherapy tumor profiles, and CRISPR screens on a web platform TIDE. We processed the omics data for over 33K samples in 188 tumor cohorts from public databases, 998 tumors from 12 ICB clinical studies, and eight CRISPR screens that identified gene modulators of the anticancer immune response. Integrating these data on the TIDE web platform with three interactive analysis modules, we demonstrate the utility of public data reuse in hypothesis generation, biomarker optimization, and patient stratification.


TIMER2.0: Analysis of tumor-infiltrating immune cells

View Website doi: 10.1093/nar/gkaa407

TIMER2.0 provides robust estimation of immune infiltration levels for The Cancer Genome Atlas (TCGA) or user-provided tumor profiles using six state-of-the-art algorithms. TIMER2.0 provides four modules for investigating the associations between immune infiltrates and genetic or clinical features, and four modules for exploring cancer-related associations in the TCGA cohorts. Each module can generate a functional heatmap table, enabling the user to easily identify significant associations in multiple cancer types simultaneously. Overall, the TIMER2.0 web server provides comprehensive analysis and visualization functions of tumor infiltrating immune cells.


Data Visualization

Bioplots: The visualization package with statistics annotation.

PyPI PyPI - Format PyPI - License PyPI - Downloads

Bioplots is a Python data visualization package based on matplotlib. It provides graphics with statistical annotations.


Data Collection

TCGAdnloader: Downloading data from the TCGA on a managable data structure.

 Github

This package can download TCGA public available molecular profiles from both firebrowse and GDC Data portal. Data will be stored based on the molecular type.

rss facebook twitter github gitlab youtube mail spotify lastfm instagram linkedin google google-plus pinterest medium vimeo stackoverflow reddit quora quora