This webserver was designed as a resource for developing novel immunotherapeutics that harness multi-antigen signatures to improve cancer recognition and limit normal tissue cross-reactions. The expression data shown here cover 33 tumor types and 34 normal human tissues utilizing 21,486 samples taken from The Cancer Genome Atlas and the Genotype-Tissue Expression (GTEx) Project v7. We examine 2,358 protein-coding genes annotated to be expressed on the cell surface or that are targets in current clinical immunotherapy trials. The antigen types are referred to as novel (N) or clinical (C) antigens respectively.
While transcriptomics data does not directly reflect protein abundance, we use the expression levels of genes encoding surface proteins as an initial prediction of therapeutic potential. When gene combinations can separate normal tissue samples from cancer samples well in expression space it is likely that they are good candidates for further clinical investigation. We quantified the separation potential of all antigen combinations using a distance based heuristic inspired by clustering evaluations as well as decision tree classifiers. Together top scoring combinations (combined scores of 1) are the best at normal tissue vs tumor descrimination.
Because the space of potential combinations is vast (to analyze any two antigens is it 2.5 million pairs!), we reduce our predictions shown in the webserver to the top 100 per tumor type per antigen type (novel / clinical). We also limit the number of times a single gene can appear in a combination, keeping only its top 2 highest scoring pairings. Any of the 2,358 genes can be searched and the expression of the combination plotted with an interactive scatterplot.
Ideal antigens must separate tissue samples from tumor samples and have large differences in gene expression (very distant). To assess both of these qualities we use several metrics which are listed in our predictions both on the browse page and in single gene searches. Here are brief descriptions of the scores we have calculated. More rigorous definitions can be found in our paper.
Further details on this work can be found in our paper published in Cell Systems. Please cite this work if you use our results in a publication:
Discriminatory power of combinatorial antigen recognition in cancer T cell therapies. Dannenfelser R, Allen G, VanderSluis B, Koegel AK, Levinson S, Stark SR, Yao V, Tadych A, Troyanskaya OG, Lim WA. (2020) Cell Systems.
The antigen explorer webserver was created in the Troyanskaya Laboratory for Bioinformatics and Functional Genomics in the Lewis-Sigler Institute for Integrative Genomics at Princeton University in conjunction with the Lim Laboratory at UCSF. Use is protected under an academic and commercial license. If interested in licensing antigen combinations for use in therapeutic developement, please contact Todd Pazdera at UCSF.