OCG-Supported Resources

As OCG research programs evolve over time, they generate resources for scientific and educational communities. Learn about these resources.

Cancer Genome Anatomy Project (CGAP)

CGAP generated a wide range of genomics data on cancerous cells that are accessible through easy-to-use online tools. Researchers, educators, and students can find "in silico" answers to biological questions through the CGAP website. Request a free copy of the CGAP Website Virtual Tour CD from ocg@mail.nih.gov to learn how to navigate the website.


Funded in large part by the Initiative for Chemical Genetics (ICG), Chembank is an interactive database for small molecules. It contains data from hundreds of biomedically relevant small molecule screens that involved hundreds-of-thousands of compounds. Chembank also provides analysis tools to facilitate data mining.

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRi) plasmids

CTD2 researchers at the University of California in San Francisco developed a modified Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) CRISPR/dCas9 system. Catalytically inactive dCas9 enables modular and programmable RNA-guided genome regulation in eukaryotes. The CRISPR/dCas9 system has several advantages:  i) enables robust gene repression (CRISPRi) or activation (CRISPRa) in human cells, ii) allows specific knockdown with minimal off-target effects in human cells, iii) works efficiently in human and yeast cells, and iv) does not cause  double-strand breaks. Plasmid design and construction for CRISPRi (human and yeast cells) are described in the manuscript listed below and are available through a distributor.


Gilbert LA, Larson MH, Morsut L, Liu Z, Brar GA, Torres SE, Stern-Ginossar N, Brandman O, Whitehead EH, Doudna JA, Lim WA, Weissman JS, Qi LS (2013). CRISPR-Mediated Modular RNA-Guided Regulation of Transcription in Eukaryotes. Cell 154(2):422-51. PMID: 23849981

Gabriella Miller Kids First (GMKF/Kids First) Pediatric Research Program

The Gabriella Miller Kids First initiative is a trans-NIH effort to increase understanding of genetic changes associated with certain devastating pediatric conditions. The initiative will develop a centralized database of well-curated clinical and genetic sequence data from childhood cancer and structural birth defects cohorts comprising thousands of patients and their families.
To learn more about the initiative and the data available, please visit

Genome-wide Association Studies from the Cancer Genetic Markers of Susceptibility (CGEMS) Initiative

CGEMS identifies common inherited genetic variations associated with a number of cancers, including breast and prostate. Data from these genome-wide association studies (GWAS) are available through the Division of Cancer Epidemiology & Genetics website.

Resources Image

Human cDNA Library from the ORFeome Collaboration (OC)

The goal of the OC, an informal volunteer multi-institutional collaboration, is to provide the research community with validated, full open reading frame (ORF) cDNA clones for all of the currently defined human genes.  The ORF clones do not include 5’ and 3’ UTRs and can be easily sub-cloned into any type of expression vector. These clones are available to researchers worldwide through multiple distributors.

Mammalian cDNA Library from the NIH Mammalian Gene Collection (MGC)

The MGC provides the research community full-length clones for most of the defined (as of 2006) human and mouse genes, along with selected clones of cow and rat genes. Clones were designed to allow easy transfer of the ORF sequences into nearly any type of expression vector. MGC provides protein ‘expression-ready’ clones for each of the included human genes. MGC is part of the ORFeome Collaboration (OC).

Protein-Protein Interaction Reagents

The CTD2 Center at Emory University has a library of genes used to study protein-protein interactions in mammalian cells. These genes are cloned in different mammalian expression vectors. A list of available cancer-associated genes can be accessed below.


Contact: Haian Fu 

The Cancer Genome Atlas (TCGA) Data Portal

The Cancer Genome Atlas Data Portal contains clinical information, genomic characterization data, and high-throughput sequencing analysis of over twenty different cancers. Search, download, and analyze datasets generated by TCGA.