CGCI Publication Guidelines
Like other National Cancer Institute large-scale genomics initiatives, the Cancer Genome Characterization Initiative (CGCI) is a community resource project. After validation, data are rapidly made available for use by other researchers. To act in accord with the Fort Lauderdale principles and support the continued prompt public release of large-scale genomic data prior to publication, researchers preparing manuscripts that (1) contain descriptions of CGCI data and/or (2) are of comparable scope to an initial CGCI disease-specific comprehensive global analysis publication, are required to coordinate their independent reports with CGCI's publication schedule. Journal editors who receive such manuscripts are strongly encouraged to contact the Office of Cancer Genomic (OCG; firstname.lastname@example.org) to ensure that their publication timeline is in line with CGCI.
Comparable scope is defined as global genome-wide analysis of CGCI disease-specific data or data across diseases from one or more platforms (for disease types that encompass CGCI subprojects, see table below). As described below, publications of comparable scope are restricted prior to the release of first disease-specific manuscript.
As a general reminder, all use and publications based on CGCI data are required to follow the Data Use Limitations outlined in the CGCI Data Use Certification at the database of Genotypes and Phenotypes (dbGaP). As stated in the Data Use Limitations, Medulloblastoma data are for pediatric research use only.
Use of CGCI Data in Manuscripts Prior to Publication of Initial CGCI Disease-Specific Integrated Analysis
Similar to standard practices of other large-scale genomic projects, the National Cancer Institute does not consider the deposition of CGCI data into its own data portal or public databases as the equivalent of publication in a peer-reviewed journal. Therefore, although the data are available to others, the data producers consider these data as unpublished and expect that the data will be used in accord with standard scientific etiquette and practices concerning unpublished data.
The first paper authored by each CGCI subproject team (https://ocg.cancer.gov/programs/cgci/projects) includes the data and analysis from a majority (~90%) of qualified CGCI cases. Specifically, these manuscripts report on the comprehensive, integrated analysis of multiple CGCI subproject datasets, including, but not limited to:
- Copy number variation, gene and miRNA expression, structural variants and DNA sequence/mutation analysis or a combination thereof; or
- An analysis of data from a single platform across more than one tumor type, where not all the subproject teams have published their first comprehensive analyses papers for the tumor types being analyzed
Investigators may only publish a manuscript before the CGCI subproject team has published the global analysis on that tumor type IF the publication uses a very limited dataset (less than 5 genes). Investigators should submit their abstract and request to OCG by emailing email@example.com.
Prior to the publication of comprehensive analysis on a specific tumor type, available datasets are subject to the same standard principles of scientific etiquette that apply to use of unpublished findings from other sources. Below is the list of tumor datasets that are available for use now. The list will be updated when the first global manuscript for each tumor is accepted for publication. For questions, do not hesitate to contact firstname.lastname@example.org.
|Tumor Type/ Data Use Limitation||Subproject Status||Data Availability Status|
|Burkitt lymphoma/ general research||Pediatric cases are completed. Adult cases are in progress.||Pediatric data are available at NCI's Genomic Data Commons and OCG Data Coordinating Center (DCC). EBV sequences are available as BAM alignments from the public directory at the DCC.|
|Medulloblastoma/ pediatric data||Completed; analyzed data are available||Controlled access copy number data at OCG Data Coordinating Center (DCC).|
|Non-Hodgkin lymphoma/ general research||Completed; all raw and analyzed data are available||Open and controlled access to mRNA-seq data and whole genome sequencing data through NCBI's Sequence Read Archive and OCG DCC.|
|HIV+ associated cervical cancer/ general research||In progress||Data not available.|
|HIV+ associated lung cancer/ general research||In progress||Data not available.|
|HIV+ associated diffuse large B cell lymphoma/ general research||In progress||Data not available.|
Use of CGCI Data in Manuscripts After Each Initial CGCI Disease-Specific Global Analysis Publication
There are no restrictions on the use of CGCI data in manuscripts after the initial CGCI global analysis publication if the data are used specifically within the Data Use Limitations. Medulloblastoma data are for pediatric research use only.
If data are used within the Data Use Limitations, there are no restrictions on the use of CGCI data for legitimate research purposes not involving publication or public presentation. For example, researchers may use CGCI data in research grant applications at any time with appropriate acknowledgement (see below), regardless of whether the initial CGCI disease-specific global analysis has been published.
OCG requests that authors who use data from CGCI acknowledge the appropriate subproject team(s) in their work by referencing the CGCI dataset in accordance with the CGCI Data Use Certification at dbGaP. Inclusion of CGCI subproject team members as authors is not required.
If investigators desire to collaborate with CGCI members, they should contact the specific subproject team. Authors are also encouraged to recognize the contribution of the appropriate specimen donors and research groups via the acknowledgements section in their publication. Similarly, OCG requests that journal editors and reviewers attempt to ascertain if CGCI is cited and if appropriate acknowledgements are made.
For questions, please contact the CGCI Publications Committee through OCG by emailing email@example.com.