*****************
ORGANOID Data Set
*****************

About the Pancreas Cancer Organoid Profiling Program
----------------------------------------------------

The Pancreas Cancer Organoid Profiling Program is from the `Organoid Profiling Identifies Common Responders to Chemotherapy in Pancreatic Cancer `_ study and contains data generated from a collection of patient-derived pancreatic normal and cancer organoids.

About the Pancreas Cancer Organoid Profiling Data Set
-----------------------------------------------------

The data set consists of 70 cases and includes whole-genome, targeted exome, and RNA sequencing data on organoids as well as matched tumor and normal tissues. This data set is a valuable resource for pancreas cancer researchers, and those looking to compare primary tissue to organoid culture.

The NCI GDC houses all the clinical, biospecimen, and molecular characterization data with over 130 VCF, 298 BAM, 165 TXT, and 110 TSV files in around 21.89 TB of data. The Project ID in the GDC Data Portal is `ORGANOID-PANCREATIC `_.

For more information on the ORGANOID data, please refer to these sites:

- `dbGaP site `_
- `GDC Data Portal `_

Accessing Pancreas Cancer Organoid Profiling Data on the Cloud
--------------------------------------------------------------

Besides accessing the files on the GDC Data Portal, you can also access them from the GDC Google Cloud Storage Bucket, which means that you don’t need to download them to perform analysis. ISB-CGC stores the cloud file locations in tables in the ``isb-cgc-bq.GDC_case_file_metadata`` data set in BigQuery.

- To access these metadata files, go to the Google BigQuery console.
- Perform SQL queries to find the ORGANOID files. Here is an example:

.. code-block:: sql

   SELECT active.*, file_gdc_url
   FROM `isb-cgc-bq.GDC_case_file_metadata.fileData_active_current` as active,
        `isb-cgc-bq.GDC_case_file_metadata.GDCfileID_to_GCSurl_current` as GCSurl
   WHERE program_name = 'ORGANOID'
     AND active.file_gdc_id = GCSurl.file_gdc_id

Accessing the ORGANOID Data in Google BigQuery
------------------------------------------------

ISB-CGC has ORGANOID data, such as clinical and RNA-seq, stored in Google BigQuery tables. Information about these tables can be found using the `ISB-CGC BigQuery Table Search `_ with ORGANOID selected for filter PROGRAM. To learn more about this tool, see the `ISB-CGC BigQuery Table Search documentation <../BigQueryTableSearchUI.html>`_.

The ORGANOID tables are in project isb-cgc-bq. To learn more about how to view and query tables in the Google BigQuery console, see the `ISB-CGC BigQuery Tables documentation <../BigQuery.html>`_.

- Data set ``isb-cgc-bq.ORGANOID`` contains the latest tables for each data type.
- Data set ``isb-cgc-bq.ORGANOID_versioned`` contains previously released tables, as well as the most current table.
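

One way to see which tables are currently available is to query BigQuery's ``INFORMATION_SCHEMA.TABLES`` view for the ``isb-cgc-bq.ORGANOID`` data set. The query below is a minimal sketch rather than an official ISB-CGC example; it assumes your Google account can read the ``isb-cgc-bq`` project, and the table names it returns depend on the current release.

.. code-block:: sql

   -- List the tables currently published in the ORGANOID data set.
   -- INFORMATION_SCHEMA.TABLES is a standard BigQuery metadata view.
   SELECT table_name, table_type
   FROM `isb-cgc-bq.ORGANOID.INFORMATION_SCHEMA.TABLES`
   ORDER BY table_name

Replacing ``ORGANOID`` with ``ORGANOID_versioned`` in the query lists the previously released tables in the same way.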