User Defined Functions

BigQuery now supports User Defined Functions (UDFs) in SQL and JavaScript that extend BigQuery for more specialized computations and that can be reused in notebooks and queries. To facilitate the analysis of cancer data, ISB-CGC offers a set of UDFs that implement commonly used statistical tests and methods in cancer research and bioinformatics. The UDFs are located in the isb-cgc-bq.functions data set, and the source code of the functions and examples of how to use them can be found in our Community Notebook GitHub Repository. The following table lists all the functions available in ISB-CGC.

UDF (click for details)

Description

Notebooks

kmeans

K-means method for clustering data

Python

p_fisherexact

p value of the Fisher exact test

mannwhitneyu

Mann–Whitney U test

kruskal_walis

Kruskal Walis test

Python

significance_level_ttest2

Significance level of the two sided T test

Python

complement_chisquare_cdf

One minus the CDF of the Chi Square distribution

jstat_normal_cdf

CDF of the Normal distribution


Have feedback or corrections? Please email us at feedback@isb-cgc.org. Follow us on BlueSky and X!