Download Resources
Gene-SCOUT aims to find similar genes to a particular gene of interest where for each gene a unique signature is constructed. The method exploits associations derived from 450,000 exomes sequenced in the UK Biobank. For a given gene, its signature comprises a collection of association between variants of the gene and phenotypic traits measured in the UK Biobank. The gene signature is provided by the vector of associations with quantitative traits in the UK Biobank database. More specifically, given a vector of associations and accompanying \(p\)-values, the signature is the vector of associations that are significant (to \(10^{-5}\)).
Attached are the vectors of significant traits, from which the similarities can be derived.
Similarities are computed using the genetic signatures, where for a given seed gene, an alternative gene must share at least one significant association with the seed gene. The similarity is then computed using the cosine similarity (see the About page for more details), where assocations that are not significant in the alternative gene are imputed with zero.
The following provides the matrix of similarities, with missing values indicating that no similarity could be calculated between these two genes. The seed gene is given by the header of each column.