FAQ

How do I use the R package?

The R package is used to download the data from the API and perform the colocalization and rare variant analysis.

How do I upload my own data?

You can upload your own GWAS summary statistics to run colocalization and rare variant analysis against the GPMap database. You can optionally specify one or more existing GWAS upload GUIDs to also compare your upload against those (in addition to the main database).

Is my uploaded data available to others?

Your uploaded data is not automatically added to the official GPMap database that others can view or search for. However, as we do not require a login to use the website, we cannot prevent you from sharing the URL of your uploaded data with others. Anyone who knows the GUID of your uploaded data can access it. The results are not discoverable by other users, but are available to anyone who knows the GUID.

How does the GWAS upload pipeline work?

GWAS Upload Pipeline

The GWAS upload pipeline is a process that allows you to upload a GWAS and perform the colocalization and rare variant analysis. It uses the same data processing pipeline used to created this resource, with some caveats. There are a series of steps that are taken to process the data, some of which will remove the data and make it look potentially inconsistent.

How are common and rare variant results linked?

To integrate common and rare variant datasets, we explicitly linked 1,485 studies that utilized identical UK Biobank data fields, while the remaining 14,512 studies were treated as independent phenotypic entries to maintain a conservative approach toward cross-study phenotypic equivalence.

We acknowledge that a significantly larger number of studies likely share overlapping biology or phenotypic definitions. However, to maintain a conservative strategy, we restricted explicit cross-category linking to instances where the phenotypic definitions were identical (e.g., matching UK Biobank data fields). For the remaining studies, we treat them as independent entries in the resource to avoid making assumptions about phenotypic equivalence across different cohorts or coding systems. We encourage the user to consider the potential for phenotypic equivalence of traits when investigating the results of their specfic trait of interest.

Choosing p-value thresholds

It is important to note that GPMap is intended to serve as a general-purpose research tool. Consequently, individual researchers should select p-value thresholds that are appropriate for their specific use case, ranging from conservative multi-testing corrections required for hypothesis-free discovery to more relaxed thresholds suitable for hypothesis-driven investigations or the validation of established signals.

How do I interpret this graph?

Graph options

Trait view

Displays colocalised results of the study in question, and shows all studies which colocalise with it, overlayed on top of the the Manhattan plot of the phenotype. Also displays significant rare and non-colocalising results. To compare 2 specific traits, please use the 'Filter Results By' dropdown.

Variant view

SNP view displays the colocalisation results for a single SNP. Each circle represents a trait that is in the colocalisation group.

Gene and region view

Displays the colocalisation results for a gene or region. Each circle represents a result that has a study marked with that gene. Results may not align with the exonic region of the gene, as some studies may have QTLs which are in the regulatory region of the gene.