Presenter notes contain extra information which might be useful if you intend to use these slides for teaching.
Press P
again to switch presenter notes off
Press C
to create a new window where the same presentation will be displayed.
This window is linked to the main window. Changing slides on one will cause the
slide to change on the other.
Useful when presenting.
Presenter notes contain extra information which might be useful if you intend to use these slides for teaching.
Press P
again to switch presenter notes off
Press C
to create a new window where the same presentation will be displayed.
This window is linked to the main window. Changing slides on one will cause the
slide to change on the other.
Useful when presenting.
Before diving into this slide deck, we recommend you to have a look at:
What is Gene Ontology (GO) enrichment analysis, and why should I perform it on my marker genes?
How can I use GO enrichment analysis to better understand the biological functions of the genes in my clusters?
Can GO enrichment analysis help me confirm that my clusters represent distinct cell types or states?
Here is a typical workflow for analyzing single-cell RNA sequencing data. We can break this process down into three main sections:
A standardized vocabulary for expressing knowledge within a specific domain.
.footnote [Adapted from Parlar ]
Gene Ontology has 3 main classifications (Biological process, Molecular function, and Cellular component) this allows scientists to precisely describe what a gene does, how it does it, and where it happens in the cell.
This simple example illustrates how Gene Ontology (GO) adds clarity and standardization when describing gene functions.
Now that we understand what GO mean, let’s explore what is GO Enrichment analysis in the context of scRNA-Seq data.
We start the analysis by selecting a list of differentially expressed genes (marker genes). Marker genes could be a list of differentially expressed genes between 2 different conditions or between different cell types.
Each gene can be "tagged" with one or more GO terms, similar to how books are categorized by genre or topic.
We then count the number of times each GO term shows up in your list of genes. For example, GO term A appears in 2 out of 3 genes
After transforming the long list of marker genes into a shorter list of biological themes in the form of GO terms, we can proceed with the interpretation of the results. This can be done by visualizing the most common themes to identify patterns or relationships between the GO terms. Additionally, we can analyze the GO hierarchy, where higher-level categories (parent terms) provide broader biological contexts, while lower-level categories (child terms) offer more specific insights. We can also relate the enriched GO terms to existing biological knowledge.
In a study by Zúñiga-León et al. (2018) they explored the differentially expressed platelet proteins in early-stage cancer in comparison to a control group.
The study revealed that out of 4,384 unique proteins expressed in platelets, 85 proteins showed significant differences in abundance in early-stage cancer patients compared to controls. This highlights the potential of platelets as biomarkers for early cancer detection.
The enrichment analysis identified 19 enriched biological processes, it also uncovered six enriched molecular functions.
Non-small cell lung cancer (NSCLC) is the most prevalent lung cancer. This study aims to identify gene biomarkers for NSCLC diagnosis and prognosis using single-cell RNA sequencing (scRNA-seq) data and bioinformatics techniques.
158 differentially expressed genes (DEGs) were identified, comprising 48 upregulated and 110 downregulated genes.
Gene Ontology enrichment was then conducted on the differentially expressed genes (Marker genes), statistical testing was performed on each GO term and only GO terms with P-value < 0.01 were selected.
GO enrichment analysis helps identify which biological processes, molecular functions, and cellular components are significantly represented in a set of genes.
The analysis employs statistical methods to determine whether the observed enrichment of GO terms is greater than what would be expected by chance.
This material is the result of a collaborative work. Thanks to the Galaxy Training Network and all the contributors!
Author(s) |
![]() ![]() |
Reviewers |
|
Tutorial Content is licensed under Creative Commons Attribution 4.0 International License.
Before diving into this slide deck, we recommend you to have a look at:
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |