Analyzing Data

The MOST IMPORTANT thing you must look out for on the Accession Display page is the blue "Analyze with GEO2R" button at the bottom of the page. You are only able to perform the data analysis described in this lesson if the blue button is there. Not every dataset has the blue button.

Now that we have our dataset (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE164805), the next step is to analyze the data.

Sample raw data from dataset GSE137140

Usually, the raw data is a huge file with potentially thousands of rows and columns. It's not easy to work with as a beginner and without using code. Luckily, the awesome "Analyze with GEO2R" button can do the analysis for us!

From the accession page, click on the blue Analyze with GEO2R button.

You should see the following page.

This page will allow you to split the samples into an experimental group and a control group (you have to tell the GEO2R program what to compare with what).

ALWAYS define the EXPERIMENTAL/DISEASE/MUTANT group FIRST!!! This is so the system can properly define what is positive expression and negative expression.

First, click on the Title column bar to automatically alphabetically sort your samples. It makes it easier to group the samples by type.

This video shows the steps to define sample groups. Sorry for the low resolution.

In order to select more than one sample at once, simply click on one sample, hold the shift button, and click on another sample. All the samples in between will get selected as well.

Your screen should look like this. In this example, we are trying to study the differences in gene expression between COVID-19 patients and normal controls.

After splitting your data into 2 groups (10 COVID-19 and 5 controls), click the blue Analyze button (you may need to scroll down the screen).

It may take a few minutes but eventually GEO2R will display the results from your analysis.

Go to the next lesson to learn how to interpret these results.