Cell Ranger

Download Presentation: General introduction

Download Presentation: Introduction to scRNA-seq

Download Presentation: Introduction to Cell Ranger

Learning outcomes

After having completed this chapter you will be able to:

  • Explain what kind of information single-cell RNA-seq (scRNA-seq) can give you to answer a biological question
  • Describe essential considerations during the design of a single-cell RNA-seq experiment
  • Describe the pros and cons of different single-cell sequencing methods
  • Use cellranger to:
    • To align reads and generate count tables
    • Perform basic QC on alignments and counts

Running cellranger count

The exercises below assume that you are enrolled in the course, and have access to the server. These exercises are not essential to run for the rest of the course, so you can skip them. If you want to do them anyway, you can try to install cellranger locally (only on Linux or WSL). In addition, you will need to download the references. You can get it by with this code (choose your OS):

Cloud server/Linux/MacOS/WSL

Run the following commands in the terminal or other command line prompt application:

    wget https://single-cell-transcriptomics.s3.eu-central-1.amazonaws.com/cellranger_index.tar.gz
    tar -xvf cellranger_index.tar.gz
    rm cellranger_index.tar.gz

Windows (only relevant if working locally)

If you are working on the cloud server, follow the instructions above. Download using the link, and unpack in your working directory. This will download and extract the index in the current directory. Specify the path to this reference in the exercises accordingly.

Data overview

Have a look in the directory course_data/reads and cellranger_index. In the reads directory you will find reads on one sample: ETV6-RUNX1_1. In the analysis part of the course we will work with six samples, but due to time and computational limitations we will run cellranger count on one of the samples, and only reads originating from chromsome 21 and 22.

The input you need to run cellranger count are the sequence reads and a reference. Here, we have prepared a reference only with chromosome 21 and 22, but in ‘real life’ you would of course get the full reference genome of your species. The reference has a specific format. You can download precomputed human and mouse references from the 10X website. If your species of interest is not one of those, you will have to generate it yourself. For that, have a look here.

To be able to run cellranger in the compute environment, first run:

export PATH=/data/cellranger-8.0.1:$PATH

Have a look at the documentation of cellranger count (scroll down to Command-line argument reference).

You can find the input files here:

  • reads: course_data/reads/ (from the downloaded tar package in your home directory)
  • pre-indexed reference: cellranger_index

Exercise 1: Fill out the missing arguments (at FIXME) in the script below, and run it:

cellranger count \
--id=FIXME \
--sample=FIXME \
--transcriptome=FIXME \
--fastqs=FIXME \
--localcores=4 \
--create-bam=true

This will take a while…

Once started, the process will need approximately 15 minutes to finish. Have a coffee and/or have a look at the other exercises.