Primary Analysis
A. Prepare SampleInformation.tab file
- Current SampleInformation.tab file is a tab delimited text file. This file could be edited in a plain text editor or a spreadsheet program. Update this file when a new sample is added to the project.
B. Prepare the input data files, output folders and the swarm job file
- Make sure you have the Results/cellranger folder - to store your results
- The Tools/prep.sh file is a bash script file that should be run first, to prepare files/folders and to generate the swarm job file.
- Open the prep.sh file in a plain text editor and edit the "samples" variable to update new sample names as it is entered in the SampleInformation.tab file.
- Open a shell/terminal, to change to the Tools directory, run the below command
cd /data/../TotalSeq/Tools/
- Run the prep.sh script using the below command
bash prep.sh
- Check the Results folder to make sure you have directories for new samples and that the directory has corresponding fastq files and library file
- Check the Tools folder to make sure you have the "totalseqall.swarm" file
C. Primary analysis - barcode/umi/cell counting using cellranger
- Change to the Tools directory using the below command;
cd /data/../TotalSeq/Tools/
- Run the swarm job using the below command;
swarm -f totalseqall.swarm -g 64 -t 28 --gres=lscratch:200 --partition=norm --module cellranger
Note
Note down your job id
- Check your job status using the "jobload" command or the squeue command;
squeue --job 14354599
- Extend the swarm job running time using the below command;
newwall --jobid 13558558 --time 10:00:00
D. Organize the outputs and cleanup
- Once the swarm job is complete (should take about three hours), change to the Tools directory using the below command;
cd /data/../TotalSeq/Tools/
- Open the "organizeout.sh" script in text editor and updated the variable "samples" with your new sample names
- Run the "organizeout.sh" script using the below command;
bash organizeout.sh
- Check out the Results folder to make sure you have the expected output files.
Example CellRanger Output location for Sample NS3R189BTS
/data/../TotalSeq/Results/cellranger/NS3R189BTS/cellranger_output/
E. Aggregate cellranger summary outputs
- Login to biowulf and request an interactive node using the below 'sinteractive' command, with 64 cores, 160gb RAM and 300gb of storage;
sinteractive --cpus-per-task=64 --mem=160g --gres=lscratch:300
- Once the interacive node is assigned, change to the Tools directory using the below command;
cd /data/../TotalSeq/Tools/
- Load the R module using the below command;
module load R
- Once the R module is loaded, run the 'cellrangerReport.R' script using the below command;
Rscript cellrangerReport.R
- The cellranger report is now available in the "Reports" folder. A link to this report is also given here
Software
- cellranger 6.0.0
- R 4.0.5
- mkdocs 1.1.2