Sample Processing

UMIs and deduplication

Library preparation

When included in your sample prep, the UMI will reside in the R2 reads and will be handled by the Toolkit. If you only did single end (SE) sequencing you will not have a UMI and will not be able to deduplicate reads.

How does it work?

The deduplication occurs by the barcode parser first identifying the UMI based on the position in the read and labeling it by using the XU tag for downstream processing. After reads are aligned to the reference genome, the deduplication works by finding a sequence and UMI and building a graph with corrections for potential sequencing errors in the UMI, and collapses the graph to remove those reads determined to be duplicates.

BAM tags

The process of UMI tagging is done at the BAM file level in the process UMI tagging as part of the deduplication. Where the UMI is added to this tag. There are other tags that can be used and have a full list from the SAM specification.

What if I see an error that says invalid UMI?

This can occur when someone runs SEQuoia Complete data sets in the SEQuoia Express Toolkit, or if you forgot to add the UMI in your sample prep.

Trimming

There are default trimming quality cutoffs. Our defaults are suggestions and can be modified by the user to suit their needs. After running the Toolkit for the first time, you will have the FASTQC output that will show the quality of the reads and allow for more informative trimming if low quality reads are present.

Filtering of read counts

The SEQuoia Express toolkit has an option to allow users to filter the reads based on a threshold. the option to do this is two parts:

minGeneType = "none" : this can be ["none","reads","RPKM","TPM"]
minGeneCutoff = 0 : threshold you want to use The results of this filtering are not in the report folder, instead they are put in output/SampleFiles/sample_name/RNACounts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sample Processing

UMIs and deduplication

Library preparation

How does it work?

BAM tags

What if I see an error that says invalid UMI?

Trimming

Filtering of read counts

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally