Question: Normalising RNA-seq samples from bam files for UCSC Genome browser visualization
1
gravatar for m93
12 months ago by
m93170
m93170 wrote:

I have just over 100 files from an RNA-seq experiment and I have working on converting these BAM files to Bigwig files for visualization on UCSC Genome Browser.

I am new to the field of RNA-seq but from what I understand, I need to scale/normalise the BAM files before conversion to bigwig format. There appear to be all sorts of ways and different methods to normalise BAM files. I have found out that bedtools can allow you to scale samples (using the -scale option) using a specific scale factor. I am tempted to use this option as I was already using bedtools in my script doing the converting of BAM files to Bigwig files.

I guess my question is: which method do I use to normalise my samples? How do I decide which scaling factor to use in my bedtools command?

Thanks.

ucsc rna-seq bigwig bam • 627 views
ADD COMMENTlink modified 12 months ago by Ian5.5k • written 12 months ago by m93170
4
gravatar for ATpoint
12 months ago by
ATpoint21k
Germany
ATpoint21k wrote:

The most convenient solution is IMHO deeptools bamCoverage. It offers multiple options for normalization, binning and strand-specificity. Have a look at the docs.

ADD COMMENTlink modified 12 months ago • written 12 months ago by ATpoint21k
1
gravatar for trausch
12 months ago by
trausch1.4k
Germany
trausch1.4k wrote:

Alfred (disclaimer: my own tool) can create UCSC browser tracks for paired-end RNA-Seq data

alfred tracks -o ucsc.bedGraph.gz input.rna.bam

The resolution parameter (-r) determines the file size (how aggressively coverage values are binned). By default, the method normalizes to ~30 million pairs.

ADD COMMENTlink written 12 months ago by trausch1.4k
1
gravatar for Ian
12 months ago by
Ian5.5k
University of Manchester, UK
Ian5.5k wrote:

I have tackled this on a small scale by retaining SAM reads used by htseq-count (--samout), and removing those excluded from the final counts, using sed '/XF:Z:$/d;/XF:Z:__/d'.

After normalisation by DESeq2 I use the scaleFactor to scale the SAM > BAM files, using bedtools genomecov (-scale).

Obviously you have the problem of running DESeq2 with many samples, and R isn't very forgiving. However, I found a thread which may be of use to you: https://support.bioconductor.org/p/73399/

ADD COMMENTlink written 12 months ago by Ian5.5k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1073 users visited in the last hour