Question

RNA_Seq Data Analyse

0

Entering edit mode

5.7 years ago

Diango • 0

hello everybady , I'm new in rnaseq data analyse and I need to do the transcriptomic profiling between two biological conditions (wildtype and mutant) follow this plan : QC ---> indexation and mapping with STAR ----> quantication with featurecount ----> and now I would like to analyse the the featurecount.txt output in R using edgeR.

EdgeR dataset : in each featurecount output.txt I use this linux commande to :

1. cut -f1 featurecount.txt > gene_id.txt #extraire the first column called gene_id
2. cut -f7 featurecount.txt > file.txt # for each sample
3. paste gene_id file.txt(for each sample) > count.txt

I export the count.txt in using edgeR package following this script:

library(edgeR)
## Loading required package: limma
library(limma)
library(Glimma)
library(gplots)
## Attaching package: 'gplots'

library(dplyr)
#library(org.Mm.eg.db
#library(RColorBrewer
options(width = 100)

setwd("C:/Users/DIANGO/Desktop/GNF_Matrix")
wt1 <- read.delim("./GWT_vs_B6.txt", stringsAsFactors = FALSE, comment.char = "#")
dim(wt1)

## Create a new data object that contains just the counts.

countdata <- wt1[,7, drop = FALSE]
head(countdata)
dim(countdata)

## Add rownames i.e. GeneIDs to data
rownames(countdata) <- wt1[,1]
head(countdata)


# Taking a look at column names to know the sample names
colnames(countdata)

## [1] "X.data.cephfs.punim0010.projects.Kanwal_RNASeq_Testing.seqc.test.rna.seq.work.kallisto.RNA.Test.kallisto.pseudoalignment.pseudoalignments.sorted.bam"

#Renames sample name to a meaningful title
colnames(countdata) <- "WILD1"
head(countdata)

if there's anyone who can help me with this analysis or suggest other methods.

RNA-Seq R gene • 1.5k views

ADD COMMENT • link updated 5.7 years ago by brianj.park ▴ 70 • written 5.7 years ago by Diango • 0

0

Entering edit mode

Please edit your question to re-format the code. Essentially, highlight your code chunks and then click on the 101 010 button. Thanks.

ADD REPLY • link 5.7 years ago by Kevin Blighe 89k

0

Entering edit mode

My questions is juste focus to how analysis featurecount output file with edgeR or DESeq2 R package. thk @kevin for your clarification

ADD REPLY • link 5.7 years ago by Diango • 0

0

Entering edit mode

What is unclear after reading the manual from edgeR? It involves all code for a standard analysis.

ADD REPLY • link 5.7 years ago by ATpoint 89k

score 1 · Answer 1 · 2020-02-23

1

Entering edit mode

5.7 years ago

brianj.park ▴ 70

After you imported your count table into R, you have to create a DGEList object that contains your counts, gene IDs, and sample group info needed for differential expression. Follow this vigenette and it should be straightforward.