How create annotation file for WGCNA for RNA-Seq data?
0
0
Entering edit mode
3.2 years ago

Hello Biostars, I am currently working on WGCNA analysis for RNA-Seq data, the input gene count matrix files contain Ensemble gene ids and MSTRG ids. Now I want to create the annotation files. I have stringTie merged files, can I use it as annotation file?

If possible means that file contains many repetitions gene symbols and Ensembl ids. My query is, How to extract ensemble gene ids and corresponding gene names from the StringTie file into separate CSV files.

annotation file format

S.No  Ensembl id     Gene name
RNA-Seq WGCNA Annotaion file • 999 views
ADD COMMENT
1
Entering edit mode

Which StringTie files have you? - please elaborate. Also, obviously, a pre-requisite to using WGCNA is to have already completed the WGCNA tutorials.

ADD REPLY
0
Entering edit mode

Thank you very much for your response I used to follow this protocol https://www.nature.com/articles/nprot.2016.095 at one step I have to make stringTie merge to combine all the samples with the reference annotation file of h38 downloaded from Ensembl ftp.

$ stringtie --merge -p 8 -G chrX_data/genes/chrX.gtf -o stringtie_merged.gtf chrX_data/mergelist.txt

Please refer this paper: Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown

ADD REPLY

Login before adding your answer.

Traffic: 2822 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6