Question: How create annotation file for WGCNA for RNA-Seq data?
gravatar for mathavanbioinfo
4 months ago by
mathavanbioinfo50 wrote:

Hello Biostars, I am currently working on WGCNA analysis for RNA-Seq data, the input gene count matrix files contain Ensemble gene ids and MSTRG ids. Now I want to create the annotation files. I have stringTie merged files, can I use it as annotation file?

If possible means that file contains many repetitions gene symbols and Ensembl ids. My query is, How to extract ensemble gene ids and corresponding gene names from the StringTie file into separate CSV files.

annotation file format

S.No  Ensembl id     Gene name
annotaion file rna-seq wgcna • 176 views
ADD COMMENTlink modified 4 months ago by Kevin Blighe67k • written 4 months ago by mathavanbioinfo50

Which StringTie files have you? - please elaborate. Also, obviously, a pre-requisite to using WGCNA is to have already completed the WGCNA tutorials.

ADD REPLYlink written 4 months ago by Kevin Blighe67k

Thank you very much for your response I used to follow this protocol at one step I have to make stringTie merge to combine all the samples with the reference annotation file of h38 downloaded from Ensembl ftp.

$ stringtie --merge -p 8 -G chrX_data/genes/chrX.gtf -o stringtie_merged.gtf chrX_data/mergelist.txt

Please refer this paper: Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown

ADD REPLYlink written 4 months ago by mathavanbioinfo50
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1234 users visited in the last hour