Hi everyone!
I am new in bioinformatics field and my problem is that I have a RNA-seq data, which includes transcript IDs and gene IDs (from enasmbl database) and I wanted to check transcript lengths for every transcript. I was wondering if it is possible using any Python/R tools and if so could anyone explain rather simply how to perform it? Thank you for any help!
PS. I want to do thtat, because I have FPKM values, which I want to convert back to raw counts in order to perform differential expression analysis using DESeq2.
I am also working with ensmble transcripts and I need length for every transcripts. Is there any downloadable file which contains all ensemble transcript length?
I don't think there is a file readily available to download. You can either follow the BioMart instructions above or use
seqkit(LINK) to get that information from Ensembl cDNA file found here (for human as example).complete fasta names
OR
just the accession numbers
You will get the lengths in column 2