Hordeum vulgare transcript annotation
19 months ago
martta95 ▴ 10

Hello,

I have a list of transcripts mapped to the Hordeum vulgare genome, e.g.:

HORVU.MOREX.r2.UnG0628590.1
HORVU.MOREX.r2.5HG0361000.1
HORVU.MOREX.r2.UnG0628880.1

I would like to add annotation for gene function and gene name. What tools can I use?

gene-annotation hordeum-vulgare • 674 views

Please don't post the same content in multiple threads. Prior thread for reference: Hordeum genes addnotation


I'm deleting the other thread as your answer here is more comprehensive than your comment there. But yes, martta95 - DO NOT open multiple threads for the same question. Next time, your duplicate posts will be deleted without warning.

19 months ago
GenoMax 146k

It looks like your IDs are from a previous genome assembly.

A. Get the annotation file from Ensembl

wget https://ftp.ensemblgenomes.ebi.ac.uk/pub/plants/release-56/gff3/hordeum_vulgare_goldenpromise/Hordeum_vulgare_goldenpromise.GPv1.56.gff3.gz
gunzip Hordeum_vulgare_goldenpromise.GPv1.56.gff3.gz

B. Extract column 9 (the attributes field, which holds the information you want) for the transcript (mRNA) lines only

awk -F "\t" '$3 == "mRNA" {print $3,$9}' Hordeum_vulgare_goldenpromise.GPv1.56.gff3 > Hordeum_col9

C. Loop through your file of IDs (as shown above) to extract the relevant lines for each transcript

for i in $(cat id); do grep -wF "${i}" Hordeum_col9; done
mRNA ID=transcript:HORVU.MOREX.r2.UnG0628590.1.mrna1;Parent=gene:HORVU.MOREX.r2.UnG0628590.1;biotype=protein_coding;tag=Ensembl_canonical;transcript_id=HORVU.MOREX.r2.UnG0628590.1.mrna1
mRNA ID=transcript:HORVU.MOREX.r2.5HG0361000.1.mrna1;Parent=gene:HORVU.MOREX.r2.5HG0361000.1;biotype=protein_coding;tag=Ensembl_canonical;transcript_id=HORVU.MOREX.r2.5HG0361000.1.mrna1
mRNA ID=transcript:HORVU.MOREX.r2.UnG0628880.1.mrna1;Parent=gene:HORVU.MOREX.r2.UnG0628880.1;biotype=protein_coding;tag=Ensembl_canonical;transcript_id=HORVU.MOREX.r2.UnG0628880.1.mrna1
mRNA ID=transcript:HORVU.MOREX.r2.UnG0634500.1.mrna1;Parent=gene:HORVU.MOREX.r2.UnG0634500.1;biotype=protein_coding;tag=Ensembl_canonical;transcript_id=HORVU.MOREX.r2.UnG0634500.1.mrna1
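
The loop in step C rescans the extracted file once per ID. As a rough sketch of a single-pass alternative, the function below loads the ID list into an awk array and reports the Name and description attributes from the gene lines of the GFF3. It relies on the listed IDs matching the gene-level ID (they appear as the Parent=gene: value in the output above). It also assumes that gene lines carry Name= and/or description= attributes in column 9, which many genes in this annotation may lack, so "NA" is printed when they are missing. The function name annotate_ids is only illustrative.

```shell
#!/usr/bin/env bash
# Sketch: one pass over a GFF3 to report Name/description for listed gene IDs.
# annotate_ids is a hypothetical helper; arguments: <ID list> <GFF3 file>.
annotate_ids() {
  awk -F '\t' '
    # First file (ID list): remember every ID.
    NR == FNR { want[$1] = 1; next }
    # Second file (GFF3): look at gene feature lines only.
    $3 == "gene" {
      id = ""; name = "NA"; desc = "NA"
      # Column 9 is a semicolon-separated list of key=value pairs.
      n = split($9, attr, ";")
      for (i = 1; i <= n; i++) {
        eq  = index(attr[i], "=")
        key = substr(attr[i], 1, eq - 1)
        val = substr(attr[i], eq + 1)
        if (key == "ID") { sub(/^gene:/, "", val); id = val }
        else if (key == "Name") name = val
        else if (key == "description") desc = val
      }
      # Print a tab-separated row for requested IDs only.
      if (id in want) print id "\t" name "\t" desc
    }
  ' "$1" "$2"
}
```

With the files from steps A and C this could be run as: annotate_ids id Hordeum_vulgare_goldenpromise.GPv1.56.gff3 > id_annotation.tsv (the output file name is arbitrary). If most rows come back as NA, the GFF3 simply does not hold functional descriptions for those genes.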

Since this does not get you much in terms of annotation, you may want to try BioMart at the Gramene site: https://ensembl.gramene.org/biomart/martview
