How to identify the list of genes present in whole genome sequencing data?
6.1 years ago
mail2steff ▴ 70

Dear All,

I am new to whole genome sequencing analysis. I have FASTQ files for two samples. I ran FastQC and then aligned the reads with Bowtie2 against the papaya reference genome, which gave me a SAM file. How can I computationally identify the genes present in my data, along with their locations? I need the complete gene list so that I can find GO annotations.

whole genome sequencing samtools gene prediction • 1.9k views

Hi, I have a related problem: I want to extract the gene list from 200 E. coli GTF files, but how can I separate out the genes that are not annotated, or that are present in the GFF file without a gene name?

Thank you

6.1 years ago
JC 13k

If you are using a reference genome, there is probably a gene annotation file (GTF or GFF3) for it. Search Phytozome for the annotations and then look at gene coverage; for that you can use Bamtools or similar tools.
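To illustrate what "gene coverage" means here, this is a minimal Python sketch that counts aligned reads overlapping each gene. It is not a replacement for Bamtools or similar tools. The function name `count_reads_per_gene` and the tuple layout for genes are made up for this example, and the read end is approximated as start plus read length, ignoring the CIGAR string, so counts near indels or clipped reads may be slightly off.

```python
from collections import defaultdict


def count_reads_per_gene(sam_lines, genes):
    """Count mapped reads that overlap each gene.

    sam_lines: iterable of SAM text lines (header lines start with '@').
    genes: list of (contig, start, end, gene_id) tuples, 1-based inclusive,
           as they appear in a GFF3 file (hypothetical layout for this sketch).
    """
    # Group genes by contig so each read is only compared to genes on its contig.
    by_contig = defaultdict(list)
    for contig, start, end, gene_id in genes:
        by_contig[contig].append((start, end, gene_id))

    counts = defaultdict(int)
    for line in sam_lines:
        if line.startswith("@"):
            continue  # skip SAM header
        cols = line.rstrip("\n").split("\t")
        if len(cols) < 11:
            continue  # not a complete alignment record
        flag, contig, pos, seq = int(cols[1]), cols[2], int(cols[3]), cols[9]
        if flag & 4 or contig == "*":
            continue  # read is unmapped
        # Assumption: approximate the aligned span by the read length.
        read_end = pos + len(seq) - 1
        for start, end, gene_id in by_contig.get(contig, []):
            if pos <= end and read_end >= start:
                counts[gene_id] += 1
    return dict(counts)
```

Genes with a count of zero simply do not appear in the result, so comparing the returned keys with the full annotation gives the genes that have no reads. For real data sets a sorted BAM file and an interval-aware tool will be much faster than this nested loop.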


I downloaded the papaya reference genome as a FASTA file from NCBI and did the alignment using Bowtie2.

##species Carica papaya
contig_23646    phytozomev11    gene    5096    5506    .    -    .    ID=evm.TU.contig_23646;Name=evm.TU.contig_23646
contig_23646    phytozomev11    mRNA    5096    5506    .    -    .    ID=PAC:16429340;Name=evm.TU.contig_23646.3;pacid=16429340;longest=1;Parent=evm.TU.contig_23646
contig_23646    phytozomev11    CDS     5096    5506    .    -    0    ID=PAC:16429340.CDS.1;Parent=PAC:16429340;pacid=16429340

I downloaded the GFF3 file from Phytozome, but it does not contain any gene names.
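Even without gene symbols, the `gene` rows of a GFF3 file already give a gene list with locations, keyed by the `ID` attribute. A minimal Python sketch, assuming the real file is tab-delimited as the GFF3 format requires (the excerpt above is shown with spaces); the function name `parse_gff3_genes` and the output dictionary keys are made up for this example:

```python
def parse_gff3_genes(lines):
    """Return one dict per 'gene' feature: id, name, contig, start, end, strand."""
    genes = []
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and directives such as ##species
        cols = line.split("\t")
        if len(cols) != 9 or cols[2] != "gene":
            continue  # keep only well-formed gene rows (skip mRNA, CDS, ...)
        # Column 9 holds key=value pairs separated by ';'
        attrs = dict(f.split("=", 1) for f in cols[8].split(";") if "=" in f)
        genes.append({
            "id": attrs.get("ID", ""),
            "name": attrs.get("Name", ""),
            "contig": cols[0],
            "start": int(cols[3]),
            "end": int(cols[4]),
            "strand": cols[6],
        })
    return genes


# Example using the gene row quoted in this thread.
example = [
    "##species Carica papaya",
    "contig_23646\tphytozomev11\tgene\t5096\t5506\t.\t-\t.\t"
    "ID=evm.TU.contig_23646;Name=evm.TU.contig_23646",
]
for gene in parse_gff3_genes(example):
    print(gene["id"], gene["contig"], gene["start"], gene["end"], gene["strand"])
```

In this annotation the `ID` and `Name` of a gene are the same locus identifier, so functional names have to come from a separate annotation table joined on that identifier.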


The file Cpapaya_113_ASGPBv0.4.annotation_info.txt contains the annotation and the mapping to Arabidopsis.
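A table like that can be joined to the gene IDs from the GFF3 file. The Python sketch below is only a rough outline: the exact column layout of the annotation_info file is not shown in this thread, so the column positions are passed in as parameters (`id_col`, `value_col`) and must be checked against the actual file. The function name `load_annotation` is hypothetical, and it assumes a tab-delimited file with comma-separated terms in the chosen value column.

```python
import csv


def load_annotation(lines, id_col, value_col, delimiter="\t"):
    """Map each gene/locus identifier to the set of terms in one column.

    id_col and value_col are 0-based column indices. They are assumptions
    to be verified by looking at the real file, not a documented layout.
    """
    terms_by_gene = {}
    for row in csv.reader(lines, delimiter=delimiter):
        if not row or row[0].startswith("#"):
            continue  # skip blank rows and a '#'-prefixed header, if any
        if len(row) <= max(id_col, value_col):
            continue  # row is too short to hold both columns
        terms = {t.strip() for t in row[value_col].split(",") if t.strip()}
        terms_by_gene.setdefault(row[id_col], set()).update(terms)
    return terms_by_gene
```

With the real file this could be called as `load_annotation(open(path), id_col=..., value_col=...)`, and the resulting dictionary looked up with the `ID` values from the GFF3 `gene` rows. Several transcripts of one locus are merged into a single set of terms.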
