Question: how to fetch the specific sequence of gene from rna seq data
0
gravatar for fatimarasool135
3 months ago by
fatimarasool1350 wrote:

can we retrived the spesific seqence of the gene from rna seq data of one species.?

For this pupose i perform the following steps but the resulyt out put file contain long string of nnnn with nuclotide sequnce

Indexing by bowtie2

bowtie2-build --large-index -f  wheat.fa  wheat

mapping by tophat

tophat -p 30 -G wheat.gff3  wheat  G1_cleaned_R1.fastq  G1_cleaned_R2.fastq -o Alingment

Get consensus fastq file

samtools mpileup -uf REFERENCE.fasta  Aceepted_hits.bam | bcftools call -c | vcfutils.pl vcf2fq > batis_cns.fastq

Convert .fastq to .fasta

seqtk seq -aQ64 baris_cns.fastq > batis_cns.fasta

cordinate fetching

zcat Triticum_aestivum.gff3.gz |grep "TraesCS5A02G213300" > coordinate.bed12

Command for sequence fetching

bedtools getfasta -fi batis_cns.fasta -bed coordinate.bed12 -fo OUTPUT-GENE-SEQUENCE.fa
rna-seq alignment gene • 185 views
ADD COMMENTlink modified 3 months ago by finswimmer11k • written 3 months ago by fatimarasool1350
2

Hello fatimarasool135 ,

  • Please use the formatting bar (especially the code option) to present your post better. I've done it for you this time.
    code_formatting

  • You should know that the old 'Tuxedo' pipeline of Tophat(2) and Cufflinks is no longer the "advisable" tool for RNA-seq analysis. The software is deprecated/ in low maintenance and should be replaced by HISAT2, StringTie and ballgown. See this paper: Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. There are also other alternatives, including alignment with STAR and bbmap, or pseudo-alignment using salmon.

Thank you!

ADD REPLYlink modified 3 months ago • written 3 months ago by finswimmer11k

Thank you please tell me steps to get the specific sequence of gene from rna seq data .like there is a gene abc in wheat i want to retrieve this abc from mine sample of rna seq .

ADD REPLYlink written 3 months ago by fatimarasool1350

You have described the procedure to get the consensus sequence for the gene you are interested in in your original post. So is there a question beyond that?

ADD REPLYlink written 3 months ago by genomax63k

I simply follow these step to get gene sequence.... i want to know how i fetch the desired seq of gene from rna seq of mine sample

ADD REPLYlink written 3 months ago by fatimarasool1350

Hi. what will the reference sequence used for mapping of rna seq read ? is it shuold be cDNA or genomic sequence?

ADD REPLYlink written 3 months ago by fatimarasool1350

Both can potentially be used. If you are not interested in discovery of novel transcripts you could map against transcriptome ( and if you are using salmon or kallisto you would go this route). General recommendation is to align against the genome and then use a GTF (gene model file) to count reads that fall under the boundaries defined in that file.

ADD REPLYlink modified 3 months ago • written 3 months ago by genomax63k

from where i can got GTF file ?

ADD REPLYlink written 3 months ago by fatimarasool1350

Looks like you have a GFF3 file. That can work.

ADD REPLYlink written 3 months ago by genomax63k

i have to generate it from tool or download it from ensembl ?

ADD REPLYlink written 3 months ago by fatimarasool1350
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2429 users visited in the last hour