Question: how to get percent identity between two RNA-seq fastq files?
Asked 12 months ago by lkianmehr (France):

Hello, does anybody know how to get the percent identity between two or more RNA-seq FASTQ files, or how to show how many of the sequences in one file are also found in another?

Thanks in advance.

rna-seq percent identity • 328 views
Answered 12 months ago by Pierre Lindenbaum (France/Nantes/Institut du Thorax - INSERM UMR1087):
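 # Extract the sequence line of every read, sort both lists, and let comm
 # compare them; column 3 of comm (two leading tabs) holds the shared lines.
 # Prints: shared lines / all lines printed by comm.
 comm \
      <(gunzip -c f1.fq.gz | paste - - - - | cut -f 2 | sort) \
      <(gunzip -c f2.fq.gz | paste - - - - | cut -f 2 | sort) | \
      awk -F '\t' '{if($1=="" && $2=="") {C+=1.0;};} END {print C/NR;}'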
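
To see what this kind of pipeline computes, here is a minimal sketch on two tiny made-up sequence lists instead of real FASTQ files. The function name `shared_fraction` and the toy sequences are assumptions for illustration only, and temporary files are used in place of process substitution so the sketch also runs in a plain `sh`.

```shell
#!/bin/sh
# Toy demonstration: fraction of comm's output lines that are shared.
# shared_fraction is a hypothetical helper, not part of any tool.
shared_fraction() {
    # $1 and $2: text files with one read sequence per line
    a=$(mktemp)
    b=$(mktemp)
    sort "$1" > "$a"
    sort "$2" > "$b"
    # comm prints three tab-separated columns: only in $1, only in $2, in both.
    # A line with empty fields 1 and 2 is therefore a shared sequence.
    comm "$a" "$b" |
        awk -F '\t' '$1 == "" && $2 == "" { c++ }
                     END { if (NR > 0) print c / NR; else print 0 }'
    rm -f "$a" "$b"
}

# Made-up sequences, for illustration only.
tmp=$(mktemp -d)
printf 'AAAA\nCCCC\nGGGG\n' > "$tmp/a.txt"
printf 'CCCC\nGGGG\nTTTT\nACGT\n' > "$tmp/b.txt"

result=$(shared_fraction "$tmp/a.txt" "$tmp/b.txt")
echo "$result"   # 2 shared lines out of 5 lines printed by comm
rm -rf "$tmp"
```

The denominator is the number of lines comm prints, that is, every sequence from either file with each shared one listed once, so the value is a share of the combined list rather than of either file alone.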

I ran that command on two FASTQ files sequenced from the same library and got a single value, 0.119817. What does it mean? Does it mean the files differ by only 0.1 percent?

Reply written 12 months ago by lkianmehr (modified 12 months ago)

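About 1/10 of the reads are in common. The value is a fraction, not a percent difference: roughly 12% of the lines printed by comm (all read sequences from both files, with shared ones listed once) occur in both files.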

Reply written 12 months ago by Pierre Lindenbaum
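
For the second half of the original question (how many sequences of one file are found in the other), a directional count may be easier to read than a single combined ratio. The following is only a sketch under stated assumptions: `fraction_found` is a hypothetical helper, both inputs are assumed to be gzip-compressed FASTQ with exactly four lines per record, matching is by exact sequence, and all sequences of the second file are held in memory by awk.

```shell
#!/bin/sh
# Sketch: fraction of reads in one FASTQ whose exact sequence also occurs
# (at least once) in another FASTQ. fraction_found is a hypothetical helper.
fraction_found() {
    # $1: query FASTQ (gzip), $2: reference FASTQ (gzip); 4 lines per record
    q=$(mktemp)
    r=$(mktemp)
    gunzip -c "$1" | awk 'NR % 4 == 2' > "$q"   # keep sequence lines only
    gunzip -c "$2" | awk 'NR % 4 == 2' > "$r"
    # First pass remembers every reference sequence, second pass counts hits.
    awk 'NR == FNR { seen[$0] = 1; next }
         { n++; if ($0 in seen) hit++ }
         END { if (n > 0) print hit / n; else print 0 }' "$r" "$q"
    rm -f "$q" "$r"
}

# Made-up reads, for illustration only.
tmp=$(mktemp -d)
printf '@a1\nACGT\n+\nIIII\n@a2\nGGGG\n+\nIIII\n@a3\nTTTT\n+\nIIII\n@a4\nACGT\n+\nIIII\n' | gzip -c > "$tmp/f1.fq.gz"
printf '@b1\nACGT\n+\nIIII\n@b2\nCCCC\n+\nIIII\n' | gzip -c > "$tmp/f2.fq.gz"

frac=$(fraction_found "$tmp/f1.fq.gz" "$tmp/f2.fq.gz")
echo "$frac"   # 2 of the 4 reads in f1 (both ACGT) occur in f2
rm -rf "$tmp"
```

Swapping the two arguments gives the fraction in the other direction, and the two values generally differ when the files have different sizes. Exact matching ignores sequencing errors and reverse complements, so this is a measure of shared identical reads, not of alignment-based percent identity.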