Hi,
I have RNA-seq data for a cell line that was transformed by viral transfection with the SV40 virus, I would like to calculate the expression levels of the viral gene large-T in these samples. I have started out by downloading the viral genome from ncbi and merged it with the human genome by adding it to the FASTA file as an extra chromosome. I have built a bowtie index and mapped the reads with bowtie, and now I wonder how to proceed and evaluate how many reads have mapped to the viral genome.
Frida
The third column of SAM/BAM file tells the reference sequence name. You can filter based on third column and count the reads. Lets say your "viral chromosome" name in fasta file is ">SV40", the following command gives you the number of reads mapped to "viral chromosome"
Hi,
What is the different between
and
The last one just lists all the chromosomes.