Use one few genes instead of reference genome in mapping
2
0
Entering edit mode
3.1 years ago
bcube ▴ 10

Hi.

My aim is to generate a phylogenetic tree from my analysis.

For this, I have a reference genome and Illumina reads of multiple samples.

I was thinking of using my own gene list generated from sequences downloaded from NCBI instead of using a reference genome for mapping.

Is this approach correct or do you guys have a simpler way?

Any suggestion will be much appreciated.

Thank you.

mapping read phylogenetic analysis reference genome • 1.1k views
ADD COMMENT
1
Entering edit mode
3.1 years ago

That is in essence not a bad approach, though it will also be sub-optimal (to some extent). The issue here being that when you only use a subsample to map your reads on you will never be sure perhaps some of the reads that align to your subsample would not have had a better (== true ) match against the rest of the genome/transcriptome. In theory you will have the same bias for all your samples so that might even it out a bit, but not eliminate it.

Bottom line: there is a risk to over-count the number of reads that map

ADD COMMENT
1
Entering edit mode

Understood. Will use a reference genome. Thanks.

ADD REPLY
1
Entering edit mode
3.1 years ago

Mapping to a truncated reference runs the risk of forcing reads to align where they do not belong. Don't do this. Align to a proper reference that contains all of the expected sequence.

ADD COMMENT
0
Entering edit mode

Thank you very much. I will use the reference genome for mapping.

ADD REPLY

Login before adding your answer.

Traffic: 1729 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6