Question: How to handle 'N' in nucleotide/gene sequences retrieved from NCBI GenBank?
11 months ago, ammaraakhtar3 wrote:

Some sequences retrieved from NCBI contain the letter 'N', which means that the base at that position could not be determined, leaving an unidentified nucleotide. Should I replace N with another base (A, G, T, or C), assuming N can be any nucleotide, or should I exclude such sequences, assuming the sequencing was not of good quality? If neither, what can I do with such sequences in my dataset?

modified 11 months ago by Pierre Lindenbaum • written 11 months ago by ammaraakhtar3

Context is missing. What is the purpose of those sequences? See: How To Ask Good Questions On Technical And Scientific Forums

written 11 months ago by Pierre Lindenbaum

Most good tools will allow Ns in the sequences, but it depends on what you intend to do with them.

The one thing you almost certainly should not do is replace them with a random nucleotide.

written 11 months ago by Joe
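
If the downstream analysis does need a quality cut-off, one option that avoids inventing bases is to measure how much of each sequence is N and set aside only the worst records. The snippet below is a minimal sketch of that idea in plain Python. The function names, the toy records, and the 10% threshold are all assumptions made for illustration, not values recommended anywhere in this thread; a sensible cut-off depends on the purpose of the sequences.

```python
def n_fraction(seq: str) -> float:
    """Return the fraction of positions in seq that are N (case-insensitive)."""
    if not seq:
        return 0.0
    return seq.upper().count("N") / len(seq)


def filter_by_n_content(records, max_n_fraction=0.10):
    """Split (name, sequence) pairs into kept and dropped lists.

    max_n_fraction is an arbitrary illustrative threshold (assumption).
    Ns are left in place in the kept sequences; nothing is substituted.
    """
    kept, dropped = [], []
    for name, seq in records:
        if n_fraction(seq) <= max_n_fraction:
            kept.append((name, seq))
        else:
            dropped.append((name, seq))
    return kept, dropped


# Hypothetical toy records, not real GenBank entries.
records = [
    ("seq1", "ACGTACGTACGTACGTACGT"),  # no Ns
    ("seq2", "ACGTNACGTACGTACGTACG"),  # 1 N in 20 bases
    ("seq3", "ACNNNNNNGTACGTNNNNAC"),  # 10 Ns in 20 bases
]

kept, dropped = filter_by_n_content(records)
for name, seq in records:
    print(name, f"{n_fraction(seq):.2f}")
```

Sequences that pass the filter keep their Ns untouched, which fits the point above that most tools accept Ns as they are.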