Suppose I supply Newbler with my own trimming points, so that a new sff file carrying this information is created from the original sff file. I also supply files with adaptor and possible contaminant sequences through the -vs and -vt command-line options.
In this case will my trimming points appear under the columns "Orig Trimpoints Orig Trimmed Length", and any alterations Newbler does based on the -vs and -vt files I supplied, under the columns "Trimpoints Used Used Trimmed Length"?
Or do the original trimming points, i.e. the ones the 454 sequencing platform originally set by using lower-case characters, still prevail and appear under "Orig Trimpoints Orig Trimmed Length", while the ones I set appear under "Trimpoints Used Used Trimmed Length"?
That seems to be the case with my data, but I don't have access to the cluster where I performed the runs with Newbler, so I couldn't confirm it.
Do you know exactly how -vs and -vt work, i.e. does Newbler just check whether the sequences in these files are wholly contained in the reads, or does it perform alignments that account for gaps and indels?
In any case, if Newbler thinks that the adaptor or contamination is in the middle of a read, will it split the read or exclude it?
Sometimes the reads, as set by the trimming points, are only three base pairs shorter than the originals that I supposedly defined. Others have equal start and end trimming points, as if they had been excluded. My guess is that this is because of those parameters, but I am unsure.
As an aside, thanks for your blog on Newbler, it is really informative!
I don't know exactly how -vs and -vt work (there is no description in the manual). With -vt, Newbler only looks at the beginning and end of each read to see whether an adaptor is present (surely at some identity below 100%) and removes only that part of the read. With -vs, the whole read is matched, and the read is either removed or kept. If the start and end trimming points are equal, the read has not been excluded: excluded reads no longer show up in the TrimStatus file.
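To make the difference between the two options concrete, here is a toy sketch in Python of the behaviour as I described it above. It is not Newbler's actual algorithm, which is not documented. The function names, the 0.9 identity threshold, the gap-free comparison and the screen-before-trim order are all assumptions made for illustration only.

```python
def identity(a: str, b: str) -> float:
    """Fraction of matching positions between two equal-length strings.
    Assumption: no gaps or indels are considered (a real aligner might)."""
    if not a or len(a) != len(b):
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / len(a)


def trim_ends(read: str, adaptor: str, min_identity: float = 0.9):
    """Toy -vt-like step: inspect only the two ends of the read.
    Returns 0-based, half-open (start, end) trimming points."""
    n = len(adaptor)
    start, end = 0, len(read)
    if identity(read[:n], adaptor) >= min_identity:
        start = n
    # Only test the 3' end if it does not overlap the part already trimmed.
    if end - n >= start and identity(read[end - n:], adaptor) >= min_identity:
        end -= n
    return start, end


def is_screened(read: str, contaminant: str, min_identity: float = 0.9) -> bool:
    """Toy -vs-like step: look for the contaminant anywhere in the read.
    True means the whole read is dropped."""
    n = len(contaminant)
    return any(
        identity(read[i:i + n], contaminant) >= min_identity
        for i in range(len(read) - n + 1)
    )


def process(reads: dict, adaptor: str, contaminant: str) -> dict:
    """Return trimming points for every read that survives screening.
    Screened reads are left out of the report entirely."""
    report = {}
    for accno, seq in reads.items():
        if is_screened(seq, contaminant):
            continue  # excluded: no entry at all
        report[accno] = trim_ends(seq, adaptor)
    return report


# Hypothetical sequences, invented for this example.
ADAPTOR = "ACGTACGTAC"
CONTAMINANT = "GATTACAGATTACA"
reads = {
    "r1": ADAPTOR + "GGCCGGCCGGCCGG",         # adaptor at the start only
    "r2": "GGCC" + CONTAMINANT + "GGCC",      # contaminant in the middle
    "r3": ADAPTOR,                            # nothing but adaptor
}
report = process(reads, ADAPTOR, CONTAMINANT)
for accno, (start, end) in report.items():
    print(accno, start, end, end - start)
```

In this sketch, r2 is missing from the report because it was screened out, while r3 is still listed with equal start and end points and a trimmed length of zero. That mirrors the distinction above between a read that is fully trimmed and a read that is excluded.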