User: spaladug
spaladug • 10
- Reputation:
- 10
- Status:
- New User
- Location:
- United States
- Last seen:
- 1 month ago
- Joined:
- 5 years, 4 months ago
- Email:
- s*******@gmail.com
Posts by spaladug
<prev
• 9 results •
page 1 of 1 •
next >
0
votes
1
answer
4.8k
views
1
answers
... You should use TPM values for within sample comparison. ...
written 7 weeks ago by
spaladug • 10
1
vote
1
answer
877
views
1
answer
... I have set of subject sequences (25 to 40 bps long). My query sequences are 200-300 bp long. I am trying to find the location of short subject sequences in long query sequences. This is the exact inverse of the alignment problem, where we have long reference and we have short reads that we align to ...
0
votes
3
answers
1.7k
views
3
answers
... sorry, fixed the code. it should be record.format, It looks like most of the aligners ignore the third line and I haven't faced any problems with this format in our pipeline. You are right I cannot use SeqIO anymore on this record. But I don't intend to use Biopython after this step is done. ...
written 4.2 years ago by
spaladug • 10
1
vote
3
answers
1.7k
views
3
answers
... Here is my python based solution to this, if anyone else is looking to modify the line with "+" sign:
with open(fh, 'w') as unique_file:
for record in SeqIO.parse(readsFile, "fastq"):
rec = record.format("fastq").splitlines()
#compute your stats for this rec ...
written 4.2 years ago by
spaladug • 10
0
votes
3
answers
1.7k
views
3
answers
... Not an incompatible version of fastq, I will be adding info next to + sign, so all the other aligner tools will still work as most of the tools ignore info in the third line. We have tools in our own custom pipeline that read info next to the + sign to compute stats. ...
written 4.2 years ago by
spaladug • 10
0
votes
3
answers
1.7k
views
3
answers
... I need to store information pertaining to each read in the third line. We have other tools in the pipeline that read the third line in each sequence read to compute some stats. ...
written 4.2 years ago by
spaladug • 10
17
votes
3
answers
1.7k
views
8 follow
3
answers
... Is there a way to update the third line in a fastq file (i.e, lines starting with +) programmatically using Biopython? I can parse the file using SeqIO but I don't know which field to update.
Thanks ...
written 4.2 years ago by
spaladug • 10
• updated
4.2 years ago by
Alex Reynolds ♦ 31k
0
votes
1
answer
2.1k
views
1
answers
... The following code should do it:
vcf <- readVcf(x, genome = genome)
vr <- as(vcf, "VRanges") ...
written 4.9 years ago by
spaladug • 10
0
votes
0
answers
1.7k
views
0
answers
... I am trying to use StringTie for transcript assembly and I am getting a strange error. I was able to run StringTie successfully for two of my samples and I am getting the following error when I run it on my third sample:
"Error at GBitVec: index 7 out of bounds (size 7)"
Has anybody seen this erro ...
Latest awards to spaladug
Popular Question
2.7 years ago,
created a question with more than 1,000 views.
For RnaSeq StringTie runtime error
Popular Question
4.2 years ago,
created a question with more than 1,000 views.
For RnaSeq StringTie runtime error
Use of this site constitutes acceptance of our User
Agreement
and Privacy
Policy.
Powered by Biostar
version 2.3.0
Traffic: 2132 users visited in the last hour