Convert colorspace fastq to basespace fastq
1
0
Entering edit mode
5.5 years ago
aflrios • 0

Convert Solid color space fastq in base space fastq

Hi all I am a beginner in NGS. I downloaded some SRA files (from SRA-NCBI) and, I convert in ABI-solid fastq using fastq-dump. I would like to convert these ABI-solid files in Illumina fastq format. Does anyone know of any program that does this? Thank you. Álvaro.

RNA-Seq • 2.8k views
ADD COMMENT
1
Entering edit mode

Your data may already be in fastq format, if you used fastq-dump. Can you show us the output of head -8 your.fq?

ADD REPLY
0
Entering edit mode

Hi,

I would like to use this colorspace files for transcriptome assembly, but I think the softwares I am using (HISAT and StringieTie) work with base space files. So, I would like to convert colorspace to base space.

ADD REPLY
0
Entering edit mode

Assuming that the data you need is not available on any other platform, you may want to use colorspace reads directly for the analysis. No need to convert the reads. You can use Tophat with bowtie1 option. Use bowtie version 0.12.7 or lower.

ADD REPLY
0
Entering edit mode

You can use Subjunc and then proceed with StringTie, it has been reported to work:

Whole RNA-Sequencing and Transcriptome Assembly of Candida albicans and Candida africana under Chlamydospore-Inducing Conditions

Question: Clarification of good pipeline for transcriptome assembly of RNA-seq data

I've updated my answer bellow to clarify why it is not a good idea to convert colorspace to basespace.

ADD REPLY
0
Entering edit mode
5.5 years ago
h.mon 35k

First, do you really need to work with those Solid reads? It is a defunct platform, with not much software available.

I would advise against converting colorspace to basespace, use a mapper / assembler that can take colorspace reads. For RNAseq, Subread and Subjunc (for spliced alignment) can map colorspace reads.

Due to how colorspace reads are encoded, if you convert a colorspace read with a sequencing error to basespace, the error causes a "frameshift" and all subsequent bases are also wrongly converted - this could affect adversely both mapping and (to a greater extent) de novo assembly. However, if you map in colorspace, sequencing errors can detected - as they can be differentiated from true SNPs - and corrected.

See better explanations at:

A: convert SOLID color space to base-space

http://seqanswers.com/forums/showpost.php?p=4977&postcount=2

ADD COMMENT

Login before adding your answer.

Traffic: 2970 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6