Convert colorspace fastq to basespace fastq
1
0
Entering edit mode
3.1 years ago
aflrios • 0

Convert Solid color space fastq in base space fastq

Hi all I am a beginner in NGS. I downloaded some SRA files (from SRA-NCBI) and, I convert in ABI-solid fastq using fastq-dump. I would like to convert these ABI-solid files in Illumina fastq format. Does anyone know of any program that does this? Thank you. Álvaro.

RNA-Seq • 1.7k views
1
Entering edit mode

Your data may already be in fastq format, if you used fastq-dump. Can you show us the output of head -8 your.fq?

0
Entering edit mode

Hi,

I would like to use this colorspace files for transcriptome assembly, but I think the softwares I am using (HISAT and StringieTie) work with base space files. So, I would like to convert colorspace to base space.

0
Entering edit mode

Assuming that the data you need is not available on any other platform, you may want to use colorspace reads directly for the analysis. No need to convert the reads. You can use Tophat with bowtie1 option. Use bowtie version 0.12.7 or lower.

0
Entering edit mode

You can use Subjunc and then proceed with StringTie, it has been reported to work:

Whole RNA-Sequencing and Transcriptome Assembly of Candida albicans and Candida africana under Chlamydospore-Inducing Conditions

Question: Clarification of good pipeline for transcriptome assembly of RNA-seq data

I've updated my answer bellow to clarify why it is not a good idea to convert colorspace to basespace.

0
Entering edit mode
3.1 years ago
h.mon 33k

First, do you really need to work with those Solid reads? It is a defunct platform, with not much software available.

I would advise against converting colorspace to basespace, use a mapper / assembler that can take colorspace reads. For RNAseq, Subread and Subjunc (for spliced alignment) can map colorspace reads.

Due to how colorspace reads are encoded, if you convert a colorspace read with a sequencing error to basespace, the error causes a "frameshift" and all subsequent bases are also wrongly converted - this could affect adversely both mapping and (to a greater extent) de novo assembly. However, if you map in colorspace, sequencing errors can detected - as they can be differentiated from true SNPs - and corrected.

See better explanations at:

A: convert SOLID color space to base-space