Extracting a specific chromosome from a FASTQ file

6.4 years ago

Sakhaa ▴ 10

Hello,

I'm a beginner in bioinformatics and I want to extract chr22 from a FASTQ file. I don't know which tool I should use. Could anyone help me with that?

Thank you

alignment sequencing genome sequence gene • 3.6k views

Hello and welcome to Biostars, Sakhaa,

FASTQ files don't contain any information about the genomic origin of the reads. You first have to align the reads to the reference genome, which gives you a BAM file. From there you can extract anything specific to a chromosome.
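To see this for yourself, you can print just the read names. The snippet below is a minimal sketch that assumes an uncompressed FASTQ file with standard four-line records; the function name `show_read_names` and the file name `reads.fastq` are placeholders, not anything from your data.

```shell
# Print the header line of the first N reads (default 5).
# Assumes four lines per record: header, sequence, "+", qualities.
show_read_names() {
    awk -v n="${2:-5}" 'NR % 4 == 1 { print; if (++count >= n) exit }' "$1"
}

# Example call (file name is a placeholder):
# show_read_names reads.fastq 5
```

The headers are read identifiers assigned by the sequencer; they typically carry no chromosome or position, which is why the alignment step is needed.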

fin swimmer

6.4 years ago by finswimmer 16k

A FASTQ file contains reads from the whole genome (unless you sequenced only chr22), so you have to align the reads against the genome to classify them by genomic location; then you can extract the reads of interest.

1. Read about bowtie2 to align your sequences.
2. Read about SAM/BAM files (the output of the bowtie2 aligner) and samtools, and how to extract data from a specific location.
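The two steps could look roughly like the sketch below. It assumes single-end reads, an uncompressed reference FASTA, and that bowtie2 and samtools are installed; the function name `extract_chr_reads`, the index prefix `genome_index`, and all file names are placeholders. The chromosome name has to match the reference headers exactly (for example `chr22` in some references and `22` in others).

```shell
# Sketch: align single-end reads, then keep only those mapped to one chromosome.
extract_chr_reads() {
    if [ "$#" -ne 3 ]; then
        echo "usage: extract_chr_reads <genome.fa> <reads.fastq> <chromosome>" >&2
        return 1
    fi
    genome=$1
    reads=$2
    chrom=$3

    # 1. Build the bowtie2 index (only needed once per reference).
    bowtie2-build "$genome" genome_index || return 1

    # 2. Align the reads; bowtie2 writes SAM to the file given with -S.
    bowtie2 -x genome_index -U "$reads" -S aligned.sam || return 1

    # 3. Sort by coordinate and index the BAM so regions can be queried.
    samtools sort -o aligned.sorted.bam aligned.sam || return 1
    samtools index aligned.sorted.bam || return 1

    # 4. Keep only alignments on the requested chromosome.
    samtools view -b -o "${chrom}.bam" aligned.sorted.bam "$chrom" || return 1

    # 5. Convert those alignments back to FASTQ.
    samtools fastq "${chrom}.bam" > "${chrom}.fastq"
}

# Example call (commented out so that sourcing this file runs nothing):
# extract_chr_reads genome.fa reads.fastq chr22
```

For paired-end data the alignment and FASTQ export steps take different options, so check the bowtie2 and samtools manuals before adapting this.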

6.4 years ago by Buffo ★ 2.4k

Is it a FASTA or a FASTQ file? Please post the output of head your.file. If it is FASTA, it looks like this:

>name of sequence
sequence
>name of next sequence
next sequence

then use samtools faidx. You can find details via Google and the search function. This has been asked many times before, e.g. Filtering Fasta Sequences By Chromosomes Names From A Big Fasta File
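As a rough sketch of both checks: the first function guesses the format from the first character of the file, and the second pulls one named sequence out of a FASTA file with samtools faidx. The function names and file names are placeholders, the input is assumed to be uncompressed, and the sequence name must match the FASTA header exactly.

```shell
# Guess the format from the first character of the first line:
# FASTA headers start with ">", FASTQ headers start with "@".
guess_format() {
    first_char=$(head -n 1 "$1" | cut -c 1)
    case "$first_char" in
        ">") echo fasta ;;
        "@") echo fastq ;;
        *)   echo unknown ;;
    esac
}

# Extract one named sequence from a FASTA file.
# samtools faidx creates the .fai index if it does not exist yet.
extract_fasta_seq() {
    samtools faidx "$1" "$2" > "${2}.fa"
}

# Example calls (file names are placeholders):
# guess_format your.file
# extract_fasta_seq genome.fa chr22
```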

6.4 years ago by ATpoint 88k
