fasta as input file : mapping sequences to a genome
2
1
Entering edit mode
9.7 years ago
Shripathi ▴ 10

Hi to all,

I have a fasta file having sequences (similar input file for command line blast program) that I extracted from one of Stacks outputs. I would like to align these sequences to a reference genome and use IGV like visualization tool to check the locations of these sequences. I just wonder is there any program that uses these kind of simple input file for alignment purpose.

Thank you in advance,

Shripathi

snp next-gen • 12k views
ADD COMMENT
3
Entering edit mode

Bwa seamlessly works with both fasta and fastq.

ADD REPLY
2
Entering edit mode
9.7 years ago

Blat and gmap are useful general-purpose aligners that are reasonably fast and take fasta as input. If you sequences are not too large and there are not too many of them, blat is available online at the UCSC genome browser site.

ADD COMMENT
1
Entering edit mode
9.7 years ago
Fred ▴ 780

gmap indeed allows you to align sequences in FASTA with a reference genome. It can output alignments in SAM format:

# Build reference genome database
gmap_build -D dir/ -d refgenome dir/refgenome.fa
# Alignment
gmap -D dir -d yourgenome -f samse -t 8 yourfasta.fa | samtools view -Shb - | samtools sort - alignment
samtools index aligment.bam

You can then visualize the alignment with IGV

ADD COMMENT

Login before adding your answer.

Traffic: 1478 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6