Question: problem for setting BWA arguments for alignment
0
gravatar for modarzi
5 months ago by
modarzi20
modarzi20 wrote:

Hi,

I want to align my samples by BWA. my samples are RNA-seq and Single-End. as I read on Manual Reference Pages - bwa (1), I have to run below command:

bwa mem [-aCHMpP] [-t nThreads] [-k minSeedLen] [-w bandWidth] [-d zDropoff] [-r seedSplitRatio] [-c maxOcc] [-A matchScore] [-B mmPenalty] [-O gapOpenPen] [-E gapExtPen] [-L clipPen] [-U unpairPen] [-R RGline] [-v verboseLevel] db.prefix reads.fq [mates.fq]

all arguments in [ ] are optional but my question is about "db.prefix" argument.I don't know what it is and how can provide it? based on alignment logic, it should be reference genome but I am not sure and if it is reference I don't know how to prepare it? I appreciate if you share your comment with me.

Best Regards,

Mohammad

ADD COMMENTlink modified 5 months ago by toralmanvar650 • written 5 months ago by modarzi20
1

bwa is not an appropriate aligner for RNA-seq (unless it's prokaryotic), you should use a splice-aware aligner such as STAR or HISAT2.

ADD REPLYlink written 5 months ago by WouterDeCoster32k
0
gravatar for toralmanvar
5 months ago by
toralmanvar650
toralmanvar650 wrote:

For using bwa, you need two things:

1. reads file
2. reference sequence/s on which you want to map the reads

Now, first before using 'mem' for mapping, you have to index the reference sequences using 'bwa index' followed by bwa mem where you will have to provide a prefix of reference file indexed. For example:

bwa index ref.fa (it will generate ref.fa.amb, ref.fa.ann, ref.fa.bwt, ref.fa.pac, ref.fa.sa files )
bwa mem ref.fa reads_SE.fastq -t 30 > reads_on_reference.sam

So, here in above command "ref.fa" is your db.prefix.

ADD COMMENTlink written 5 months ago by toralmanvar650

Thanks.

my data set belong to human. I consruct bwa index(wg.fa) and for alignment I run below command:

bwa mem ~/cirRNA/apps/wg.fa sra_data_SRR1427482.fastq -t 30 > sra_data_SRR1427482_bwa.sam

and then in my terminal I have below result:

[M::bwa_idx_load_from_disk] read 0 ALT contigs

[M::process] read 8646532 sequences (300000008 bp)...

but after some minutes, my process will killed.

I also, ran this command for another sample and receive below message:

[M::bwa_idx_load_from_disk] read 0 ALT contigs

Segmentation fault (core dumped)

I appreciate if you share your comment with me.

Best Regards,

Mohammad

ADD REPLYlink modified 5 months ago • written 5 months ago by modarzi20
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1790 users visited in the last hour