automating multiple fastq.gz -> SAM
1
0
Entering edit mode
9.6 years ago
s.vandenhurk ▴ 10

Hello Guys,

I am interested in automating the combining of multiple fastq.gz files. As you can see in the image below I have already indexed my reference genome with BWA and I have combined the fastq.gz files from my COL strain. Because this Arabidopsis is a test set, I could do it manually. However, in the near future I plan to do the same thing with over 500 strains. So I would like to automate the process.

I have used the linux command cat *_1.fastq.gz > Combined_1.fastq.gz to get it to work at the COL strain. Should I create some sort of bash loop for things like this?

tree

I have already aligned these Combined_1 and Combined_2 to the reference genome and I have used these results with BWA sampe to get a SAM file. So far, results look promising. But, could this process also be automated?

Assembly sequencing • 2.6k views
ADD COMMENT
1
Entering edit mode
9.6 years ago

See my demo project https://github.com/lindenb/ngsxml to generate Makefile-based workflows using XSLT

wordlow

ADD COMMENT
1
Entering edit mode

I had considered posting a comment of the form, "Pierre's reply mentioning ngsxml in 3, 2, 1...". I guess I should have!

ADD REPLY
0
Entering edit mode

:-)

ADD REPLY
0
Entering edit mode

How did you make that graph?

ADD REPLY
0
Entering edit mode
ADD REPLY

Login before adding your answer.

Traffic: 2075 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6