Simple Redirection, I/O Problem With Bedtools
3
0
Entering edit mode
11.2 years ago

Hi Guys, Just a quick question. Its more of a Bash question rather than Bioinformatics, with Bedtools in question.

I mostly pipe the bedtools I/O. Here's a general scenario :

sed 1d fileA.bed | intersectBed -a stdin -b peaks.bed | intersectBed -u -a stdin -b fileB.bed

Now, the problem is fileB is also having a head, which is reported as an error by intersectBed (makes sense, non-integer start).

How can I remove the first line or the head of the fileB on the fly in the pipe.

Thanks

bedtools bash pipeline linux intersect • 4.1k views
ADD COMMENT
8
Entering edit mode
11.2 years ago
Ryan Dale 5.0k

As of 2.13.0 bedtools supports FIFOs, so you can do:

intersectBed -a <(sed 1d fileA.bed) -b <(sed 1d peaks.bed) | intersectBed -u -a stdin -b fileB.bed
ADD COMMENT
2
Entering edit mode

If I am right, this is a bash trick, which can be used with any programs, irrelevant of bedtools and its versions. But as it is shell dependent, you cannot use it in a C-shell.

ADD REPLY
0
Entering edit mode

That's right, FIFOs are a shell technology, not a bedtools technology. What @Daler is describing is that, owing to a silly mistake, early versions of bedtools were incapable of using FIFOs as input.

ADD REPLY
0
Entering edit mode

That's a great news and post!! Cheers

ADD REPLY
0
Entering edit mode

Very cool, I didn't know you could do this. I'm going to have to start using this method.

ADD REPLY
0
Entering edit mode

More generally, sed can be used to extract any part of a multiline text file, between lines n and m and you can do a similar trick by piping head and tail, see e.g. http://linux.byexamples.com/archives/130/head-and-tail/

ADD REPLY
4
Entering edit mode
11.2 years ago
Fwip ▴ 500

Have you tried using named pipes? Your commands would basically be the same as those listed by Damian, except with an additional mkfifo command at the front.

mkfifo tmp_pipe ;
sed 1d fileB.bed > tmp_pipe & 
sed 1d fileA.bed | intersectBed -a stdin -b peaks.bed | intersectBed -u -a stdin -b tmp_pipe ; 
rm -f tmp_pipe ;

(I split this into separate lines for ease of reading).

Edit: lh3 is right, you need to write to the pipe in a background process, or your shell will stall while waiting for the program to close. The commands have been updated. Thanks for the correction!

ADD COMMENT
2
Entering edit mode

One small thing: you need to put the second line to the background. i.e. you need "&" at the end of the 2nd line.

ADD REPLY
1
Entering edit mode

Good one, but Daler's answer saves more typing as FIFO is inbuilt now. Thanks

ADD REPLY
0
Entering edit mode

Thank you, I didn't know "mkfifo"

ADD REPLY
0
Entering edit mode
11.2 years ago

Might be easier to just use several commands separated by semi-colon:

sed 1d fileB.bed > fileB.temp.bed;sed 1d fileA.bed | intersectBed -a stdin -b peaks.bed | intersectBed -u -a stdin -b fileB.temp.bed;rm -rf fileB.temp.bed
ADD COMMENT
0
Entering edit mode

Yeah, but this is a workaround, need something better.

ADD REPLY

Login before adding your answer.

Traffic: 2517 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6