Closed:MuMRescueLite.py - multi-mapping reads rescue script failing
0
0
Entering edit mode
6.8 years ago
newscient ▴ 20

I am trying to use the script MuMRescueLite.py from http://genome.gsc.riken.jp/osc/english/dataresource/. That's the link for the software: http://genome.gsc.riken.jp/osc/english/software/src/MuMRescueLite_090522.tar.gz What it does more or less: After mapping, it examines the mapping locations of all reads (single and multi mapping reads) and using the information from single mapping reads, it probabilistically assigns where a multi mapping read maps The script needs a specific input of the format:

#ID locations   chromosome  strand  start   end count
read_x  1   chr1    +   100 126 100
read_y  2   chr1    +   102 128 100
read_y  2   chrX    +   102 128 100

While the .bed format output of an aligner (such as bowtie,bwa) looks like:

 chr6   135135525   135135550   HWI-ST897:159:C1ACKACXX:1:1101:1127:2068    255 +
chr5    140027416   140027441   HWI-ST897:159:C1ACKACXX:1:1101:1280:2087    255 +
chr16   57219907    57219932    HWI-ST897:159:C1ACKACXX:1:1101:1414:2089    255 -

I am using awk to turn the second(.bed format) in to the first format, as: awk '/./ {print $1"\t"$7 + 1"\t"$3"\t"$2"\t"$4"\t"$4 + length($5)"\t1"}3333' < <mapping output> > <MuM Input>

or awk '{$7=$7+1;end=$4+length($5);}{OFS="\t";print$1,$7,$3,$2,$4,end,"1"}'

Nevertheless MuMrescueLite.py keeps failing with the following message:

Traceback (most recent call last):
  File "../../tools/MuMRescueLite_090522/MuMRescueLite.py", line 194, in <module>
    remainSingleMapper(inFile, singleMappers, inputHeaderFlag)
  File "../../tools/MuMRescueLite_090522/MuMRescueLite.py", line 55, in remainSingleMapper
    chromosome, strand, position = retPosition(columns)
  File "../../tools/MuMRescueLite_090522/MuMRescueLite.py", line 39, in retPosition
    raise Exception 
Exception

I guess my issue is at the awk command as i am pretty new in using awk for file manipulations! Anyone that has previously used MumRescueLite or any awk expert that could help me out???

Thanks in advance

mapping reads • 262 views
ADD COMMENT
This thread is not open. No new answers may be added
Traffic: 1825 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6