How to quantify the RNA-seq reads using Kallisto?
1
0
Entering edit mode
4.8 years ago
Farah ▴ 80

Hello, I am running Kallisto (for three paired files) as below. However, I got the following Errors:

mkdir /scratch/fs/ye/kallisto
cd /scratch/fs/ye/kallisto

find /scratch/fs/ye/trimmomatic -name "*trimmomatic_[12].fq.gz" | sort | head -n 6 | while read FW_READ
do
 read RV_READ
 FILEBASE=$(basename "${FW_READ%_1.fq.gz}")
 kallisto quant -i /home/fs/Homo_sapiens.GRCh38.cdna.all.release-97.idx -b 100 \
 o . -t 16 "$FW_READ" "$RV_READ"

 # because Kallisto doesn't let us specify an output filename so we rename all
 # output files

 mv "abundance.tsv" $FILEBASE-"abundance.tsv"

 mv "abundance.h5" $FILEBASE-"abundance.h5"

 mv "run_info.json" $FILEBASE-"run_info.json"

done

The ERRORs are as follows:

Error: file not found o
Error: need to specify output directory

mv: cannot stat `abundance.tsv': No such file or directory

mv: cannot stat `abundance.h5': No such file or directory

mv: cannot stat `run_info.json': No such file or directory

I would highly appreciate if you can help me how to fix this error.

Thank you very much.

RNA-Seq Kallisto • 1.9k views
ADD COMMENT
3
Entering edit mode
4.8 years ago
Ram 43k

Does your kallisto command need a -o . instead of o .? That's what the error message seems to be saying.

Plus, why the read RV_READ? Can that not be derived from FW_READ?

ADD COMMENT
0
Entering edit mode

Dear RamRS, Thank you so much for your advice. As you suggested, I used -o . instead of o . It generated the below messages in my standard error output file:

[quant] fragment length distribution will be estimated from the data
[index] k-mer length: 31
[index] number of targets: 189,154
[index] number of k-mers: 109,813,275
[index] number of equivalence classes: 763,796
[quant] running in paired-end mode
[quant] will process pair 1: /scratch/fs/ye/trimmomatic/SRR6822797-sortmerna-trimmomatic_1.fq.gz
                             /scratch/fs/ye/trimmomatic/SRR6822797-sortmerna-trimmomatic_2.fq.gz
[quant] finding pseudoalignments for the reads ... done
[quant] processed 12,755,479 reads, 11,205,135 reads pseudoaligned
[quant] estimated average fragment length: 161.419
[   em] quantifying the abundances ... done
[   em] the Expectation-Maximization algorithm ran for 1,258 rounds
[bstrp] number of EM bootstraps complete: 100

[quant] fragment length distribution will be estimated from the data
[index] k-mer length: 31
[index] number of targets: 189,154
[index] number of k-mers: 109,813,275
[index] number of equivalence classes: 763,796
[quant] running in paired-end mode
[quant] will process pair 1: /scratch/fs/ye/trimmomatic/SRR6822798-sortmerna-trimmomatic_1.fq.gz
                             /scratch/fs/ye/trimmomatic/SRR6822798-sortmerna-trimmomatic_2.fq.gz
[quant] finding pseudoalignments for the reads ... done
[quant] processed 5,362,940 reads, 4,798,586 reads pseudoaligned
[quant] estimated average fragment length: 166.668
[   em] quantifying the abundances ... done
[   em] the Expectation-Maximization algorithm ran for 1,057 rounds
[bstrp] number of EM bootstraps complete: 100


[quant] fragment length distribution will be estimated from the data
[index] k-mer length: 31
[index] number of targets: 189,154
[index] number of k-mers: 109,813,275
[index] number of equivalence classes: 763,796
[quant] running in paired-end mode
[quant] will process pair 1: /scratch/fs/ye/trimmomatic/SRR6822799-sortmerna-trimmomatic_1.fq.gz
                             /scratch/fs/ye/trimmomatic/SRR6822799-sortmerna-trimmomatic_2.fq.gz
[quant] finding pseudoalignments for the reads ... done
[quant] processed 4,315,257 reads, 3,750,150 reads pseudoaligned
[quant] estimated average fragment length: 164.278
[   em] quantifying the abundances ... done
[   em] the Expectation-Maximization algorithm ran for 1,242 rounds
[bstrp] number of EM bootstraps complete: 100

It also produced abundance.tsv, abundance.h5, and run_info.json files for each of my paired files. But, how can I be sure that all of these files are correct, and how to check if kallisto was performed correctly? as my standard error output is not zero, and it has a size of 15.

Many thanks for your help.

ADD REPLY
1
Entering edit mode

STDERR is not a placeholder for just error messages. It logs everything that does not belong in STDOUT. Progress messages, warnings and error messages are examples of what could go into STDERR. Do not judge your program status by STDERR file size.

I am not familiar with kallisto but if the above is all of your STDERR, the program seems to have completed OK (I don't see any error message).

ADD REPLY
0
Entering edit mode

Many thanks for your help. Yes you are right, it is now working well. Thanks a lot.

ADD REPLY

Login before adding your answer.

Traffic: 1555 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6