Error with trinity
0
0
Entering edit mode
3.3 years ago
haniyounes • 0

Good morning,

I am new in this domain, and I want to learn about trinity denovo assembly. I tried to lunch this line command from Ubuntu Vmbox , but I always get 2 errors:

~/sharedUbuntu/S.scutata$ sudo docker run --rm -v `pwd`:`pwd` trinityrnaseq/trinityrnaseq Trinity \
>       --seqType fq \
>       --left `pwd`/FastqOutput/SRR3574594_1.fastq  \
>       --right `pwd`/FastqOutput/SRR3574594_2.fastq \
>       --max_memory 28G --min_kmer_cov 2 --CPU 4 --monitoring --output /home/sharedUbuntu/S.scutata/Trinityresults


     ______  ____   ____  ____   ____  ______  __ __
    |      ||    \ |    ||    \ |    ||      ||  |  |
    |      ||  D  ) |  | |  _  | |  | |      ||  |  |
    |_|  |_||    /  |  | |  |  | |  | |_|  |_||  ~  |
      |  |  |    \  |  | |  |  | |  |   |  |  |___, |
      |  |  |  .  \ |  | |  |  | |  |   |  |  |     |
      |__|  |__|\_||____||__|__||____|  |__|  |____/

    Trinity-v2.11.0



Left read files: $VAR1 = [
          '/home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_1.fastq'
        ];
Right read files: $VAR1 = [
          '/home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_2.fastq'
        ];
Trinity version: Trinity-v2.11.0
** NOTE: Latest version of Trinity is v2.11.0, and can be obtained at:
    https://github.com/trinityrnaseq/trinityrnaseq/releases

Monday, December 21, 2020: 10:54:52 CMD: java -Xmx64m -XX:ParallelGCThreads=2  -jar /usr/local/bin/trinityrnaseq/util/support_scripts/ExitTester.jar 0
Monday, December 21, 2020: 10:54:52 CMD: java -Xmx4g -XX:ParallelGCThreads=2  -jar /usr/local/bin/trinityrnaseq/util/support_scripts/ExitTester.jar 1
Monday, December 21, 2020: 10:54:52 CMD: mkdir -p /home/sharedUbuntu/S.scutata/Trinityresults
Monday, December 21, 2020: 10:54:52 CMD: mkdir -p /home/sharedUbuntu/S.scutata/Trinityresults/chrysalis
STARTING COLLECTL
I'M THE PARENT, COLLECTL_PID=67
I'M THE CHILD RUNNING TRINITY
Running CMD: bash -c "set -eou pipefail && cd /usr/local/src/collectl && exec nohup /usr/local/bin/trinityrnaseq/trinity-plugins/COLLECTL/collectl/collectl --procopts w  --procfilt u0 --interval :60 -sZ -oD | egrep 'Trinity|trinityrnaseq|Inchworm|Chrysalis|Butterfly|jellyfish|samtools|sort|bowtie' | egrep -v 'egrep' > /usr/local/src/collectl/collectl.dat " 


----------------------------------------------------------------------------------
-------------- Trinity Phase 1: Clustering of RNA-Seq Reads  ---------------------
----------------------------------------------------------------------------------

---------------------------------------------------------------
------------ In silico Read Normalization ---------------------
-- (Removing Excess Reads Beyond 200 Coverage --
---------------------------------------------------------------

# running normalization on reads: $VAR1 = [
          [
            '/home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_1.fastq'
          ],
          [
            '/home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_2.fastq'
          ]
        ];


Monday, December 21, 2020: 10:54:52 CMD: /usr/local/bin/trinityrnaseq/util/insilico_read_normalization.pl --seqType fq --JM 28G  --max_cov 200 --min_cov 2 --CPU 4 --output /home/sharedUbuntu/S.scutata/Trinityresults/insilico_read_normalization --max_CV 10000  --left /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_1.fastq --right /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_2.fastq --pairs_together  --PARALLEL_STATS  
-prepping seqs
Converting input files. (both directions in parallel)CMD: seqtk-trinity seq -A -R 1  /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_1.fastq >> left.fa
CMD: seqtk-trinity seq -A -R 2  /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_2.fastq >> right.fa
CMD finished (318 seconds)
CMD finished (319 seconds)
CMD: touch left.fa.ok
CMD finished (0 seconds)
CMD: touch right.fa.ok
CMD finished (0 seconds)
Done converting input files.CMD: cat left.fa right.fa > both.fa
CMD finished (6 seconds)
CMD: touch both.fa.ok
CMD finished (0 seconds)
-kmer counting.
-------------------------------------------
----------- Jellyfish  --------------------
-- (building a k-mer catalog from reads) --
-------------------------------------------

CMD: jellyfish count -t 4 -m 25 -s 100000000  --canonical  both.fa
CMD finished (370 seconds)
CMD: jellyfish histo -t 4 -o jellyfish.K25.min2.kmers.fa.histo mer_counts.jf
CMD finished (62 seconds)
CMD: jellyfish dump -L 2 mer_counts.jf > jellyfish.K25.min2.kmers.fa
CMD finished (122 seconds)
CMD: touch jellyfish.K25.min2.kmers.fa.success
CMD finished (0 seconds)
-generating stats files
CMD: /usr/local/bin/trinityrnaseq/util/..//Inchworm/bin/fastaToKmerCoverageStats --reads left.fa --kmers jellyfish.K25.min2.kmers.fa --kmer_size 25  --num_threads 2  --DS  > left.fa.K25.stats
-reading Kmer occurrences...
CMD: /usr/local/bin/trinityrnaseq/util/..//Inchworm/bin/fastaToKmerCoverageStats --reads right.fa --kmers jellyfish.K25.min2.kmers.fa --kmer_size 25  --num_threads 2  --DS  > right.fa.K25.stats
-reading Kmer occurrences...

 done parsing 42222591 Kmers, 42222591 added, taking 118 seconds.

 done parsing 42222591 Kmers, 42222591 added, taking 120 seconds.
STATS_GENERATION_TIME: 390 seconds.
STATS_GENERATION_TIME: 392 seconds.
CMD finished (516 seconds)
CMD finished (520 seconds)
CMD: touch left.fa.K25.stats.ok
CMD finished (0 seconds)
CMD: touch right.fa.K25.stats.ok
CMD finished (0 seconds)
-sorting each stats file by read name.
CMD: head -n1 left.fa.K25.stats > left.fa.K25.stats.sort && tail -n +2 left.fa.K25.stats | /usr/bin/sort --parallel=4 -k1,1 -T . -S 14G >> left.fa.K25.stats.sort
CMD: head -n1 right.fa.K25.stats > right.fa.K25.stats.sort && tail -n +2 right.fa.K25.stats | /usr/bin/sort --parallel=4 -k1,1 -T . -S 14G >> right.fa.K25.stats.sort
CMD finished (44 seconds)
CMD finished (45 seconds)
CMD: touch left.fa.K25.stats.sort.ok
CMD finished (0 seconds)
CMD: touch right.fa.K25.stats.sort.ok
CMD finished (0 seconds)
-defining normalized reads
CMD: /usr/local/bin/trinityrnaseq/util/..//util/support_scripts//nbkc_merge_left_right_stats.pl --left left.fa.K25.stats.sort --right right.fa.K25.stats.sort --sorted > pairs.K25.stats
-opening left.fa.K25.stats.sort
-opening right.fa.K25.stats.sort
-done opening files.
CMD finished (338 seconds)
CMD: touch pairs.K25.stats.ok
CMD finished (0 seconds)
CMD: /usr/local/bin/trinityrnaseq/util/..//util/support_scripts//nbkc_normalize.pl --stats_file pairs.K25.stats --max_cov 200  --min_cov 2 --max_CV 10000 > pairs.K25.stats.C200.maxCV10000.accs
2402255 / 8330662 = 28.84% reads selected during normalization.
0 / 8330662 = 0.00% reads discarded as likely aberrant based on coverage profiles.
386418 / 8330662 = 4.64% reads discarded as below minimum coverage threshold=2
CMD finished (334 seconds)
CMD: touch pairs.K25.stats.C200.maxCV10000.accs.ok
CMD finished (0 seconds)
-search and capture.
-preparing to extract selected reads from: /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_1.fastq ... done prepping, now search and capture.
-capturing normalized reads from: /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_1.fastq
-preparing to extract selected reads from: /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_2.fastq ... done prepping, now search and capture.
-capturing normalized reads from: /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_2.fastq
CMD: touch /home/sharedUbuntu/S.scutata/Trinityresults/insilico_read_normalization/SRR3574594_1.fastq.normalized_K25_maxC200_minC2_maxCV10000.fq.ok
CMD finished (0 seconds)
CMD: touch /home/sharedUbuntu/S.scutata/Trinityresults/insilico_read_normalization/SRR3574594_2.fastq.normalized_K25_maxC200_minC2_maxCV10000.fq.ok
CMD finished (0 seconds)
CMD: ln -sf /home/sharedUbuntu/S.scutata/Trinityresults/insilico_read_normalization/SRR3574594_1.fastq.normalized_K25_maxC200_minC2_maxCV10000.fq left.norm.fq
ln: failed to create symbolic link 'left.norm.fq': Operation not permitted

**Error**, cmd: ln -sf /home/sharedUbuntu/S.scutata/Trinityresults/insilico_read_normalization/SRR3574594_1.fastq.normalized_K25_maxC200_minC2_maxCV10000.fq left.norm.fq died with ret 256 at /usr/local/bin/trinityrnaseq/util/insilico_read_normalization.pl line 793.

**Error**, cmd: /usr/local/bin/trinityrnaseq/util/insilico_read_normalization.pl --seqType fq --JM 28G  --max_cov 200 --min_cov 2 --CPU 4 --output /home/sharedUbuntu/S.scutata/Trinityresults/insilico_read_normalization --max_CV 10000  --left /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_1.fastq --right /home/sharedUbuntu/S.scutata/FastqOutput/SRR3574594_2.fastq --pairs_together  --PARALLEL_STATS   died with ret 512 at /usr/local/bin/trinityrnaseq/Trinity line 2826.
    main::process_cmd("/usr/local/bin/trinityrnaseq/util/insilico_read_normalization"...) called at /usr/local/bin/trinityrnaseq/Trinity line 3379
    main::normalize("/home/sharedUbuntu/S.scutata/Trinityresults/insilico_re"..., 200, ARRAY(0x5625fba791e0), ARRAY(0x5625fba79210)) called at /usr/local/bin/trinityrnaseq/Trinity line 3319
    main::run_normalization(200, ARRAY(0x5625fba791e0), ARRAY(0x5625fba79210)) called at /usr/local/bin/trinityrnaseq/Trinity line 1372
TERMINATING COLLECTL, PID = 67

Knowing that the command works well when I try it with the test samples provided with trinity package.

I used Omicbox (trial version), and I get also an error messages. But when I asked the application to not do normalisation step, it worked and I got a fasta file with contigs.

Can you please give me some information about this problem ?

Thanks

RNA-Seq Assembly trinity • 1.5k views
ADD COMMENT

Login before adding your answer.

Traffic: 2426 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6