Hi,
I tried to run the alignment, but the script does not find my raw files (NO such file or directory).
Nov 21 09:26:43 ..... Started STAR run
Nov 21 09:26:43 ..... Loading genome
Nov 21 09:27:12 ..... Processing annotations GTF
Nov 21 09:27:20 ..... Inserting junctions into the genome indices
gunzip: 1_2.fastq.gz: No such file or directory
gunzip: 2_1.fastq.gz: No such file or directory
gunzip: 2_2.fastq.gz: No such file or directory
gunzip: 3_1.fastq.gz: No such file or directory
gunzip: 3_2.fastq.gz: No such file or directory
gunzip: 4_1.fastq.gz: No such file or directory
gunzip: 4_2.fastq.gz: No such file or directory
gunzip: 5_1.fastq.gz: No such file or directory
gunzip: 5_2.fastq.gz: No such file or directory
gunzip: 6_1.fastq.gz: No such file or directory
gunzip: 6_2.fastq.gz: No such file or directory
gunzip: 7_1.fastq.gz: No such file or directory
gunzip: 7_2.fastq.gz: No such file or directory
gunzip: 8_1.fastq.gz: No such file or directory
gunzip: 8_2.fastq.gz: No such file or directory
Nov 21 09:29:43 ..... Started mapping
Nov 21 09:29:48 ..... Started sorting BAM
Nov 21 09:29:48 ..... Finished successfully
I tried changing the command to gunzip, it also did not work.
I tried to put every name of all my files:
/moreno/STAR-master/bin/Linux_x86_64_static/STAR --runThreadN 8 --genomeDir /home/acamar2/files/reference/ --sjdbGTFfile /home/acamar2/files/annotation/Rattus_norvegicus.Rnor_6.0.94.gtf --readFilesIn /home/acamar2/rawdata/1_1.fastq.gz,1_2.fastq.gz,2_1.fastq.gz,2_2.fastq.gz,3_1.fastq.gz,3_2.fastq.gz,4_1.fastq.gz,4_2.fastq.gz,5_1.fastq.gz,5_2.fastq.gz,6_1.fastq.gz,6_2.fastq.gz,7_1.fastq.gz,7_2.fastq.gz,8_1.fastq.gz,8_2.fastq.gz --readFilesCommand gunzip --quantMode TranscriptomeSAM GeneCounts --outSAMtype BAM SortedByCoordinate --outFileNamePrefix /home/acamar2/STAR1/fastq
the BAM directory only generates an empty folder.
Is it that I should create a variable for the samples, so does the STAR read the variable?
Like:
/home/acamar2/rawdata/fastqc $f/*.fastq.gz
for filename_1 in $f/*_1_*.fastq.gz
do
echo $filename_1
done
for filename_2 in $f/*_2_*.fastq.gz
do
echo $filename_2
done
/moreno/STAR-master/bin/Linux_x86_64_static/STAR --runThreadN 8 --genomeDir /home/acamar2/files/reference/ --sjdbGTFfile /home/acamar2/files/annotation/Rattus_norvegicus.Rnor_6.0.94.gtf --readFilesIn /home/acamar2/rawdata/1_1.fastq.gz,1_2.fastq.gz,2_1.fastq.gz,2_2.fastq.gz,3_1.fastq.gz,3_2.fastq.gz,4_1.fastq.gz,4_2.fastq.gz,5_1.fastq.gz,5_2.fastq.gz,6_1.fastq.gz,6_2.fastq.gz,7_1.fastq.gz,7_2.fastq.gz,8_1.fastq.gz,8_2.fastq.gz --readFilesCommand gunzip --quantMode TranscriptomeSAM GeneCounts --outSAMtype BAM SortedByCoordinate --outFileNamePrefix /home/acamar2/STAR1/fastq
echo $f"Complete"
done
Thanks
Hello,
I see you are working on RNAseq data. I presume you want to see your reads on the whole genome, so you need a fasta file with all chromosomes.
On ENSEMBL ftp server for rat, you have information about the files in the README file.
You should have take this file to create your index
First recreate your index with the proper command line then do the alignment
Copy/Paste your index command line in your post please
Also, be careful with french grammar in your post (de genome, Chr1 ou Chr2)
Hi,
I tried to run the alignment, but the script does not find my raw files (NO such file or directory).
Nov 21 09:26:43 ..... Started STAR run Nov 21 09:26:43 ..... Loading genome Nov 21 09:27:12 ..... Processing annotations GTF Nov 21 09:27:20 ..... Inserting junctions into the genome indices gunzip: 1_2.fastq.gz: No such file or directory gunzip: 2_1.fastq.gz: No such file or directory gunzip: 2_2.fastq.gz: No such file or directory gunzip: 3_1.fastq.gz: No such file or directory gunzip: 3_2.fastq.gz: No such file or directory gunzip: 4_1.fastq.gz: No such file or directory gunzip: 4_2.fastq.gz: No such file or directory gunzip: 5_1.fastq.gz: No such file or directory gunzip: 5_2.fastq.gz: No such file or directory gunzip: 6_1.fastq.gz: No such file or directory gunzip: 6_2.fastq.gz: No such file or directory gunzip: 7_1.fastq.gz: No such file or directory gunzip: 7_2.fastq.gz: No such file or directory gunzip: 8_1.fastq.gz: No such file or directory gunzip: 8_2.fastq.gz: No such file or directory Nov 21 09:29:43 ..... Started mapping Nov 21 09:29:48 ..... Started sorting BAM Nov 21 09:29:48 ..... Finished successfully
I tried changing the command to gunzip, it also did not work. I tried to put every name of all my files:
/moreno/STAR-master/bin/Linux_x86_64_static/STAR --runThreadN 8 --genomeDir /home/acamar2/files/reference/ --sjdbGTFfile /home/acamar2/files/annotation/Rattus_norvegicus.Rnor_6.0.94.gtf --readFilesIn /home/acamar2/rawdata/1_1.fastq.gz,1_2.fastq.gz,2_1.fastq.gz,2_2.fastq.gz,3_1.fastq.gz,3_2.fastq.gz,4_1.fastq.gz,4_2.fastq.gz,5_1.fastq.gz,5_2.fastq.gz,6_1.fastq.gz,6_2.fastq.gz,7_1.fastq.gz,7_2.fastq.gz,8_1.fastq.gz,8_2.fastq.gz --readFilesCommand gunzip --quantMode TranscriptomeSAM GeneCounts --outSAMtype BAM SortedByCoordinate --outFileNamePrefix /home/acamar2/STAR1/fastq
the BAM directory only generates an empty folder.
Is it that I should create a variable for the samples, so does the STAR read the variable? Like:
/home/acamar2/rawdata/fastqc $f/*.fastq.gz
for filename_1 in $f/_1_.fastq.gz do echo $filename_1 done for filename_2 in $f/_2_.fastq.gz do echo $filename_2 done /moreno/STAR-master/bin/Linux_x86_64_static/STAR --runThreadN 8 --genomeDir /home/acamar2/files/reference/ --sjdbGTFfile /home/acamar2/files/annotation/Rattus_norvegicus.Rnor_6.0.94.gtf --readFilesIn /home/acamar2/rawdata/1_1.fastq.gz,1_2.fastq.gz,2_1.fastq.gz,2_2.fastq.gz,3_1.fastq.gz,3_2.fastq.gz,4_1.fastq.gz,4_2.fastq.gz,5_1.fastq.gz,5_2.fastq.gz,6_1.fastq.gz,6_2.fastq.gz,7_1.fastq.gz,7_2.fastq.gz,8_1.fastq.gz,8_2.fastq.gz --readFilesCommand gunzip --quantMode TranscriptomeSAM GeneCounts --outSAMtype BAM SortedByCoordinate --outFileNamePrefix /home/acamar2/STAR1/fastq
echo $f"Complete"
done
Thanks
Please,
I still have trouble generating genome index:
This is my command line: /moreno/STAR-master/bin/Linux_x86_64_static/STAR --runThreadN 8 --runMode genomeGenerate --genomeDir /home/acamar2/files/reference --genomeFastaFiles /home/acamar2/files/RattuN/Rattus_norvegicus.Rnor_6.0.dna.chromosome.toplevel.fa --sjdbGTFfile /home/acamar2/annotation/Rattus_norvegicus.Rnor_6.0.94.gtf
when they are finished, these are the generated files (I used de top
drwxr-xr-x 2 acamar2 acamar2 4,0K Nov 20 10:26 . drwxr-xr-x 5 acamar2 acamar2 53 Nov 27 14:00 .. -rw-rw-r-- 1 acamar2 acamar2 5,1K Nov 20 09:28 chrLength.txt -rw-rw-r-- 1 acamar2 acamar2 17K Nov 20 09:28 chrNameLength.txt -rw-rw-r-- 1 acamar2 acamar2 12K Nov 20 09:28 chrName.txt -rw-rw-r-- 1 acamar2 acamar2 11K Nov 20 09:28 chrStart.txt -rw-rw-r-- 1 acamar2 acamar2 11M Nov 20 10:17 exonGeTrInfo.tab -rw-rw-r-- 1 acamar2 acamar2 4,5M Nov 20 10:17 exonInfo.tab -rw-rw-r-- 1 acamar2 acamar2 611K Nov 20 10:17 geneInfo.tab -rw-rw-r-- 1 acamar2 acamar2 3,0G Nov 20 10:22 Genome -rw-rw-r-- 1 acamar2 acamar2 747 Nov 20 09:26 genomeParameters.txt -rw-rw-r-- 1 acamar2 acamar2 22G Nov 20 10:26 SA -rw-rw-r-- 1 acamar2 acamar2 1,5G Nov 20 10:26 SAindex -rw-rw-r-- 1 acamar2 acamar2 5,7M Nov 20 10:17 sjdbInfo.txt -rw-rw-r-- 1 acamar2 acamar2 4,5M Nov 20 10:17 sjdbList.fromGTF.out.tab -rw-rw-r-- 1 acamar2 acamar2 4,5M Nov 20 10:17 sjdbList.out.tab -rw-rw-r-- 1 acamar2 acamar2 2,5M Nov 20 10:17 transcriptInfo.tab
I think my genome was not generated because I get to run the alignment, my generated files are zeroed. Could you help me, please?