Question: Analyzing only forward reads in mothur
0
gravatar for giderk
21 months ago by
giderk0
giderk0 wrote:

I am new to using mothur and in attempting to follow the MiSeq_SOP guide with my data on 18S rRNA v4 sequences I am stumped in the beginning. The guide assumes you want to make contigs from paired-end reads, but I would like to analyze my forward reads separately. How do I combine just my read 1 fastq files, retaining the sample name for all the sequences in each file of course, so that I can proceed with the following alignment and classification steps? I can't seem to find an explanation on how to do this anywhere.

forward read miseq mothur • 1.5k views
ADD COMMENTlink modified 1 day ago by rgn50110 • written 21 months ago by giderk0

I have a similar question. Any luck in finding a solution?

ADD REPLYlink written 17 months ago by pmatson0

Can't you just skip the make.contigs()? Try preparing the stability files with just R1.

You may also have to hack your way and change your file names to the same pattern Mothur uses after the make.contigs() step.

ADD REPLYlink written 17 months ago by h.mon19k

But make.contigs() is the step where a quality information filter is applied as fastq files are converted to fasta files. If you skip make.contigs how do you get a fasta file? I tried making a .files table with only the forward fastq listed but wasn't able to get it to work. If you did, could you please share an example of the format?

Thanks! I am having this same problem trying to re-analyze some old public datasets (sequenced before paired end was a thing.)

Here is the description of the .files file format. You'll notice every format option includes a forward and reverse read: https://www.mothur.org/wiki/Make.contigs#file

ADD REPLYlink modified 17 months ago • written 17 months ago by rrr40
1
gravatar for alealdre
13 months ago by
alealdre10
alealdre10 wrote:

Hey there, you could use Fastq.info() to obtain the fasta and quality file for either forward or reverse sequences. Here is the link to the mothur help:

https://www.mothur.org/wiki/Fastq.info

Hope it's what you needed!

ADD COMMENTlink written 13 months ago by alealdre10

That's what I did. I wanted to test the pipeline on paired ends and on forward and reverse separately, so I used fastq.info() to obtain a fasta file from the fastq. I think make.contigs() also outputs a .groups file which is not made when you use fastq.info, so you need to make a .groups file too as you need that for unique.seqs()...

ADD REPLYlink written 12 months ago by vali0
0
gravatar for rgn5011
1 day ago by
rgn50110
rgn50110 wrote:

Hey guys, I know this is over a year old but I found out something that can fix this. Currently we are using mothur to analyze our PacBio data which comes as a bam file. We then convert the bam to a fastq file and ran into the same issue mentioned above.

So lets say I have 5 samples after we convert to fastq called 101.fastq, 102.fastq, 103.fastq, 104.fastq and 105.fastq.

First go into mothur and run fastq.info on all the samples (if using pacbio make sure to use the pacbio=T flag).

Then run a command called make.groups(), for this example this command would look like;

make.group(fasta=101.fasta-102.fasta-103.fasta-104.fasta-105.fasta, groups=101-102-103-104-105)

This results in a file called mergegroups (you can easily rename this)

Then exit mothur (or run a system() command) and run

cat *fasta > combo.fasta

to combine your fasta files. Then go back into mothur and treat the mergegroups file and the combo.fasta file like the output you would get from a make.contigs() command.

Hope this helps!

Bob

ADD COMMENTlink modified 1 day ago • written 1 day ago by rgn50110
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1489 users visited in the last hour