how to create a dataset from fastq sequence files in ubuntu
1
0
Entering edit mode
5 weeks ago
Flo • 0

I am very new to bioinformatics and i need to analyse the microbiome in extracted dna samples (Illumina sequencing, single end). I have filtered the data (deleted barcodes and deleted the ones with a too high expected error with usearch) and created OTU files ( do not know if that was what i was supposed to do). i need to generate a dataset with the samples as rows and the columnnames as the microbiome species inside. the data is best relative abundancy data but at this point i am happy with binary data. just the presence of microbiota in one dataframe to preform a pca on the microbial communities in R. i already have a library of fastq files of all the bacteria, fungi and virus dna that i can use to detect the presence of microbiota. unfortunatly, i do not know how to do this or to use which commands. Ive been working on this for the past 3 weeks and dont know how to continue.

metagenomics microbiome dataset usearch ubuntu • 234 views
ADD COMMENT
0
Entering edit mode
5 weeks ago
Mark ★ 1.1k

Sounds like you need to use QIIME or QIIME2, they have good tutorials that will help you get started:

https://qiime2.org/

ADD COMMENT

Login before adding your answer.

Traffic: 1532 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6