FASTA-format file using terminal command line
0
0
Entering edit mode
5.0 years ago
12sp • 0

using two files which are a subset of paired-end reads from a eukaryotic transcriptome (read length 90, insert size 200 I am trying to create a FASTA-format file containing all assembled transcripts 300 bp or larger. Each FASTA record should include a unique gene id (any alphanumeric identifier, e.g. gene 1234).

sequence • 1.5k views
ADD COMMENT
0
Entering edit mode

Ok. But what's your question and what does your data look like?

ADD REPLY
0
Entering edit mode

Since you have not clarified this I will ask.

  1. Are these RNAseq data?
  2. Are you trying to assemble those reads into a de novo fasta format transcriptome?
  3. Have you looked into what is involved in going from raw data to assembled sequence?
  4. If this is eukaryotic data then one possible place to start is Trinity page: https://github.com/trinityrnaseq/trinityrnaseq/wiki
ADD REPLY
0
Entering edit mode

Now this is sounding like an assignment based on the edits you made. If it is you should clearly state that. At this point you have a starting point with trinity. You now will need to show some effort on your part to get further assistance.

ADD REPLY

Login before adding your answer.

Traffic: 2424 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6