Question: Is it just me, or does the DECIPHER package only work well with GenBank format, especially for nucleotides?
7 months ago by
arpankbasak10 wrote:

Hi, I'm struggling to work with amino acid sequences in the DECIPHER package. As far as I can tell, DECIPHER works smoothly when the input is in GenBank format, which is impractical to download manually for this many sequences. Some of my peptides are unidentified. Is anyone aware of how flexible this package is for this use case (maybe as part of a pipeline), or of any other package for handling amino acid sequences?

R decipher • 204 views

I'm not familiar with the package. What are you trying to do? Identify the genomes which your peptides are found in?

written 7 months ago by jrj.healey10k

In short, I want to reverse-translate my peptides so that I can fetch the possible exons coding for them from genome coordinates, more like a de novo method.
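To make the reverse-translation step concrete, here is a minimal sketch in plain Python (standard library only, not DECIPHER). It assumes the standard genetic code (NCBI translation table 1) and collapses the codons of each amino acid into one degenerate IUPAC codon; the function names `degenerate_codon` and `back_translate` are made up for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# IUPAC nucleotide ambiguity codes, keyed by the set of bases they stand for.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C",
    frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y",
    frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M",
    frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V",
    frozenset("ACGT"): "N",
}


def degenerate_codon(aa):
    """Return one IUPAC codon covering every codon of the amino acid `aa`."""
    codons = [codon for codon, residue in CODON_TABLE.items() if residue == aa]
    if not codons:
        # Unknown or ambiguous residue (e.g. X): any codon is possible.
        return "NNN"
    return "".join(
        IUPAC[frozenset(codon[i] for codon in codons)] for i in range(3)
    )


def back_translate(peptide):
    """Reverse-translate a peptide into a degenerate nucleotide sequence."""
    return "".join(degenerate_codon(aa) for aa in peptide.upper())


print(back_translate("MKF"))  # ATGAARTTY
```

Note that for amino acids with six codons (L, R, S) the per-position consensus is broader than the true codon set: Leu becomes `YTN`, which also matches the two Phe codons. The degenerate sequence is therefore only a loose search pattern against a genome, not an exact one.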

By the way, DECIPHER is quite a popular package or pipeline; you can do almost everything with it, and it's really fast and not at all heavy on RAM. But there are limitations: it takes GenBank format but not GenPept. When it comes to AAStringSet objects, DECIPHER fails to keep the sequence names. I'm open to any other options or pipelines.

written 7 months ago by arpankbasak10

They only mention FASTA, FASTQ, and GenBank in the manual:

To begin, we can import a FASTA, FASTQ, or GenBank file into a sequence database. Here we will import the sequences into an in-memory database that will be removed when we disconnect from the database. In-memory databases are useful for temporary examples, but typically we would specify the path to a file where we want to store the database. This is especially the case when there are many sequences, as they might not all fit into memory.


If your data is GenPept, then consider converting it (?).
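One possible way to do the conversion, sketched in plain Python (standard library only, not part of DECIPHER): read the GenPept flat file and write FASTA, which is one of the formats the manual lists. The function names `parse_genpept` and `to_fasta` and the sample record are made up for this illustration, and the parser is deliberately simplified: it keeps only the first DEFINITION line and ignores the feature table.

```python
import io


def parse_genpept(handle):
    """Yield (identifier, definition, sequence) for each GenPept record.

    Simplified: only the first DEFINITION line is kept and features are
    ignored. The identifier is the VERSION field if present, else LOCUS.
    """
    identifier, definition, residues, in_origin = "", "", [], False
    for raw in handle:
        line = raw.rstrip()
        if line.startswith("//"):  # end of record
            if residues:
                yield identifier, definition, "".join(residues).upper()
            identifier, definition, residues, in_origin = "", "", [], False
        elif in_origin:
            # Sequence lines look like "        1 mkflvtagha yw": keep letters.
            residues.extend(ch for ch in line if ch.isalpha())
        elif line.startswith(("LOCUS", "VERSION")):
            fields = line.split()
            if len(fields) > 1:
                identifier = fields[1]
        elif line.startswith("DEFINITION"):
            definition = line[len("DEFINITION"):].strip()
        elif line.startswith("ORIGIN"):
            in_origin = True


def to_fasta(records, width=60):
    """Format parsed records as FASTA text, wrapping sequences at `width`."""
    lines = []
    for identifier, definition, sequence in records:
        lines.append(f">{identifier} {definition}".rstrip())
        lines.extend(
            sequence[i:i + width] for i in range(0, len(sequence), width)
        )
    return "".join(line + "\n" for line in lines)


# Made-up record, for illustration only.
SAMPLE = """\
LOCUS       TEST_0001                 12 aa            linear   UNA 01-JAN-2000
DEFINITION  hypothetical test peptide.
VERSION     TEST_0001.1
ORIGIN
        1 mkflvtagha yw
//
"""

print(to_fasta(parse_genpept(io.StringIO(SAMPLE))), end="")
```

Because the identifier ends up in the FASTA header, the sequence names travel with the file rather than depending on how an in-memory object is handled.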

written 7 months ago by Kevin Blighe37k

Well, it's pretty complicated even when you convert it (I tried). What's behind the SQL is a "black box". But I'm waiting for the next update; they'll probably fix these issues with AAStringSet objects. The simple way would be to export the data to .txt format. But still, the deeper operations are becoming complicated.

written 7 months ago by arpankbasak10