Is it just me, or does the DECIPHER package only work well with GenBank format, especially for nucleotides?
5.8 years ago
arpankbasak ▴ 10

Hi, I'm struggling to work with amino acid sequences in the DECIPHER package. I found that DECIPHER works smoothly when the input is in GenBank format, but that format is tedious to download manually for the number of sequences I have. Some of my peptides are also unidentified. Is anyone aware of how flexible this package is for amino acid sequences (perhaps as part of a pipeline), or of any other package that handles them?

DECIPHER R • 1.2k views

I'm not familiar with the package. What are you trying to do? Identify the genomes which your peptides are found in?


In short, I want to reverse translate my peptides so that I can fetch the possible exons coding for them from genome coordinates, more like a de novo method.
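To make the reverse-translation idea concrete, here is a minimal sketch in plain Python (not DECIPHER). It assumes the standard genetic code and collapses all codons for each amino acid into one degenerate IUPAC codon; the function names are made up for illustration. Note that for Leu, Ser, Arg and stop the collapsed codon is over-inclusive, i.e. it also matches a few codons for other residues.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G at each position: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# IUPAC nucleotide ambiguity codes, keyed by the sorted set of bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "AG": "R", "CT": "Y", "CG": "S", "AT": "W", "GT": "K", "AC": "M",
    "CGT": "B", "AGT": "D", "ACT": "H", "ACG": "V", "ACGT": "N",
}


def build_degenerate_codons():
    """Collapse all codons of each amino acid into one IUPAC codon."""
    codons_by_aa = {}
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS):
        codons_by_aa.setdefault(aa, []).append(codon)
    degenerate = {}
    for aa, codons in codons_by_aa.items():
        # For each codon position, take the set of bases seen there.
        degenerate[aa] = "".join(
            IUPAC["".join(sorted({c[i] for c in codons}))] for i in range(3)
        )
    return degenerate


DEGENERATE = build_degenerate_codons()


def reverse_translate(peptide):
    """Return a degenerate DNA string; unknown residues become NNN."""
    return "".join(DEGENERATE.get(aa, "NNN") for aa in peptide.upper())


print(reverse_translate("MKW"))  # ATGAARTGG
```

The degenerate string is only a search pattern. Mapping it back to exons would still need a splice-aware, translated search against the genome, which this sketch does not attempt.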

By the way, DECIPHER is quite a popular package/pipeline; you can do almost everything with it, and it is really fast and not at all heavy on RAM. But it has limitations: it takes GenBank format but not GenPept, and when it comes to AAStringSet objects, DECIPHER fails to keep the sequence names. I'm open to any other options or pipelines.


They only mention FASTA, FASTQ, and GenBank in the manual:

To begin, we can import a FASTA, FASTQ, or GenBank file into a sequence database. Here we will import the sequences into an in-memory database that will be removed when we disconnect from the database. In-memory databases are useful for temporary examples, but typically we would specify the path to a file where we want to store the database. This is especially the case when there are many sequences, as they might not all fit into memory.

[source: https://www.bioconductor.org/packages/devel/bioc/vignettes/DECIPHER/inst/doc/DECIPHERing.pdf]

If your data is GenPept, then consider converting it (?).
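As a rough illustration of such a conversion, the sketch below turns GenPept flat-file text into FASTA, which the manual lists as a supported import format. It is plain Python, not part of DECIPHER, and it assumes the usual flat-file layout: a `LOCUS` line whose second field is the record name, an `ORIGIN` section holding the numbered sequence lines, and `//` ending each record. The function name `genpept_to_fasta` is hypothetical.

```python
def genpept_to_fasta(text):
    """Convert GenPept flat-file text to FASTA text (minimal sketch)."""
    records = []
    name, in_origin, chunks = None, False, []
    for line in text.splitlines():
        if line.startswith("LOCUS"):
            # Assumption: the second whitespace-separated field is the name.
            fields = line.split()
            name = fields[1] if len(fields) > 1 else "unnamed"
            in_origin, chunks = False, []
        elif line.startswith("ORIGIN"):
            in_origin = True
        elif line.startswith("//"):
            # End of record: store it and reset the parser state.
            if name is not None:
                records.append((name, "".join(chunks).upper()))
            name, in_origin, chunks = None, False, []
        elif in_origin:
            # Sequence lines mix position numbers, spaces and residues;
            # keep only the letters.
            chunks.append("".join(ch for ch in line if ch.isalpha()))
    return "".join(f">{name}\n{seq}\n" for name, seq in records)
```

Because the record name is written to the FASTA header, the identifiers should at least survive the conversion. For real data, an established parser is probably a safer choice than this hand-rolled one.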


Well, it's pretty complicated even when you convert it (I tried). What's behind the SQL is a "black box". But I'm waiting for the next update; they'll probably fix these issues with AAStringSet objects. The simplest way would be to put the data into .txt format, but even then the deeper operations become complicated.
