User: tdmurphy

gravatar for tdmurphy
tdmurphy160
Reputation:
160
Status:
Trusted
Location:
Last seen:
6 months, 1 week ago
Joined:
3 years, 5 months ago
Email:
t**********@gmail.com

Posts by tdmurphy

<prev • 21 results • page 1 of 3 • next >
0
votes
2
answers
306
views
2
answers
Answer: A: How to download the feature table file of whole genome
... REDO sounds like it is specific for organelles -- is it valid to run on a whole genome? There are some annotated organelle sequences available for wheat, including two in RefSeq: https://www.ncbi.nlm.nih.gov/nuccore/?term=txid4565%5BOrganism%3Aexp%5D+AND+complete+genome%5Btitle%5D You can get featu ...
written 6 months ago by tdmurphy160
5
votes
1
answer
508
views
1
answers
Answer: A: 'CDS' but not 'exon' in GFF
... Organelles and prokaryote genomes in both GenBank and RefSeq are typically annotated with only CDS and no corresponding mRNA features, so the GFF3 has no exon features. This is mentioned briefly under "ANNOTATION DATA MODEL" in: ftp://ftp.ncbi.nlm.nih.gov/genomes/README_GFF3.txt ...
written 6 months ago by tdmurphy160
0
votes
2
answers
2.7k
views
2
answers
Comment: C: What is the difference between GRCh37 and hs37? And hg19?
... > Ultimately GENCODE is the organization responsible for managing human/mouse genome data. They provide the authoritative genome data that is used by everyone including NCBI/UCSC/Ensembl. I believe you mean the [Genome Reference Consortium][1] manages the human and mouse genome data. GENCODE is ...
written 11 months ago by tdmurphy160
0
votes
2
answers
647
views
2
answers
Answer: A: How to download large protein data from NCBI?
... NCBI RefSeq includes nearly all bacteria proteins, and has files available for download at: https://ftp.ncbi.nlm.nih.gov/refseq/release/bacteria/ ...
written 12 months ago by tdmurphy160
2
votes
3
answers
1.5k
views
3
answers
Answer: A: why the same gene is located at different chromosomes ?
... UCSC has two types of RefSeq tracks. The old "RefSeq Genes" or refgene track is based on alignments generated by UCSC, and can't distinguish between different locations with the same sequence. The newer "NCBI RefSeq" tracks are based on annotation imported from NCBI's RefSeq project, which uses addi ...
written 12 months ago by tdmurphy160
1
vote
1
answer
739
views
1
answers
Comment: C: Where I can download separate complete human chromosome genomes in GenBank forma
... [RefSeqGene][1] is a set of genomic records for individual clinically-relevant genes, not whole chromosomes. Genome annotation in GenBank flatfile format is available at: ftp://ftp.ncbi.nlm.nih.gov/genomes/refseq/vertebrate_mammalian/Homo_sapiens/all_assembly_versions/GCF_000001405.37_GRCh38.p11/GCF ...
written 16 months ago by tdmurphy160
0
votes
3
answers
878
views
3
answers
Answer: A: Easiest way to download all Enterobacteria
... You can easily do this from NCBI's Assembly resource: https://www.ncbi.nlm.nih.gov/assembly/?term=Enterobacteria%5Borgn%5D+latest_refseq%5Bfilter%5D click the blue "Download Assemblies" button, pick "refseq" and the filetype you're after (e.g. genomic FASTA), and it should work. It might take a whi ...
written 16 months ago by tdmurphy160
0
votes
1
answer
698
views
1
answers
Answer: A: Where to download gff3 file for EcoCyc data?
... I think EcoCyc uses the same IDs as EcoGene (EG#####), which are available on the GenBank and RefSeq annotations at NCBI. Try the "Download" links at the right from this page: https://www.ncbi.nlm.nih.gov/assembly/GCF_000005845.2 Both GFF3 and flatfiles are available. ...
written 16 months ago by tdmurphy160
1
vote
1
answer
625
views
1
answers
Answer: A: How to generate a new FASTA from an assembly-assembly mapping ?
... On the NCBI Remap FTP site, the alignment files for each assembly pair are provided twice (assm1/assm2 and assm2/assm1). The alignments themselves are flip-flopped for query and target, which would take care of most of your sorting problem. ...
written 16 months ago by tdmurphy160
0
votes
1
answer
555
views
1
answers
Answer: A: How to get Gene symbols & nuclotide FASTA for taxid :1239
... Many of the bacteria RefSeq genomes aren't available in NCBI's Gene database, so e-utils with the gene db won't work. If you have a specific set of assemblies in mind, try downloading the "feature_table.txt" files for that set and parsing what you need from there. e.g.: https://www.ncbi.nlm.nih.gov/ ...
written 16 months ago by tdmurphy160

Latest awards to tdmurphy

Teacher 6 months ago, created an answer with at least 3 up-votes. For A: 'CDS' but not 'exon' in GFF
Scholar 17 months ago, created an answer that has been accepted. For A: Retrieve GFF3 file from ncbi

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 845 users visited in the last hour