Question: How to efetch specific attributes from BioSamples?
5 months ago, Arindam Ghosh wrote:

I need to extract the sample source for a list of sample ids. I tried using efetch as follows:

efetch -db biosample -id SAMN04383980 -format xml | xtract -pattern BioSampleSet -division BioSample -group Attributes -element Attribute

The problem is that a sample often has more than one attribute, and this command prints all of them:

         <Attribute attribute_name="source_name" harmonized_name="source_name" display_name="source name">H7 hESCs</Attribute>
         <Attribute attribute_name="cell line" harmonized_name="cell_line" display_name="cell line">H7 derived</Attribute> 

How do I extract the one with attribute_name="source_name" only?

5 months ago, Pierre Lindenbaum (France/Nantes/Institut du Thorax - INSERM UMR1087) wrote:

Using an XPath expression:

 wget -q -O - "" |\
   xmllint --xpath "/BioSampleSet/BioSample/Attributes/Attribute[@attribute_name='source_name']/text()"  -

H7 hESCs
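
If xmllint is not available, the same attribute filter can be sketched with Python's standard library. This is only an illustration: SAMPLE_XML is a trimmed stand-in built from the two Attribute lines in the question, wrapped in the element names used by the XPath above, and get_attribute is a hypothetical helper name, not part of any NCBI tool.

```python
import xml.etree.ElementTree as ET

# Trimmed stand-in for the efetch output. Only the two Attribute elements
# come from the question; the wrapper elements follow the XPath shown above.
SAMPLE_XML = """<BioSampleSet>
  <BioSample>
    <Attributes>
      <Attribute attribute_name="source_name" harmonized_name="source_name" display_name="source name">H7 hESCs</Attribute>
      <Attribute attribute_name="cell line" harmonized_name="cell_line" display_name="cell line">H7 derived</Attribute>
    </Attributes>
  </BioSample>
</BioSampleSet>"""


def get_attribute(xml_text, name):
    """Return the text of every Attribute whose attribute_name equals `name`."""
    root = ET.fromstring(xml_text)  # root element is BioSampleSet
    # ElementTree supports the [@attrib='value'] predicate from XPath.
    path = f"./BioSample/Attributes/Attribute[@attribute_name='{name}']"
    return [(el.text or "").strip() for el in root.findall(path)]


print(get_attribute(SAMPLE_XML, "source_name"))
```

To use it on real records, one option is to save the efetch XML for each sample ID to a file and pass the file contents to get_attribute. It returns a list because a record may contain zero or several matching attributes.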