Entering edit mode
8.3 years ago
wangshx
▴
10
The chromosome_strand column should indicate "-" or "+". It is right when I use zcat simple_somatic_mutation.open.tsv.gz |head -5 to see the head of data (from ICGC tsv file). But when I use awk to get some columns I am interested in, chromosome_strand all change to 1. Anybody know why?
$zcat simple_somatic_mutation.open.tsv.gz |head -5 | awk 'BEGIN{FS="\t";OFS="\t";} {print $1,$2,$3,$12}'
icgc_mutation_id icgc_donor_id project_code chromosome_strand
MU28469596 DO50633 BOCA-FR 1
MU28469596 DO50633 BOCA-FR 1
MU28469596 DO50633 BOCA-FR 1
MU28469596 DO50633 BOCA-FR 1
paste the output of
zcat simple_somatic_mutation.open.tsv.gz |head -5............. too many columns. Thanks for reminding me. I am wrong~
So your 12th col is not the strand, as you might have noticed too.