7.2 years ago
Lila M
Hi everybody,
I have the coverage for different genes as:
GENE COVERAGE
A 0.7
A 0.2
A 0.9
B 0.5
B 1.2
B 0.3
B 0.5
B 0.6
C 0.1
and I want to keep ONLY the highest coverage for each gene, as follows:
GENE COVERAGE
A 0.9
B 1.2
C 0.1
I need the most representative value for each gene to calculate its density. Any suggestions?
PS. I can't install MySQL on this computer, although I know it would be the best option...
Thanks!
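A minimal sketch of one way to do this with standard command-line tools, not necessarily the approach the replies below refer to. It assumes GNU or BSD sort (the -g general-numeric flag is an extension), whitespace-separated input with a single header line, and the hypothetical file names coverage.txt and max_coverage.txt:

```shell
# Sample data from the question, written to a hypothetical file.
cat > coverage.txt <<'EOF'
GENE COVERAGE
A 0.7
A 0.2
A 0.9
B 0.5
B 1.2
B 0.3
B 0.5
B 0.6
C 0.1
EOF

# 1. Skip the header line.
# 2. Sort by gene (column 1), then by coverage (column 2) as a number, descending.
# 3. Keep only the first line seen for each gene, which is now its maximum.
tail -n +2 coverage.txt \
  | LC_ALL=C sort -k1,1 -k2,2gr \
  | awk '!seen[$1]++' > max_coverage.txt

cat max_coverage.txt
```

On the sample data this leaves one line per gene: A 0.9, B 1.2 and C 0.1. The header is dropped, so it has to be added back if downstream tools expect it.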
Can you explain the code a bit, please?
Thank you!
I am not very good with this language, but it seems quite easy to use. For example, if I want to sort on different columns, say 7 and 11, I can try:
But can you explain what LC_ALL=C means?
Thanks!
This 'language' is bash.
The columns are specified with the '-k' option, so it would be
(....) -k7,7 -k11,11rg (...)
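To make the key syntax concrete, here is a small sketch on made-up data. The file table.tsv, its 11 tab-separated columns and the row labels r1 to r4 are all hypothetical; only columns 7 and 11 matter:

```shell
# Hypothetical 11-column, tab-separated input.
printf 'r1\t.\t.\t.\t.\t.\tgeneB\t.\t.\t.\t0.5\n'  > table.tsv
printf 'r2\t.\t.\t.\t.\t.\tgeneA\t.\t.\t.\t0.2\n' >> table.tsv
printf 'r3\t.\t.\t.\t.\t.\tgeneB\t.\t.\t.\t1.2\n' >> table.tsv
printf 'r4\t.\t.\t.\t.\t.\tgeneA\t.\t.\t.\t0.9\n' >> table.tsv

# -t sets the field separator to a tab.
# -k7,7     : first key is column 7 only (start and end at field 7).
# -k11,11rg : ties are broken on column 11, compared as general numbers (g),
#             in reverse, i.e. descending, order (r).
LC_ALL=C sort -t "$(printf '\t')" -k7,7 -k11,11rg table.tsv > sorted.tsv

# Show the row labels in their sorted order.
cut -f1 sorted.tsv
```

The rows come out as r4, r2, r3, r1: geneA before geneB, and within each gene the larger value in column 11 first. Writing -k7 instead of -k7,7 would make the key run from column 7 to the end of the line, which is usually not what is wanted.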
Yes, sorry about that, I figured it out right after posting! Nice approach to handling files :)
LC_ALL=C: see http://unix.stackexchange.com/questions/87745/what-does-lc-all-c-do
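In short, LC_ALL overrides every other locale setting for the command it prefixes, and the value C selects the plain POSIX locale. For sort that means lines are compared byte by byte and '.' is treated as the decimal point, regardless of the system language. A small sketch of the ordering effect (c_order.txt is a hypothetical file name):

```shell
# In the C locale, sort compares raw bytes, so uppercase ASCII letters
# (A-Z, bytes 65-90) come before lowercase ones (a-z, bytes 97-122).
printf 'b\nA\na\nB\n' | LC_ALL=C sort > c_order.txt

cat c_order.txt
```

This gives A, B, a, b. Without the prefix, the order depends on the locale configured on the machine, so the same command can give different results on different computers.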