Say I have multiple files like this:
N_1_Log.final.out N_2_Log.final.out N_3_Log.final.out Ni_N1_Log.final.out Ni_N2_Log.final.out Ni_N3_Log.final.out Ni_1_Log.final.out Ni_2_Log.final.out Ni_3_Log.final.out Ni_SM1_Log.final.out Ni_SM2_Log.final.out Ni_SM3_Log.final.out
An example of what the .out file looks like:
Started job on | Apr 23 23:17:02 Started mapping on | Apr 23 23:17:04 Finished on | Apr 23 23:26:52 Mapping speed, Million of reads per hour | 115.68 Number of input reads | 18894432 Average input read length | 298 UNIQUE READS: Uniquely mapped reads number | 17704240 Uniquely mapped reads % | 93.70% Average mapped length | 297.39 Number of splices: Total | 3119841 Number of splices: Annotated (sjdb) | 2663436 Number of splices: GT/AG | 3080422 Number of splices: GC/AG | 14219 Number of splices: AT/AC | 248 Number of splices: Non-canonical | 24952 Mismatch rate per base, % | 0.49% Deletion rate per base | 0.02% Deletion average length | 2.70 Insertion rate per base | 0.02% Insertion average length | 2.30 MULTI-MAPPING READS: Number of reads mapped to multiple loci | 806405 % of reads mapped to multiple loci | 4.27% Number of reads mapped to too many loci | 1146 % of reads mapped to too many loci | 0.01%
I want to make a .csv file where I only take: Number of input reads, Uniquely mapped reads number, and Uniquely mapped reads %. Is there a tool that can do this? Any help would be appreciated.