Hello I have a file like
ENTRY EC 188.8.131.52 Enzyme NAME alcohol dehydrogenase; CLASS Oxidoreductases; SYSNAME alcohol:NAD+ oxidoreductase REACTION (1) a primary alcohol + NAD+ = an aldehyde + NADH + H+ [RN:R00623]; ALL_REAC R00623 > R00754 R02124 R02878 R04805 R04880 R05233 R05234 R06917 R06927 R08281 R08306 R08557 R08558 R10783; SUBSTRATE primary alcohol [CPD:C00226]; PRODUCT aldehyde [CPD:C00071]; ENTRY EC 184.108.40.206 Enzyme NAME 3-hydroxybutyryl-CoA dehydrogenase; CLASS Oxidoreductases; SYSNAME (S)-3-hydroxybutanoyl-CoA:NADP+ oxidoreductase REACTION (S)-3-hydroxybutanoyl-CoA + NADP+ = 3-acetoacetyl-CoA + NADPH + H+ [RN:R01976] ALL_REAC R01976; SUBSTRATE (S)-3-hydroxybutanoyl-CoA [CPD:C01144]; PRODUCT 3-acetoacetyl-CoA [CPD:C00332];
and i need to convert it to
ENTRY NAME CLASS SYSNAME REACTION ALL_REAC SUBSTRATE PRODUCT
and the corresponding values in rows. can anybody help me writing a script for this purpose.
output (tab separated):
This command gives correct output for first entry only. Can you please manipulate it to the entire file. I am not meticulous in awk.
miller can be installed via ubuntu (till xenial-16.04)/mint (sonya- 18.2) repos. However, you would need latest version of Miller. Compile it from miller github.