I want to extract the human-HCV(Hepatitis C virus) protein-protein interactions (PPI). For doing this, I have downloaded the entire content of the IntAct database as a .txt file. This .txt file has a huge size (4GB). I tried to convert this text file to a CSV file by Python and then extract just human-HCV PPIs. The problem is the size of the file, and I encounter a memory error.
import pandas as pd
read_file = pd.read_csv('intact.txt', delimiter='\t')
output: `MemoryError: Unable to allocate 162. MiB for an array with shape (41, 1035669) and data type object`
how should I solve this issue?
I sincerely would appreciate your help.