Hi, I have a big matrix (peak by cell) that is more than 75Gb, and I could not open it by Scanpy or in R environment, so I want to separate them into 20 matrix by various rows ( 1-20000, 20000-32300, 32300-33300.....) in linux.
The way I know is "split" in linux, but it just separates file into smaller one which is the same size or the same column, right?
So could you please tell me whether there is way can help me? Thank you.
Thank you for helping me. So sorry I did not describe my data clearly. It is a sparse matrix with 1323041 rows and 1154611 columns, but there are no row names and column names. And it may look like this below
"." means there is no value. I want to separate the matrix by the number of rows. For example, the first smaller file I want is the data from ROW1 to ROW20000, containing the first 20000 rows data in original large matrix. And the second file would be from ROW20000 to ROW32300.
it will be better if the last smaller files still are sparse matrix format.