Hi All,
I am new to the field of Bioinformatics, RNASeq etc. I managed to download differential gene expression data for a cohort of my study from GDC database. This data is having a column named as "cluster", and gene ids are occurring multiple times in different cluster numbers with different p values and log2FC values. I am having a hard time understanding what is this column-cluster representing in my dataset, and how should I consider them before I filter the data using thresholds for adjusted p value and log2FC? I am optimistically looking forward to a guidance from the experts of the field.
Thanks in advance.
can you show the name of the file and the first 10 lines of it?
biostars refused me to post answers as "not supported language"