Entering edit mode
3.2 years ago
lee_victoria
•
0
Hi, i have RNAseq data provided to me in a VCF file format. Need help interpreting the "ID" column of the file. For example, for a single gene MDM4, there are 4 unique lines, where the ID is "MDM4.E1E2", "MDM4-MDM4.M2M11", "MDM4.E3E4" and "MDM4-MDM4.M7M10".
What do these nomenclature mean? are they different splice variants or alt transcripts of MDM4? When i do DEG analyses do I use the total READ COUNT for these 4 lines as the read count for MDM4, or analyse each line as 1 separate gene?
you need to tell use what software was used to insert those names in the ID column . It's usually defined in the VCF header.
Thanks for your reply! in the VCF header i can see the following which may be what you are looking for?
so you should ask Oncomine...