Pandas Data Frame
0
0
Entering edit mode
5 months ago
priya.bmg ▴ 50

Hello

I have a column with many semicolon separated values and it is not same number of values in each row. For example, in row 1, there are five headings separated by semicolon and in row 2 there are seven headings separated by semicolon and so on

Row1 SVTYPE=DEL;END=42784444;SVLEN=-2206;BCSQ=sv:cds|TTBK2||protein_coding|-||;hetalt=002-020_dedup

Row2 IMPRECISE;SVTYPE=DEL;END=63493240;SVLEN=-2060;SVANN=TANDEM;BCSQ=sv:cds|EEF1A2||protein_coding|-||;hetalt=002-016,002-051

Is there a way to organize the data in Pandas dataframe, such a way that when I divide the semicolon separated values into columns, these values don't get interchanged or misplaced in the separate columns

Thanks

vcf GATK-SV pandas annotation • 296 views
0
Entering edit mode

maybe you want to consider vcfpy in case you are handling a VCF file!