Pandas Data Frame
0
0
Entering edit mode
18 months ago
priya.bmg ▴ 60

Hello

I have a column with many semicolon separated values and it is not same number of values in each row. For example, in row 1, there are five headings separated by semicolon and in row 2 there are seven headings separated by semicolon and so on

Row1 SVTYPE=DEL;END=42784444;SVLEN=-2206;BCSQ=sv:cds|TTBK2||protein_coding|-||;hetalt=002-020_dedup

Row2 IMPRECISE;SVTYPE=DEL;END=63493240;SVLEN=-2060;SVANN=TANDEM;BCSQ=sv:cds|EEF1A2||protein_coding|-||;hetalt=002-016,002-051

Is there a way to organize the data in Pandas dataframe, such a way that when I divide the semicolon separated values into columns, these values don't get interchanged or misplaced in the separate columns

Thanks

vcf GATK-SV pandas annotation • 510 views
ADD COMMENT
0
Entering edit mode

maybe you want to consider vcfpy in case you are handling a VCF file!

ADD REPLY

Login before adding your answer.

Traffic: 1921 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6