Question: Is There A Samtools/Bcftools Setting To Call Variants No Matter How Infrequent?
0
gravatar for Jeremy Leipzig
8.1 years ago by
Philadelphia, PA
Jeremy Leipzig18k wrote:

I can't get this position to be called by the samtools mpileup/bcftools stack.

I get the variant if I ask for all positions (not just variant):

samtools mpileup -u -f data/refs/myRef.fa data/bam/myBamFile.bam | bcftools view -

(showing one position for brevity)

myRef    21    .    C    T,G,X    0    .    DP=3796;I16=3730,0,55,0,135424,4994460,1972,72624,42241,1163355,634,7544,26664,192714,488,4340    PL    0,255,199,255,217,199,255,217,199,199

but if I ask for only variant positions I don't get anything.

samtools mpileup -u -f data/refs/myRef.fa data/bam/myBamFile.bam | bcftools view -v -

It is filtering but I can't figure out which filters to turn off.

The pileup itself is here:

myRef    21    C    3796    ...................................................................................................................TT........T.........T.....TT..T...T.T.T.....TT.TT...T.....T.TT..........T..TT......TT.T....T...TT.TT......T..T..T.T..TT...TT....TT.T..T..TT.TTT..........TT......TT............................................................................T...............................................................................................................................................................................T...........................................................................T................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................G.....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................T..............................................................................................................................................................................................................................................................................................................................................................................................................................    HG7GHHDFGFHHHGHHBH8FHGHHHHFDHHHCGEHEH6>HHHH4HFHHDHAHHHHFGCHH<G@GHCHH:FD?EHHEGHH7EHHHGEE?;1HGGHB14EDHHFEE:HH?HH<GHBFEHHHGH>HEHHGHHGH@EHGGB>HFH<GEH>HHGHFHFEHGHBGHHHHHHGHHEHHGHGBGHF@HFHHHHGEH4HDGHHEFDEHHEF@CBG/HHHHHHGHDHHH?HGDFHHHHF@3HGHHHHHHHE=:HEHHHHHHE@DHHHHHHHGHDHEEHGHFEBFFHGDGGFD<GEHHBHF=HGHHGGHHH/FHHEHEFH;HDFHEHHG?BEHEHHHDHHGHHEH;EHGEHDFD?DBHFHGGEHHC:HFHHGGHHGHBHGEHFHFHEEHBHHHHHDHHH6HHEHH8EH>HHHFBGHHGEHGGE?H0HF<HGGHFHFBHHH>HHBBHBHHHEHHGGB3:HHE7@=@EGEHHHHHHFHHE>HHHCHHEGEHHGHHBHEDHHDEEHDGHEG@GCH;HFHH>5EC>FHHGHGHCEHCHBHBCH=B<GHHDHEHHDEHFGHHFEHHFHHHEEH?GGHHH3HEEC?HGF:GH7EHCHHHEGHE4HH??>EEBHCDGBHHDHDG4?EG9HGGG@EEEHB1EHHHHGGDHHHCE6GGHHHFGHHFHGGHHCHHFHFH/4GGGEHFGHHGHH9EHHHHEGFGHGEE?G>EGEHHH@BEE?HGEGGHHGHDFDEHHHHFEHHEHHHHGHHHFGHHHFCHHHHGHHFHFH2HHFBHGHHHHGHBHHFG?HHH;H4EHCHHHHHBGHEHE=GGHHGCHHF2HHHFHBHHHHHGHHD2H<HH>HFHEHG9HHEHEHHGGHHCHEGG3HHHEHGHGEHGFBHGHG@GH>DHHH1HHHHHHE@HHHH0EBHBDE=FHH?GHHEHHHHHHBHHDFH>FG@EHFEHEAFHHHGHHEGHH<GGHHHH@H;HHHGH6HHCEHBEHGH>E;H4GHHBHDHH<F9FHEHHHHHHDE7GHHCHEHEEGHEHHHCHHHD;8E@G?HH9HBHBH>C@H?HHE@HHEHBGHEFEHEBFDHHEHH9HHH>GFHHHH;EHHBHHHHF6EGHHH@HEEGHBHFB>EHHH@HBHBHHHHHHEEHHFE?HHDEHGHEHHGH=HGG@GHGEEH7F:HFH:EHH@HEHGHECGBBGEBEF>FEGHHHEDHHEH/BHHHHEHGDC,<BBHCHFBHDGG2?DHHH?GF9GBGHGH<HHHHHHGGHHHEB@EGHCG=EGEHGG;CBBECH7EEGGAHFGEHHH=?EGH9HHHHFGGHH8HGHE83H>H>EGHFGHGGHHEGHGDHHHHGHGHH=EFGEHHH7?@HHHF2HEGGHDHHFFHHCHHHHFHHHH.DH;?CHHHHHHFHDHHH2H6HFFGB9EGEHEHHHEBHHH@H6H=HEDBHDHHHHE:HHHGCHEGHHEEEHHGHGHHHHDHBEHHGHEHHDGEHHFHHHDBHHHBHHHHFHHHH?HHCH?HH25/FHD4HHHDDEEHEEHHDHHEEDEHHH?DH2HHHHEBBHHBH@HHHDH@HHEFHHGGGHHHBF>F?HGHEGHHGHGHGDHHHEFHHHHH@HHDDGHG8C;GGEEH8H@HDHH<HHEHHH4EGHHBHBHGGFH@DHFH=EEHHHHHH?HGHHHHHHHGEFF,GG?H6GDBH@HHFHHHHHHHEHHGDGHBE<GGHHHEHHEF4DFGHHEHGH0HFH=GGGEH/HGDGHHBHEHHHHDEHCGHHEH?HFD9@FH3BHHFEEFGH>HHHA6GEH@2B/<EGEHHHH8HHFC/GGDFHEHHEBE=8HFEHBHG=HGHHBHEH?EHHHGHEHHFGHEHHE%H<HHHBH8HEEHE<DGG?@EGEHGHBFHHGBEEH@E<GH?HHGFDFH9B6GEBHHFHGFHHHHHHGGHFGHH@=GHFHHECEHFFAHHEFHE*G/?HFHBGF?G7G:GDHDHHGGDHHE@GHHG@HH?H>HF/FEEH3EFHHC;HHH?;GHHFEEF4EHH<;E=H9EHFCHGGHE>GEEDHHHHEFHHHGHHD?GHHEBB8EEBHBH>HH;FDCGHGGCC?HHGHHHEFHHHEHDHE?@HFE2HHDHHHEHFG:CDF:6>E>GHFFH9EDHH4EHHF6HD7HHHGFGDH4GDHEHHHEG6HGHH*HHHHEFGH.HHHFHHHBCCHB6HG=HHHHHGHHGEDHH6FGH>FDFHHFHHHHFEHHGHGHDHG?7HDHH;4@HEEHHHGGH:CEHFHHHHGHH:H2<HFGEDEEHHEFHEH?HEGHHH>BHCHBHHHHFHHFEH0HHHH=BC6HHHGHGGBGHEHHHH:EAHFHEHHHHHEHEHHHHEHHGHFGHHHEHHHHH?HHEH7HH4?HHEHEHHEFFED=GHGD;HHBHHHGHHFHHEHHGCHBFGHHHHEHGHEHGHE7HEHGFHHHF8HAGH=HEFGHEDHHHCHHHHHHEHHHDHEHHEDHECHCED:HEH:HHHHGGHH;;FEEHEEEHEHG@DHHDH5HFFFHHEDHHEEFHH=FHEHBHHHHHDHHHEGFDHGFHHHHHHFCHEHHH?G:EFGEHCGHGHHGHHHGE?GH*H>HHGHHHEFGHHEGGHHHHHH>GGH@HH=HHEHHBEGEHHDHHGHHHEHH9EDHHEGHHHHGHHGHHFECHH:HFGHHE9DE6HFHHH;EHE>HEEFEHBHHHHEHEFEEHHHHHG0@HC=HHBHBEGHHGH:BFEHHHFHH:HBGHDGHGEEHGHEHGGE6HBHHCEH?EB7GHEHFHH?GBG=@EEEBCE?HBHH?HEFHHHGHHHH6EHGHGGGGHHBEEHHCDHHHE=HHH:.HFHGGHEHHHGFGFHBHGHD@HHGHFBHHBHFHEGHHGHEHH9H3HHHHHBHFCHFHGE2GHHHHH<HHEHHCDH>EHHHBHHH>FECH0?EDHDE@HFGH<HHHFEFEHEFAHH@HEHGHHHH@BGEHHHHEHHEGHHHHHH?HH4FHHHEHHHHDHHHHE?FHHH?E:GHHHHHFHGHHHHEHHHHHHEG,EHBEHEHHHGDHBH8GDHH2HHGHHHHGHHBBEH;6HDHHH3H:HHH2HBHDHHHFHEHGHGFHHHGHGHH9HDHHE9HHE@HHHHHHHGHHHHAH=HH9HGECHBGBHEEHHHH@BHHEHHHGHHHHHHDHFHG@E=HD77DEHEH0HEDGEDEDHGHHHBHHHB;EHE)HGEEEFE:HHHEH7HEHHGHHHBHHGHGFHHGEHEHHEH@HEEHHH@?H@DHHHHFEEGC>GH9DH5H=H:HHHGG?H?EHHG;GEEHHHDGHHHHHHE@FCDHHEHHF9HHCCHGHEFHH=HH=G;HE=>HGHH<HHHHGDEF7G-GEH:GEHEFHHHHH>G4HGHHEGHHHFEF;HH@HHHHECHHH<HHHEHFE<GDFHHE'HHE=GHHH@9H@CHHHFHHHH>GH=HHHH?HH4HHE?GHHHH>HGDACHG2GEHHEEDGEHDHH5HH8HEHHHH>BHHBEH?DHBHH7EH1EHFHG:HHEGH<CHHGHH:GHH<=HGHGHH>EHHEGHGE=GFFHHHGHBDHHGGHH:BHHHEBHHHHEHHHG?HGFHHBHHHG>HFEEGGEHHHEGEFHHEHHGG6HGHGHHHHHDHHEGHAGFE1DEH:HHHDGH2G8CEFHHHHHGHGHHHBHHGAFHHGHH?HFHGGHHGHH?HH=HGHHEH;EFFGHDGHEEHHHHGHGHHHFGHE<HH?GHBHGHH>HHGHGFHFGHGHG<HHHHHH@HH,C@HGHABHHHHHHHGHHGEHGGHDHGBCDBGGGHHHHHHHGFHGHHGDGHBHGHHHHGHHEHDHFHHHHGHFHHHDBHGBHH?GHHHHHHGHHGG7HGHHHHGG=GC5HHHH?H:5?GHEHGHFGHFG
samtools bcftools • 4.6k views
ADD COMMENTlink written 8.1 years ago by Jeremy Leipzig18k
1
gravatar for Doctoroots
8.1 years ago by
Doctoroots780
Doctoroots780 wrote:

Hi Jeremy, this could be a possible duplicate of this question, where the answer is that the use of BAQ by samtools reduces some true positives together with the false positive reduction and it is suggested to user either -B or -E in order to fix it.

also, your problem could be due to high coverage filtration, in which case you should use the -D / -d options to set the maximal accepted depth.

ADD COMMENTlink modified 8 weeks ago by RamRS24k • written 8.1 years ago by Doctoroots780
1
gravatar for lh3
8.1 years ago by
lh331k
United States
lh331k wrote:

Bcftools assumes your sample is diploid, which greatly improves the power for variant calling from diploid sequences. Your SNP has <2% frequency, even approaching the sequencing error rate. There is no way bcftools can call it. If you really want to call it as a SNP "no matter how infrequent", you should just parse the I16 field (first 4 numbers are identical to DP4).

ADD COMMENTlink written 8.1 years ago by lh331k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1067 users visited in the last hour