Question: Is There A Samtools/Bcftools Setting To Call Variants No Matter How Infrequent?
0
gravatar for Jeremy Leipzig
6.0 years ago by
Philadelphia, PA
Jeremy Leipzig17k wrote:

I can't get this position to be called by the samtools mpileup/bcftools stack.

I get the variant if I ask for all positions (not just variant):

samtools mpileup -u -f data/refs/myRef.fa data/bam/myBamFile.bam | bcftools view -

(showing one position for brevity)

myRef    21    .    C    T,G,X    0    .    DP=3796;I16=3730,0,55,0,135424,4994460,1972,72624,42241,1163355,634,7544,26664,192714,488,4340    PL    0,255,199,255,217,199,255,217,199,199

but if I ask for only variant positions I don't get anything.

samtools mpileup -u -f data/refs/myRef.fa data/bam/myBamFile.bam | bcftools view -v -

It is filtering but I can't figure out which filters to turn off.

The pileup itself is here:

myRef    21    C    3796    ...................................................................................................................TT........T.........T.....TT..T...T.T.T.....TT.TT...T.....T.TT..........T..TT......TT.T....T...TT.TT......T..T..T.T..TT...TT....TT.T..T..TT.TTT..........TT......TT............................................................................T...............................................................................................................................................................................T...........................................................................T................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................G.....................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................T..............................................................................................................................................................................................................................................................................................................................................................................................................................    HG7GHHDFGFHHHGHHBH8FHGHHHHFDHHHCGEHEH6>HHHH4HFHHDHAHHHHFGCHH<G@GHCHH:FD?EHHEGHH7EHHHGEE?;1HGGHB14EDHHFEE:HH?HH<GHBFEHHHGH>HEHHGHHGH@EHGGB>HFH<GEH>HHGHFHFEHGHBGHHHHHHGHHEHHGHGBGHF@HFHHHHGEH4HDGHHEFDEHHEF@CBG/HHHHHHGHDHHH?HGDFHHHHF@3HGHHHHHHHE=:HEHHHHHHE@DHHHHHHHGHDHEEHGHFEBFFHGDGGFD<GEHHBHF=HGHHGGHHH/FHHEHEFH;HDFHEHHG?BEHEHHHDHHGHHEH;EHGEHDFD?DBHFHGGEHHC:HFHHGGHHGHBHGEHFHFHEEHBHHHHHDHHH6HHEHH8EH>HHHFBGHHGEHGGE?H0HF<HGGHFHFBHHH>HHBBHBHHHEHHGGB3:HHE7@=@EGEHHHHHHFHHE>HHHCHHEGEHHGHHBHEDHHDEEHDGHEG@GCH;HFHH>5EC>FHHGHGHCEHCHBHBCH=B<GHHDHEHHDEHFGHHFEHHFHHHEEH?GGHHH3HEEC?HGF:GH7EHCHHHEGHE4HH??>EEBHCDGBHHDHDG4?EG9HGGG@EEEHB1EHHHHGGDHHHCE6GGHHHFGHHFHGGHHCHHFHFH/4GGGEHFGHHGHH9EHHHHEGFGHGEE?G>EGEHHH@BEE?HGEGGHHGHDFDEHHHHFEHHEHHHHGHHHFGHHHFCHHHHGHHFHFH2HHFBHGHHHHGHBHHFG?HHH;H4EHCHHHHHBGHEHE=GGHHGCHHF2HHHFHBHHHHHGHHD2H<HH>HFHEHG9HHEHEHHGGHHCHEGG3HHHEHGHGEHGFBHGHG@GH>DHHH1HHHHHHE@HHHH0EBHBDE=FHH?GHHEHHHHHHBHHDFH>FG@EHFEHEAFHHHGHHEGHH<GGHHHH@H;HHHGH6HHCEHBEHGH>E;H4GHHBHDHH<F9FHEHHHHHHDE7GHHCHEHEEGHEHHHCHHHD;8E@G?HH9HBHBH>C@H?HHE@HHEHBGHEFEHEBFDHHEHH9HHH>GFHHHH;EHHBHHHHF6EGHHH@HEEGHBHFB>EHHH@HBHBHHHHHHEEHHFE?HHDEHGHEHHGH=HGG@GHGEEH7F:HFH:EHH@HEHGHECGBBGEBEF>FEGHHHEDHHEH/BHHHHEHGDC,<BBHCHFBHDGG2?DHHH?GF9GBGHGH<HHHHHHGGHHHEB@EGHCG=EGEHGG;CBBECH7EEGGAHFGEHHH=?EGH9HHHHFGGHH8HGHE83H>H>EGHFGHGGHHEGHGDHHHHGHGHH=EFGEHHH7?@HHHF2HEGGHDHHFFHHCHHHHFHHHH.DH;?CHHHHHHFHDHHH2H6HFFGB9EGEHEHHHEBHHH@H6H=HEDBHDHHHHE:HHHGCHEGHHEEEHHGHGHHHHDHBEHHGHEHHDGEHHFHHHDBHHHBHHHHFHHHH?HHCH?HH25/FHD4HHHDDEEHEEHHDHHEEDEHHH?DH2HHHHEBBHHBH@HHHDH@HHEFHHGGGHHHBF>F?HGHEGHHGHGHGDHHHEFHHHHH@HHDDGHG8C;GGEEH8H@HDHH<HHEHHH4EGHHBHBHGGFH@DHFH=EEHHHHHH?HGHHHHHHHGEFF,GG?H6GDBH@HHFHHHHHHHEHHGDGHBE<GGHHHEHHEF4DFGHHEHGH0HFH=GGGEH/HGDGHHBHEHHHHDEHCGHHEH?HFD9@FH3BHHFEEFGH>HHHA6GEH@2B/<EGEHHHH8HHFC/GGDFHEHHEBE=8HFEHBHG=HGHHBHEH?EHHHGHEHHFGHEHHE%H<HHHBH8HEEHE<DGG?@EGEHGHBFHHGBEEH@E<GH?HHGFDFH9B6GEBHHFHGFHHHHHHGGHFGHH@=GHFHHECEHFFAHHEFHE*G/?HFHBGF?G7G:GDHDHHGGDHHE@GHHG@HH?H>HF/FEEH3EFHHC;HHH?;GHHFEEF4EHH<;E=H9EHFCHGGHE>GEEDHHHHEFHHHGHHD?GHHEBB8EEBHBH>HH;FDCGHGGCC?HHGHHHEFHHHEHDHE?@HFE2HHDHHHEHFG:CDF:6>E>GHFFH9EDHH4EHHF6HD7HHHGFGDH4GDHEHHHEG6HGHH*HHHHEFGH.HHHFHHHBCCHB6HG=HHHHHGHHGEDHH6FGH>FDFHHFHHHHFEHHGHGHDHG?7HDHH;4@HEEHHHGGH:CEHFHHHHGHH:H2<HFGEDEEHHEFHEH?HEGHHH>BHCHBHHHHFHHFEH0HHHH=BC6HHHGHGGBGHEHHHH:EAHFHEHHHHHEHEHHHHEHHGHFGHHHEHHHHH?HHEH7HH4?HHEHEHHEFFED=GHGD;HHBHHHGHHFHHEHHGCHBFGHHHHEHGHEHGHE7HEHGFHHHF8HAGH=HEFGHEDHHHCHHHHHHEHHHDHEHHEDHECHCED:HEH:HHHHGGHH;;FEEHEEEHEHG@DHHDH5HFFFHHEDHHEEFHH=FHEHBHHHHHDHHHEGFDHGFHHHHHHFCHEHHH?G:EFGEHCGHGHHGHHHGE?GH*H>HHGHHHEFGHHEGGHHHHHH>GGH@HH=HHEHHBEGEHHDHHGHHHEHH9EDHHEGHHHHGHHGHHFECHH:HFGHHE9DE6HFHHH;EHE>HEEFEHBHHHHEHEFEEHHHHHG0@HC=HHBHBEGHHGH:BFEHHHFHH:HBGHDGHGEEHGHEHGGE6HBHHCEH?EB7GHEHFHH?GBG=@EEEBCE?HBHH?HEFHHHGHHHH6EHGHGGGGHHBEEHHCDHHHE=HHH:.HFHGGHEHHHGFGFHBHGHD@HHGHFBHHBHFHEGHHGHEHH9H3HHHHHBHFCHFHGE2GHHHHH<HHEHHCDH>EHHHBHHH>FECH0?EDHDE@HFGH<HHHFEFEHEFAHH@HEHGHHHH@BGEHHHHEHHEGHHHHHH?HH4FHHHEHHHHDHHHHE?FHHH?E:GHHHHHFHGHHHHEHHHHHHEG,EHBEHEHHHGDHBH8GDHH2HHGHHHHGHHBBEH;6HDHHH3H:HHH2HBHDHHHFHEHGHGFHHHGHGHH9HDHHE9HHE@HHHHHHHGHHHHAH=HH9HGECHBGBHEEHHHH@BHHEHHHGHHHHHHDHFHG@E=HD77DEHEH0HEDGEDEDHGHHHBHHHB;EHE)HGEEEFE:HHHEH7HEHHGHHHBHHGHGFHHGEHEHHEH@HEEHHH@?H@DHHHHFEEGC>GH9DH5H=H:HHHGG?H?EHHG;GEEHHHDGHHHHHHE@FCDHHEHHF9HHCCHGHEFHH=HH=G;HE=>HGHH<HHHHGDEF7G-GEH:GEHEFHHHHH>G4HGHHEGHHHFEF;HH@HHHHECHHH<HHHEHFE<GDFHHE'HHE=GHHH@9H@CHHHFHHHH>GH=HHHH?HH4HHE?GHHHH>HGDACHG2GEHHEEDGEHDHH5HH8HEHHHH>BHHBEH?DHBHH7EH1EHFHG:HHEGH<CHHGHH:GHH<=HGHGHH>EHHEGHGE=GFFHHHGHBDHHGGHH:BHHHEBHHHHEHHHG?HGFHHBHHHG>HFEEGGEHHHEGEFHHEHHGG6HGHGHHHHHDHHEGHAGFE1DEH:HHHDGH2G8CEFHHHHHGHGHHHBHHGAFHHGHH?HFHGGHHGHH?HH=HGHHEH;EFFGHDGHEEHHHHGHGHHHFGHE<HH?GHBHGHH>HHGHGFHFGHGHG<HHHHHH@HH,C@HGHABHHHHHHHGHHGEHGGHDHGBCDBGGGHHHHHHHGFHGHHGDGHBHGHHHHGHHEHDHFHHHHGHFHHHDBHGBHH?GHHHHHHGHHGG7HGHHHHGG=GC5HHHH?H:5?GHEHGHFGHFG
samtools bcftools • 3.2k views
ADD COMMENTlink written 6.0 years ago by Jeremy Leipzig17k
1
gravatar for Doctoroots
6.0 years ago by
Doctoroots740
Doctoroots740 wrote:

Hi Jeremy, this could be a possible duplicate of this question, where the answer is that the use of BAQ by samtools reduces some true positives together with the false positive reduction and it is suggested to user either -B or -E in order to fix it.

also, your problem could be due to high coverage filtration, in which case you should use the -D / -d options to set the maximal accepted depth.

ADD COMMENTlink written 6.0 years ago by Doctoroots740
1
gravatar for lh3
6.0 years ago by
lh330k
United States
lh330k wrote:

Bcftools assumes your sample is diploid, which greatly improves the power for variant calling from diploid sequences. Your SNP has <2% frequency, even approaching the sequencing error rate. There is no way bcftools can call it. If you really want to call it as a SNP "no matter how infrequent", you should just parse the I16 field (first 4 numbers are identical to DP4).

ADD COMMENTlink written 6.0 years ago by lh330k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 655 users visited in the last hour