Greetings, I am able to run annotatePeaks.pl in HOMER on a ChIP-seq peak file, but in the annotated output, HOMER has changed my 'unique peak ID', which is problematic for downstream stuff I want to do. Can anyone help me? Problem detail below
I have a MACS ChIP-seq peak file ("PeakFile.txt") that I have formatted thusly:
Column1: Unique Peak ID
Column2: chromosome
Column3: starting position
Column4: ending position
Column5: Strand (+/- or 0/1, where 0="+", 1="-")
per http://homer.ucsd.edu/homer/ngs/annotation.html
Column1 entries read like so: peak1 peak2 peak3 peak4 ...
I am able to annotate the file with the following code in Terminal (Mac OS X):
checkPeakFile.pl PeakFile.txt ### all good
annotatePeaks.pl PeakFile.txt hg19 > annotatedPeakFile.txt
However in the resulting 'annotatedPeakFile.txt', the Column1 UniquePeak IDs read like so, sorted in ascending order as previously with the input 'PeakFile.txt
peak1 peak10 peak100 peak1000 peak10000 peak10001 peak10002
ugh. help please? I am a Mac OS user, and have saved my text .txt file in MS-DOS format as HOMER bucks Mac format. help appreciated