|
$ flex code.l && \ |
|
gcc lex.yy.c && \ |
|
curl -s "http://hgdownload.cse.ucsc.edu/goldenpath/hg19/chromosomes/chrUn_gl000249.fa.gz" |\ |
|
gunzip -c | ./a.out |
|
chrUn_gl000249 583 646 |
|
chrUn_gl000249 873 1720 |
|
chrUn_gl000249 1964 2885 |
|
chrUn_gl000249 2912 3126 |
|
chrUn_gl000249 3186 3894 |
|
chrUn_gl000249 4209 4845 |
|
chrUn_gl000249 4847 4956 |
|
chrUn_gl000249 4970 5361 |
|
chrUn_gl000249 5583 5889 |
|
chrUn_gl000249 5935 6233 |
|
chrUn_gl000249 6349 6543 |
|
chrUn_gl000249 6752 6780 |
|
chrUn_gl000249 7075 7097 |
|
chrUn_gl000249 7457 7570 |
|
chrUn_gl000249 7639 7728 |
|
chrUn_gl000249 7768 8151 |
|
chrUn_gl000249 8378 9020 |
|
chrUn_gl000249 9345 9637 |
|
chrUn_gl000249 9683 9953 |
|
chrUn_gl000249 9962 10278 |
|
chrUn_gl000249 10534 10599 |
|
chrUn_gl000249 10614 10680 |
|
chrUn_gl000249 10878 10980 |
|
chrUn_gl000249 11250 11358 |
|
chrUn_gl000249 11915 12293 |
|
chrUn_gl000249 12323 12486 |
|
chrUn_gl000249 12610 12914 |
|
chrUn_gl000249 13032 13090 |
|
chrUn_gl000249 13149 13371 |
|
chrUn_gl000249 13678 14014 |
|
chrUn_gl000249 14312 14599 |
|
chrUn_gl000249 14632 14796 |
|
chrUn_gl000249 14850 14939 |
|
chrUn_gl000249 15504 15595 |
|
chrUn_gl000249 15596 16393 |
|
chrUn_gl000249 16483 16583 |
|
chrUn_gl000249 17190 17395 |
|
chrUn_gl000249 17416 17697 |
|
chrUn_gl000249 17857 17915 |
|
chrUn_gl000249 18640 19662 |
|
chrUn_gl000249 20652 20739 |
|
chrUn_gl000249 20884 20965 |
|
chrUn_gl000249 20984 21148 |
|
chrUn_gl000249 22387 22441 |
|
chrUn_gl000249 22865 23159 |
|
chrUn_gl000249 23538 23649 |
|
chrUn_gl000249 25105 25321 |
|
chrUn_gl000249 25380 25598 |
|
chrUn_gl000249 26375 26675 |
|
chrUn_gl000249 27024 27227 |
|
chrUn_gl000249 27572 27709 |
|
chrUn_gl000249 27768 28017 |
|
chrUn_gl000249 28103 28418 |
|
chrUn_gl000249 29790 29859 |
|
chrUn_gl000249 30971 31080 |
|
chrUn_gl000249 31277 31309 |
|
chrUn_gl000249 32295 32431 |
|
chrUn_gl000249 32807 32828 |
|
chrUn_gl000249 33588 33632 |
|
chrUn_gl000249 34252 34323 |
|
chrUn_gl000249 36020 36169 |
|
chrUn_gl000249 37894 37915 |
Thanks for an answer! I haven't used flex before, could you please explain in greater detail how to install/run this code? (flex: can't open jeter.l)
in the current context GNU flex runs like a awk for C.
for example
is a pattern for the a fasta header.
Perfect! Thank you very much. I will just add note for the others that code reports are 0-based.
sorry I renamed the file when copy+paste, that should be
flex code.l
notflex jeter.l
Hi,
I'm trying to do the same but while testing the above script, I figured that I'm not getting the coordinates of the softmasked region at the end of the sequence.
For example, testing the script on the following three cases,
I get back,
But how would I change the flex script to give back the coordinates of the sequence "adad"?
Thanks,
Cheers,
ah yes, I forgot the trailing lowercase letters. I've added a
<<EOF>>
condition.This is very useful, but how do I run it with a locally stored .fasta file? I tried the following and I got an empty co-ordinate file and it doesn't finish running either. Also, is there a way to use multiple threads to run this?
flex code.l && \
gcc lex.yy.c && \
./a.out genome_softmasked.fasta > co-ords.gff