Hi guys, How do i extract a sequences from a fasta file by taking the start and end position from a gene predicted file: here the example the file with the orf statistics is my predicted file and for example the start position for the first orf is 65 and the end is 213. and the fasta file i'm going to search those position is the other one
my predicted file looks like this
>Seq1 [organism=S.burgodofry...
orf00001       65      213  +1     2.93
orf00002      799     2328  +1     7.09
orf00003     2331     3437  +3     6.09
orf00004     3457     4044  +1     6.15
>Seq2 [organism=S.burgodofry...
orf00001       55      317  +1     2.17
orf00002      206      610  +2     5.28
orf00003      747     2408  +3     4.85
and my fasta sequence sequence look like this:
>Seq1 [organism=S.burgodofry]...
ACTGTAGATGACATGACCAGTACGATACAGAT...
....
........
>Seq2 [organism=.....]
ATGTCGTGACTAGTACGATCAGATCAGAT
.........................
..............
...
You don't say which fields in your gene prediction file correspond to start and end. And that isn't a Fasta file, my friend. What have you tried so far?
my bad, the file with the orf statistics are is my predicted file and for example the start position for the first orf is 65 and the end is 213. and the fasta file i'm going to search those position is the other one