Hi All!!
I have searched a lot on how to fetch the subtelomeric region sequences chromosome wise. However I am unable to do that. Does someone have some experience doing that. I referred to this paper Mapping and Initial Analysis of Human Subtelomeric Sequence Assemblies https://www.ncbi.nlm.nih.gov/pmc/articles/PMC314271/
However the following link from abovementioned paper https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSM1958562 gives me SRR files which on dumping gives me the following output
@100_1000_1001_F3
T12022113211332310212120210000333321311203201121301
+100_1000_1001_F3
!@<@82?@>@@@@>@?@@@@@@@=@@@?/@=@@@@@@@@@@@?<@@2@@@@
This is unusual.
I used the following command:
fastq-dump --outdir fastq --gzip --skip-technical --origfmt --dumpcs --readids --read-filter pass --dumpbase --split-3 --clip SRR_ID