What apps can read large FASTQ files on Windows?
4
0
Entering edit mode
4 months ago
Sanjukta • 0

I have fastq files of around 12 GB size. I have tried opening them with Sublime Text and Atom but they are not able to read them. What apps can read really large FASTQ files on Windows?

fastq apps • 1.7k views
ADD COMMENT
3
Entering edit mode

Why do you want to open up such a large file in a text editor? You're not really going to learn anything new from manually looking at millions of sequences...

Just use the windows command line to view a few lines of the file instead. Or, if you really want to open it sublime text, maybe select only a few thousand reads from your FASTQ file and put them in a separate file then open that file up in sublime.

ADD REPLY
3
Entering edit mode

Also, as I suggested already here https://bioinformatics.stackexchange.com/questions/21931/how-to-download-sequencing-data-on-windows-using-sra-toolkit you're not doing much in a pure Windows environment. Use WSL2 or any other Unix.

ADD REPLY
1
Entering edit mode

Imho less than 10 reads is good enough to figure out what can be discovered in a FASTQ by a naked eye. Sequence naming convention, formatting, sequence length, interleaved or not and maybe quality encoding if one really insist. But apart from this one will get way more info using an intact (compressed by some sequencing core or SRA archive) FASTQ file and dedicated tools, i.e. fastp, seqkit etc.

ADD REPLY
3
Entering edit mode
4 months ago

As others have said, there's usually not much reason to look at large fastq files, but BBTools can process them in Windows if you install Java. For example...

java -cp C:\BBMap jgi.ReformatReads in=file.fastq out=stdout.fq reads=10

...would print the first 10 reads to the screen. It's worth noting that BBTools is developed in Windows so I can guarantee that all of the programs work in Windows as well as Linux.

Windows 10+ also has "Windows Subsystem for Linux" which can make this kind of thing much easier, allowing to (hopefully) use BBTools' shell scripts, for example, so the syntax would simpler:

reformat.sh in=file.fastq out=stdout.fq reads=10

I'm about to reboot my computer to install Windows Subsystem for Linux to see what it can do now... hopefully it has standard utilities like zcat and head, which are convenient for looking at fastq files.

ADD COMMENT
1
Entering edit mode

hopefully it has standard utilities like zcat and head

It does. Apart from some fussiness with mounting drives, you wouldn't know that you weren't ssh'd into a native linux box.

ADD REPLY
1
Entering edit mode
4 months ago

There are a number of bioinformatics workbenches that will run under windows, and will allow you to analyse fastq files.

Some examples are:

  • CLC workbench
  • Lazergene
  • Geneious
  • VectorNTI

Unfortunately these are all quite expensive, although some offer a free trial.

If you just want to look at the file, then I've had luck opening big files with Notepad++. However, if you want to open a 12GB file in any of these sorts of tool (assuming thats 12GB uncompressed, not .gz), then you will definately need at least 12GB of free memory.

If really all you need to do is look at the file. Then you might try opening the powershell application and using

$ Get-Content PATH/TO/FASTQ/FILE.fastq | Out-Host -Paging

or

$ Get-Content PATH/TO/FASTQ/FILE.fastq -First 40
ADD COMMENT
0
Entering edit mode

VectorNTI

Is now dead. Company decided to scuttle the product.

ADD REPLY
1
Entering edit mode
4 months ago

You don't want to do this, but one of my colleagues did once - he used large file viewer at this link

https://stackoverflow.com/questions/159521/text-editor-to-open-big-giant-huge-large-text-files

ADD COMMENT
0
Entering edit mode
4 months ago
size_t ▴ 120

As @dsull said, you won't really learn anything new from manually looking at millions of sequences.
If you insist on viewing, you can use the tools view command to view fastq files just like the less and more commands on linux

ADD COMMENT

Login before adding your answer.

Traffic: 1662 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6