
Batching

Output from Oxford Nanopore Technologies products can be batched in various ways, depending on the software used to generate the files.

MinKNOW configuration

UI

The UI for MinKNOW allows configuring the batching for BAM and FASTQ based on "Time elapsed" and "Read count".

Default values

The default_writer.toml file, located at <minknow_install_dir>/conf/package/shared/default_writer.toml, contains the default batching parameters and allows finer control than the UI provides.

File format configuration locations

Batching for POD5 and .fast5 is controlled in the following sections of the default_writer.toml config file:

File Type    Config Location
POD5         [writer_configuration.read_pod5]
.fast5       [writer_configuration.read_fast5]

Batching examples

To change how a file type is batched, replace the existing batching options for that file format with one of the following examples.

Batch based on read count

batch_duration = "0s"               # Time-based batching disabled
batch_count = 4000                  # Number of reads in each file

Batch into hourly files, or 500 million bases, whichever comes first

batch_duration = "3600s"            # Time-based batch duration
bases_per_batch = 500_000_000       # Batch based on estimated base output

Batch into hourly files

batch_duration = "3600s"            # Time-based batch duration
no_output_based_batching = {}       # Disable output-based batching

Batch into 1 gigabase files

bases_per_batch = 1_000_000_000     # Batch based on estimated base output
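
For context, the fragment below is a minimal sketch of how two of these examples might look once placed under the section headers listed in the table above. It assumes the batching keys sit directly inside each writer section and that any other keys already present in those sections are left unchanged; check the layout of your own default_writer.toml before editing.

```toml
# Sketch only: assumes batching keys live directly under each writer section.
# Keep any other existing keys in these sections as they are.

[writer_configuration.read_pod5]
batch_duration = "3600s"            # Hourly POD5 files
no_output_based_batching = {}       # Disable output-based batching

[writer_configuration.read_fast5]
batch_duration = "0s"               # Time-based batching disabled
batch_count = 4000                  # Number of reads in each .fast5 file
```

Each file format is configured independently in this sketch, so POD5 output would be split hourly while .fast5 output would be split every 4000 reads.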