Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Find positions varying in modification across isoforms with dmr isoform.

This command will compare the modification levels at common exonic positions for all isoforms of a gene.

Example Isoform methylation plot

Example

The input to dmr isoform is a single bedMethyl table that has been generated from a modBAM of reads aligned to a reference transcriptome (or transcriptome assembly). This command also requires an accompanying GTF file with valid transcript models.

Below is an example command:

$ modkit dmr isoform \
  ${bedmethyl} \
  isoform_dmr.bed \
  --gtf ${gtf}

The bedMethyl table is expected to be bgzipped and have a tabix index.

To generate an isoform SVG plot like the one above, add the following arguments:

modkit dmr isoform \
  ${bedmethyl} \
  isoform_dmr_${gene_name}.bed \
  --plot path/to/plot/dir \ # <-- add this option
  --gene-name ${gene_name} \ # <-- requires a gene name (or gene id to plot)
  --gtf ${gtf}

Note that exons without marks in certain transcripts indicates that there is no data in the input bedMethyl. The command modkit bedmethyl map-to-genome can map a transcript-aligned bedMethyl to genome coordinates.

Schema of output

columnnamedescriptiontype
1chromname of contig from GTFstr
2chromStart0-based start positionint
3chromEnd0-based exclusive end positionint
4namemodification codes present at this positionstr
5scorelikelihood ration scorefloat
6strandstrand, ‘+’ or ‘-’str
7p_valuep-value of alternative hypothesisfloat
8max_absolute_differencelargest effect size of all isoforms at this positionfloat
9n_transcriptsnumber of transcripts (isoforms) contributing to this positionint
10gene_idgene-id from the GTFstr
11gene_namegene-name from the GTF or ‘-’ if not foundstr
12per_isoform_proportionsJSON formatted string of per-transcript, per-modification proprotionsstr
13per_isoform_countsJSON formatted string of per-transcript, per-modification countsstr

Columns 12 and 13 are only present when the --full flag is passed.

Background

Gene sequences alone don’t describe all of the diversity of mRNAs in vertebrate cells. Through alternative splicing gene exons can be combined in multiple permutations called isoforms.

Central dogma of molecular biology and alternative splicing

Filtering the number of methylation marks

Some long genes may have many modified positions and drawing them all can get crowded. To only plot positions with a maximum p-value or minimum score use the --max-pvalue or --min-score arguments, respectively.