Transrate is software for de-novo transcriptome assembly quality analysis. It examines your assembly in detail and compares it to experimental evidence such as the sequencing reads, reporting quality scores for contigs and assemblies. This allows you to choose between assemblers and parameters, filter out the bad contigs from an assembly, and help decide when to stop trying to improve the assembly.

NOTE: this is the documentation for transrate v1.0.3

The transrate paper is now published!

Overview

Transrate analyses a transcriptome assembly in three key ways:

  1. by inspecting the contig sequences
  2. by mapping reads to the contigs and inspecting the alignments
  3. by aligning the contigs against proteins or transcripts from a related species and inspecting the alignments

By far the most useful metrics are those based on read mapping, in particular you should pay attention to the Transrate assembly score, the optimised assembly score and the individual contig scores.

You can read about all the metrics in detail.

Installation

See the installation guide.

The command-line interface

Running transrate --help will show you the command-line interface:

Transrate v1.0.3
by Richard Smith-Unna, Chris Boursnell, Rob Patro,
   Julian Hibberd, and Steve Kelly

DESCRIPTION:
Analyse a de-novo transcriptome assembly using three kinds of metrics:

1. sequence based (if --assembly is given)
2. read mapping based (if --left and --right are given)
3. reference based (if --reference is given)

Documentation at http://hibberdlab.com/transrate

USAGE:
transrate <options>

OPTIONS:
  --assembly=<s>            Assembly file(s) in FASTA format, comma-separated
  --left=<s>                Left reads file(s) in FASTQ format, comma-separated
  --right=<s>               Right reads file(s) in FASTQ format, comma-separated
  --reference=<s>           Reference proteome or transcriptome file in FASTA
                            format
  --threads=<i>             Number of threads to use (default: 8)
  --merge-assemblies=<s>    Merge best contigs from multiple assemblies into
                            file
  --output=<s>              Directory where results are output (will be created)
                            (default: transrate_results)
  --loglevel=<s>            Log level. One of [error, info, warn, debug]
                            (default: info)
  --install-deps=<s>        Install any missing dependencies. One of [read, ref,
                            all]
  --examples                Show some example commands with explanations

See the getting started guide for more instructions.

Authors and Contributors

Transrate is a collabortation between the Hibberd lab (Cambridge) and the Kelly lab (Oxford). It was developed by Richard Smith-Unna, Chris Boursnell, Steve Kelly and Julian Hibberd, with significant contributions from Rob Patro (Stony Brook University). Matt MacManes and the Transrate user community provided valuable feedback and testing.

We welcome contributions - please see the README in the Github repo.

Get Support

If you're having trouble with Transrate, please first consult the documentation on this site.

If you can't find the answer to your question, you can search the user forum. If you still can't find the answer, please make a new post to the user forum describing the trouble you're having.

For urgent help, or just to chat to us, please come to the Transrate chat room.

If you're sure the problem you're encountering is a bug, please post it to the issue tracker and we'll fix it.

Citation, License & Copyright

Transrate is research software. We therefore require that users cite Transrate if they use it in a publication. Please cite the paper:

TransRate: reference free quality assessment of de-novo transcriptome assemblies (2016). Richard D Smith-Unna, Chris Boursnell, Rob Patro, Julian M Hibberd, Steven Kelly. Genome Research doi: http://dx.doi.org/10.1101/gr.196469.115

Transrate is free open source software, released under the MIT license.

All site content is released under CC-BY v3.0 CC-BY icon.