Estimates taxonomy at genome and metagenome level using tRNA sequences..
🔙 To the main page of anvi’o programs and artifacts.
profile-db contigs-db trna-taxonomy collection bin metagenomes dna-sequence
genome-taxonomy genome-taxonomy-txt
This program uses the taxonomy associates of your tRNA sequences to estimate the taxonomy for genomes, metagenomes, or collection stored in your contigs-db.
This is the final step in the trna-taxonomy workflow. Before running this program, you’ll need to have run anvi-run-trna-taxonomy on the contigs-db that you’re inputting to this program.
By default, this program will assume that your contigs-db contains only a single genome and will determine the taxonomy of that single genome.
anvi-estimate-trna-taxonomy -c contigs-db
This will give you only the best taxonomy hit for your genome based on your tRNA data. If you want to look under the hood and see what results from anvi-run-trna-taxonomy it’s using to get there, add the --debug
flag.
anvi-estimate-trna-taxonomy -c contigs-db \ --debug
In metagenome mode, this program will assume that your contigs-db contains multiple genomes and will try to give you an overview of the taxa within it. To do this, anvi’o will determine which anticodon has the most hits in your contigs (for example GGG
), and then will look at the taxnomy hits for tRNA with that anticodon across your contigs.
anvi-estimate-trna-taxonomy -c contigs-db \ --metagenome-mode
If instead you want to look at a specific anticodon, you can specify that with the -S
parameter. For example, to look at GGT
, just run the following:
anvi-estimate-trna-taxonomy -c contigs-db \ --metagenome-mode \ -S GGT
You can use this program to look at multiple metagenomes by providing a metagenomes artifact. This is useful to get an overview of what kinds of taxa might be in your metagenomes, and what kinds of taxa they share.
Running this
anvi-estimate-trna-taxonomy --metagenomes metagenomes \ --output-file-prefix EXAMPLE
will give you an output file containing all taxonomic levels found and their coverages in each of your metagenomes, based on their tRNA.
You can use this program to estimate the taxonomy of all of the bins in a collection by providing the the collection and the associated profile-db.
anvi-estimate-trna-taxonomy -c contigs-db \ --C collection \ --p profile-db
When doing this, you can also put the final results into your profile-db as a misc-data-layers with the flag --update-profile-db-with-taxonomy
This program can run the entire ad hoc sequence search without a contigs-db involved (just a fasta and number of target sequences as a percent of the total; default: 20 percent), but this is not recommended. However, if you provide other parameters, they will be ignored.
anvi-estimate-trna-taxonomy --dna-sequence fasta \ --max-num-target-sequences 10
Now that you’ve inputted your desired inputs, you think about whether you want an output and what it will look like. By default, this program won’t give you an output (just genome-taxonomy information in your contigs-db. However, if you add any of these output options, it will instead produce a genome-taxonomy-txt.
If you want to look at the anticodon frequencies before getting taxonomy info at all (for example because you can’t decide which anticodon to use for input option 2), add the flag --report-anticodon-frequencies
. This will report the anticodon frequencies to a tab-delimited file and quit the program.
To get a single output (a fancy table for your viewing pleasure), just add the output file path.
In this example, the input will be a single contigs-db (input option 1),
anvi-estimate-trna-taxonomy -c contigs-db \ -o path/to/output.txt
This will give you a tab-delimited matrix with all levels of taxonomic information for the genome stored in your contigs-db. Specifically, the output is a genome-taxonomy-txt.
If you want to focus on a single taxonomic level, use the parameter --taxonomic-level
, like so:
anvi-estimate-trna-taxonomy -c contigs-db \ -o path/to/output.txt \ --taxonomic-level genus
You can also simplify the taxonomy names in the table with the flag --simplify-taxonomy-information
If you’re running on a profile-db, you can also choose to add the anticodon coverage to the output with --compute-anticodon-coverages
.
If you have multiple outputs (i.e. you are looking at multiple metagenomes (input option number 3) or you are looking at each anticodon individually with --per-anticodon-output-file
), you should instead provide a output filename prefix.
anvi-estimate-trna-taxonomy --metagenomes metagenomes \ --output-file-prefix EXAMPLE
The rest of the options listed for the single output (i.e. focusing on a taxonomic level, simplifying taxonomy information, etc.) still apply.
Edit this file to update this information.
Are you aware of resources that may help users better understand the utility of this program? Please feel free to edit this file on GitHub. If you are not sure how to do that, find the __resources__
tag in this file to see an example.