Rename all bins in a given collection (so they have pretty names).
🔙 To the main page of anvi’o programs and artifacts.
collection bin profile-db contigs-db
This program creates a new collection from the bins in another collection with specific guidelines. This is especially helpful when you wish to standardize your bin names, add project specific prefixes, and/or exclude those that do not match your criteria of completion, redundancy, and/or size estimates.
Let’s say you have a collection called MY_COLLECTION
, which has four bins that are named poorly (which can happen due to decisions made by automatic binning tools, or after a few steps of manual refinement): Bin_1_2_1
, Bin_2
, Bin_3_1_1
, and Bin_4
. In an instance like this, running the program anvi-rename-bins the following way will standardize these bin names with a prefix specific to your project:
anvi-rename-bins -c contigs-db \ -p profile-db \ --prefix SURFACE_OCEAN \ --collection-to-read MY_COLLECTION \ --collection-to-write SURFACE_OCEAN_SAMPLES \ --report-file rename.txt
Now your profile-db will have a new collection named SURFACE_OCEAN_SAMPLES
that will contains your four bins witht their new names SURFACE_OCEAN_Bin_00001
, SURFACE_OCEAN_Bin_00002
, SURFACE_OCEAN_Bin_00003
, and SURFACE_OCEAN_Bin_00004
. The new naming will order your bins based on their substantive completion (i.e., completion minus redunancy).
The file rename.txt
is a TAB-delimited file that contains a summary of your renaming process. The first column has the original name of the bins that you renamed, the second has their new names, and the remaining columns contain information about those bins (like their completion, redundency, and size).
You can also label your MAGs separately from your bins via the flag --call-MAGs
:
anvi-rename-bins -c contigs-db \ -p profile-db \ --prefix SURFACE_OCEAN \ --collection-to-read MY_COLLECTION \ --collection-to-write SURFACE_OCEAN_MAGS \ --report-file rename.txt \ --call-MAGs \ --min-completion-for-MAG 70
Now, the collection SURFACE_OCEAN_MAGS
will include SURFACE_OCEAN_MAG_00001
, SURFACE_OCEAN_MAG_00002
, SURFACE_OCEAN_MAG_00003
, and SURFACE_OCEAN_Bin_00004
. These are exactly the same bins that the collection contained before, but now the names differenciate the wheat from the chaff.
In addition to minimum completion estimate, you can also adjust the maximum redundancy value, minimum size to call MAGs. Please see the help menu for all parameters and their descriptions.
When you use the flag --call-MAGs
, anvi’o identifies those bins that could be considered ‘MAGs’ based on your specific criteria. But regardles of whether an original bin remains a bin, or tagged as a MAG, everything in your original collection will end up in your new collection. The flag --exclude-bins
enable you to filter out those that end up not being tagged as MAGs:
anvi-rename-bins -c contigs-db \ -p profile-db \ --prefix SURFACE_OCEAN \ --collection-to-read MY_COLLECTION \ --collection-to-write SURFACE_OCEAN_MAGS \ --report-file rename.txt \ --min-completion-for-MAG 70 \ --call-MAGs \ --exclude-bins
With the addition of the flag --exclude-bins
to the same command, the collection SURFACE_OCEAN_MAGS
will no longer include bins SURFACE_OCEAN_Bin_00003
and SURFACE_OCEAN_Bin_00004
.
See also the program anvi-delete-collection.
Following is an example reporting output file anvi’o will generate at the file path declared with the parameter --report-file
:
old_bin_name | new_bin_name | SCG_domain | completion | redundancy | size_in_Mbp |
---|---|---|---|---|---|
Bin_2 | p800_MAG_00001 | eukarya | 61.45 | 7.23 | 26.924911 |
Bin_1 | p800_MAG_00002 | bacteria | 98.59 | 8.45 | 1.612349 |
Bin_3 | p800_Bin_00003 | blank | 0.00 | 0.00 | 0.103694 |
Bin_5 | p800_Bin_00004 | blank | 0.00 | 0.00 | 0.128382 |
Bin_4 | p800_Bin_00005 | bacteria | 1.41 | 0.00 | 0.378418 |
The column SCG_domain
will explain which collection of single-copy core genes were used to generate these completion/redundancy estimates. The absence of any domain prediction for any given bin will be marked with the keyrowd blank
.
Edit this file to update this information.
Are you aware of resources that may help users better understand the utility of this program? Please feel free to edit this file on GitHub. If you are not sure how to do that, find the __resources__
tag in this file to see an example.