trnaseq-db [artifact]

DB

A DB-type anvi’o artifact. This artifact is typically generated, used, and/or exported by anvi’o (and not provided by the user).


Provided by

anvi-trnaseq

Required or used by

anvi-merge-trnaseq

Description

A tRNA-seq database contains information on tRNA sequences predicted from a single tRNA-seq sample.
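
To see what a given tRNA-seq database actually holds, one option is to list its tables directly. The sketch below assumes that the file is an SQLite database, as other anvi’o databases are; the helper name `list_tables` and the path `SAMPLE-TRNASEQ.db` are hypothetical, and no particular table names are assumed.

```python
import sqlite3


def list_tables(db_path):
    """Return the names of all tables in an SQLite file, sorted alphabetically."""
    conn = sqlite3.connect(db_path)
    try:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        conn.close()
    return [name for (name,) in rows]


# Hypothetical usage; replace the path with a real tRNA-seq database:
# print(list_tables("SAMPLE-TRNASEQ.db"))
```

Note that `sqlite3.connect` creates an empty file when the path does not exist, so it is worth checking the path before calling the helper.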

This database is the key output of anvi-trnaseq. That program predicts which reads are tRNA through structural profiling, clusters tRNA reads into discrete biological sequences, and predicts the positions of nucleotide modifications.

The series of steps implemented in anvi-trnaseq sequentially adds the following information to the database.

  • Unique sequences predicted to be tRNA, including read counts
  • Primary sequence and secondary structural features (stems and loops) predicted in each profiled tRNA
  • Unconserved nucleotides in the primary sequence that differ from expectation
  • Unpaired nucleotides in the stems
  • “Trimmed” tRNA sequences, formed from unique sequences only differing by 3’ nucleotides of the CCA acceptor region and 5’ nucleotides beyond the acceptor stem
  • “Normalized” tRNA sequences, formed by dereplicating trimmed tRNA sequences that are 3’ fragments from incomplete reverse transcription and by mapping biological 5’ and interior tRNA fragments
  • Potentially modified tRNA sequences, formed by clustering normalized tRNA sequences and retaining those clusters that differ by 3-4 nucleotides at aligned positions
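
The trimming and normalization steps in the list above can be illustrated with a toy sketch. This is not the anvi-trnaseq implementation: it ignores structural profiling, 5’ extensions, and fragment mapping, and the set of 3’ endings in `ACCEPTOR_VARIANTS` as well as all function names are assumptions made for illustration only.

```python
from collections import defaultdict

# Hypothetical 3' endings treated as acceptor variants in this sketch.
ACCEPTOR_VARIANTS = ("CCA", "CC", "C")


def trim_3prime(seq):
    """Strip the longest matching 3' acceptor variant, if any."""
    for tail in ACCEPTOR_VARIANTS:
        if seq.endswith(tail):
            return seq[: -len(tail)]
    return seq


def trim_and_count(unique_reads):
    """Merge unique sequences that differ only in their 3' tail, summing read counts."""
    trimmed = defaultdict(int)
    for seq, count in unique_reads.items():
        trimmed[trim_3prime(seq)] += count
    return dict(trimmed)


def normalize(trimmed):
    """Fold each trimmed sequence that is a 3' fragment (a suffix) of a
    longer trimmed sequence into the longest such sequence."""
    normalized = {}
    # Visit longest sequences first so parents are registered before fragments.
    for seq in sorted(trimmed, key=len, reverse=True):
        parent = next((p for p in normalized if p.endswith(seq)), None)
        if parent is None:
            normalized[seq] = trimmed[seq]
        else:
            normalized[parent] += trimmed[seq]
    return normalized


# Toy unique sequences with read counts (made-up data).
unique_reads = {"GGGTTACGCCA": 5, "GGGTTACGCC": 2, "TTACGCCA": 3, "ACGTTT": 1}
trimmed = trim_and_count(unique_reads)
normalized = normalize(trimmed)
```

In this toy input, the two full-length reads collapse into one trimmed sequence, and the shorter read, which matches its 3’ end, is then absorbed during normalization.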

This database is the key input to anvi-merge-trnaseq, which takes one or more databases comprising the samples in an experiment and generates a trnaseq-contigs-db of tRNA seed sequences and trnaseq-profile-dbs. These tRNA-seq variant contigs and profile databases can then be manipulated and displayed in anvi’o like normal contigs-dbs and profile-dbs.
