Personal tools
You are here: Home Bioinformatics Standards Documents Human Readable Description (HRD)
Document Actions

Human Readable Description (HRD)

Description

HRDs are used as informative and readable text in FASTA headers for protein sequences

Application

The Human Readable Description Assignment is part of the iTAG pipeline.

The Human Readalbe Description that was assigned to a protein sequence:
  • comes from a high-scoring BLAST match
  • contains words occurring frequently in the descriptions of highest scoring BLAST matches
  • does not contain meaningless "fill words"
  • contains words also occurring in any GO terms assigned to the query protein

 

Basic Notations of the Descriptor

  • allowed are printable characters
  • not allowd are characters like newline and return according to FASTA format definition

Syntax of the HRD

(ProteinID) (high scoring description or name of hit gene) (version of AHRD + significance of HRD + ProteinID from where HRD is transferred); contains Interpro domain(s) (Interpro accession + description)

Example

>AC225517.13.1 Myosin heavy chain kinase A (AHRD V1 **-- P42527); contains Interpro domain(s) IPR020472 G-protein beta WD-40 repeat, region

Significance of the HRD

Example: [**--]

Character Criteria Criteria fulfilled Criteria not fulfilled
1 Bit score of the blast result is >50 and e-value is <e-10 * -
2 Overlap of the blast result is >60% * -
3 Top token score of assigned HRD is >0.5 * -
4 Gene ontology terms found in description line * -

 

EU-SOL Authority

MPIZ-H

Scope and Duration of Validity

The scope and duration of validity of the HRDs are 1:1 related to the batches and releases of the iTAG pipeline.

Version

1.0

Powered by Plone, the Open Source Content Management System