Human Readable Description (HRD)
Description
HRDs are used as informative and readable text in FASTA headers for protein sequencesApplication
The Human Readable Description Assignment is part of the iTAG pipeline.The Human Readalbe Description that was assigned to a protein sequence:
- comes from a high-scoring BLAST match
- contains words occurring frequently in the descriptions of highest scoring BLAST matches
- does not contain meaningless "fill words"
- contains words also occurring in any GO terms assigned to the query protein
Basic Notations of the Descriptor
- allowed are printable characters
- not allowd are characters like newline and return according to FASTA format definition
Syntax of the HRD
(ProteinID) (high scoring description or name of hit gene) (version of AHRD + significance of HRD + ProteinID from where HRD is transferred); contains Interpro domain(s) (Interpro accession + description)Example
>AC225517.13.1 Myosin heavy chain kinase A (AHRD V1 **-- P42527); contains Interpro domain(s) IPR020472 G-protein beta WD-40 repeat, region
Significance of the HRD
Example: [**--]| Character | Criteria | Criteria fulfilled | Criteria not fulfilled |
| 1 | Bit score of the blast result is >50 and e-value is <e-10 | * | - |
| 2 | Overlap of the blast result is >60% | * | - |
| 3 | Top token score of assigned HRD is >0.5 | * | - |
| 4 | Gene ontology terms found in description line | * | - |