| Definitions: |
In principle, taxonomic distributions can be described in two ways:
|
| Interpretation of phylogenetic distributions of single domains: |
|
Studies of protein domain frequencies in proteomes from complete eukaryote genomes have convincingly demonstrated that individual domain types can have very different evolutionary fates (PMID: 14759257). With respect to the quantitative evaluation of domain frequencies (phylogenetic distribution), they can be uniformly distributed over the taxonomic range or experience lineage-specific expansion. This classification is known to have functional significance- uniformly distributed domains are mostly involved in basic biological mechanisms, while taxon- and lineage-specific expanding domains are likely serving adaptive functions (PMID: 12097341).
Example: 1) Domains which appear uniform distribution using chi-square tests and are present in at least 15 proteomes 2) Domains which appear to be expanding in particular taxons using Dixon's outliers test (P value less than 0.05): a) plants exp b) fungal exp c) insect exp d)chordata exp
|
| Interpretation of phylogenetic profiles of domains: |
| Based on its phylogenetic signature a domain can be found in all known proteomes (omnipresent) or only in some sections of the phylogenetic tree (determined with the phylogenetic signature). If a domain is found only in a subtree of the phylogenetic hierarchy, it can be called lineage-specific. As above this classification is of functional importance: omnipresent domains are mostly involved in basic biological mechanisms, while lineage-specific domains are likely serving adaptive functions (PMID: 12097341). |
| Interpretation of phylogenetic distributions of domain pairs: |
We could show that functional relationship between domains is
associated with high correlation of their respective taxonomic distributions. Although the
performance of the various correlation coefficients is similar, the Pearson cc appears slightly more
predictive and is, therefore, used by PhyloDome.
Figure: Taxonomic correlation and functional link between domain pairs |