| Number of genes: | 3506 |
| Number of books for multi-domain architectures: | 1205 |
| Number of books for Pfam domains: | 5868 |
| Number of genes in multi-domain architecture books: | 921 (26.3%) |
| Number of genes in books for Pfam domains: | 2270 (64.7%) |
| Number of genes in any book: | 2554 (72.8%) |
To download files pertaining to an individual PhyloFacts 3.0 family, please visit the corresponding family webpage from here.
The PhyloFacts Encyclopedia is composed of "books" to represent gene families, clustered in two distinct ways: requiring agreement along the entire multi-domain architecture, and based on sharing a single Pfam domain. Each book contains a multiple sequence alignment, phylogenetic tree, inferred orthologs (using the PHOG algorithm (Datta et al., Nucleic Acids Research, 2009)), hidden Markov model, and associated experimental and annotation data.
Details on the library construction pipeline are provided in the following article.
"PhyloFacts: An online structural phylogenomic encyclopedia for protein
functional and structural classification,"
Genome Biology 2006,
7:R83
PDF.