Loading...
Please wait, while we are loading the content...
Similar Documents
Structural Genomics Analysis : Phylogenetic Patterns of Unique , Shared , and Common Folds in 20 Genomes
| Content Provider | Semantic Scholar |
|---|---|
| Author | Hegyi, Hédi Lin, Jimmy Gerstein, Mark B. |
| Copyright Year | 2001 |
| Abstract | We carried out a structural-genomics analysis of the folds in the first 20 completely sequenced genomes, focusing on the patterns of fold usage. We assigned folds to sequences using PSI-blast, run with a systematic protocol to reduce the amount of computational overhead. On average, folds could be assigned to about a fourth of the ORFs in the genomes and about a fifth of the amino acids in the proteomes. More than 80% of all the folds in the scop structural classification were identified in one of the 20 organisms, with worm and E. coli having the largest number of distinct folds. Folds are particularly effective at comprehensively measuring levels of gene duplication, as they group together even very remote homologues. Using folds, we find the average level of duplication varies depending on the complexity of the organism, ranging from 2.4 in M. genitalium to 32 for the worm -values significantly higher than those observed based purely on sequence similarity. We rank the common folds in the 20 organisms, finding that the top three folds are the P-loop NTP hydrolase, the ferrodoxin fold, and the TIM-barrel. We also discuss in detail the many factors that affect and bias these rankings. From the overall patterns of shared folds, we were able to group the 20 organisms into a whole-genome tree, which is similar but not identical to the classic ribosomal tree. We also focus on specific patterns of fold (and fold pair) occurrence in the genomes, associating some of them with instances of horizontal transfer and others with gene loss. In particular, we find three possible examples of transfer between archaea and bacteria and six between eukarya and bacteria. We make available our detailed results at the following URL: http://bioinfo.mbb.yale.edu/genome/20. |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://archive.gersteinlab.org/papers/e-print/foldpat20-proteins/032201/fp-full-document.pdf |
| Alternate Webpage(s) | http://archive.gersteinlab.org/papers/e-print/foldpat20-proteins/032201/pdf/fp-text-revised.pdf |
| Alternate Webpage(s) | http://archive.gersteinlab.org/papers/e-print/foldpat20-proteins/fp-full-document.pdf |
| Alternate Webpage(s) | http://archive.gersteinlab.org/papers/e-print/foldpat20-proteins/pdf/fp-text-revised.pdf |
| Alternate Webpage(s) | http://papers.gersteinlab.org/e-print/foldpat20-proteins/032201/pdf/fp-text-revised.pdf |
| Alternate Webpage(s) | http://papers.gersteinlab.org/e-print/foldpat20-proteins/pdf/fp-text-revised.pdf |
| Alternate Webpage(s) | http://papers.gersteinlab.org/e-print/foldpat20-proteins/fp-full-document.pdf |
| Alternate Webpage(s) | http://papers.gersteinlab.org/e-print/foldpat20-proteins/032201/fp-full-document.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |