This page supplies supplementary material for the manuscript
"The Brix domain protein family - a key to the ribosomal biogenesis pathway"
by Frank Eisenhaber, Christian Wechselberger, Guenther Kreil
Acknowledgements:
We thank M. Breitenbach, F.M. Jantsch and G. Lepperdinger for sharing experimental data on Brix and yol077c with us prior to publication, for discussing the sequence analysis results and for carefully reading the manuscript.
Proteins in the Peter Pan family and in group 1 of hypothetical proteins are typically no longer than 475 residues (346-475 residues and 295-434 residues, respectively). However, four proteins, all of them hypothetical conceptual translations of genomic sequences, are unusually long at ~700 residues.
Predicted TM regions (start-end):
450-470
492-512
526-546
567-587
625-645
667-687
729-749

PFAM:
7tm_1: domain 1 of 1, from 465 to 693: score 116.8, E = 4.5e-36
              *->GNlLVilvilrtkklr..tptnifilNLAvADLLflltlppwalyyl
                 N+L+++ ++ +k r+ p+ +f + LAv+DLL +ltlpp a+y +
UI_emb|CAB   465 SNGLALYRFSI-RKQRpwHPAVVFSVQLAVSDLLCALTLPPLAAYLY 510

                 vggsedWpfGsalCklvtaldvvnmyaSillLtaISiDRYlAIvhPlryr
                 +W +G+a C+l +l+++n+ S+ ++t+IS+ RYl IvhP+ +r
UI_emb|CAB   511 PP--KHWRYGEAACRLERFLFTCNLLGSVIFITCISLNRYLGIVHPFFAR 558

                 rrrtsprrAkvvillvWvlalllslPpllfswvktveegngtln......
                 ++ + p++A++v+++ Wvla+ll++P l fs +k+++ + +++ + +
UI_emb|CAB   559 SHLR-PKHAWAVSAAGWVLAALLAMPTLSFSHLKRPPQQG--AGncsvar 605

                 .vnvtvClidfpeestasvstwlrsyvllstlvgFllPllvilvcYtrIl
                 ++ + Cl + +++ + +r+y l++ +g lPll+ l +Y+
UI_emb|CAB   606 pEACIKCLGTADHGL-----AAYRAYSLVLAGLGCGLPLLLTLAAYGALG 650

                 rtlr...........kaaktllvvvvvFvlCWlPyfivllldt<-*
                 r++ ++++ + ++ ++a +++ v + + + +Py+i+ +l++
UI_emb|CAB   651 RAVLrspgmtvaeklRVAALVASGVALYASSYVPYHIMRVLNV 693
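The relation between the predicted TM regions and the 7tm_1 match listed above can be checked with a short script. This is only a sketch: it assumes that the TM numbers are consecutive start/end pairs, and the helper name `classify` and its three labels are introduced here for illustration.

```python
# Predicted TM boundaries as listed above, read as consecutive (start, end) pairs
# (the pairing is an assumption; each pair then spans 21 residues).
tm_bounds = [450, 470, 492, 512, 526, 546, 567, 587, 625, 645, 667, 687, 729, 749]
tm_regions = list(zip(tm_bounds[0::2], tm_bounds[1::2]))

# Extent of the Pfam 7tm_1 match reported above.
DOMAIN_START, DOMAIN_END = 465, 693


def classify(start, end, dom_start=DOMAIN_START, dom_end=DOMAIN_END):
    """Hypothetical helper: place one TM region relative to the domain match."""
    if start >= dom_start and end <= dom_end:
        return "inside"
    if start <= dom_end and end >= dom_start:
        return "partial"
    return "outside"


summary = [(start, end, classify(start, end)) for start, end in tm_regions]
for start, end, label in summary:
    print(f"{start}-{end}: {label}")
```

Under these assumptions, five of the seven predicted regions lie entirely within the 7tm_1 match, the first one (450-470) overlaps its N-terminal end, and the last one (729-749) lies beyond residue 693.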
Low complexity regions: SEG 12 2.2 2.5
(lowercase: low-complexity segments; uppercase: remaining sequence)
>gi|7505687|pir||T32923 hypothetical protein K09H9.6 - Caenorhabditis elegans^Agi|2804465|gb|AAB97575.1| (AF043700) contains similarity to human RNA-binding protein FUS/TLS (SW:Q28009) [Caenorhabditis elegans]

1-1      M
2-11     kmkkgkkqrk
12-32    SGAHNRGNVDALSQKDALYHQ
33-46    ekfvkklqkqkfle
47-86    NKEIELARQPHCLVIHRGDVGKYVKGLESD
         LRNLVEPNTA
87-98    knlkilkrnnik
99-384   DFIVNGAVLGVTNMMVLTSSDASLQLRMMR
         FSQGPTLSFKVKQYSLARHVVNCQKRPVAT
         DKLFKSSPLVVMNGFGDGTQKHLSLVQTFI
         QNMFPSINVDTIQLGNLKRCLIVSYDEETD
         EIQMRHFAIRVVASGLNKSVKKLMQAEKTM
         GKNIPNLSTYKDISDYFLNPGQFRIRTKLN
         FSNINHKIIIKKYSIYLNFPSFSSPGQLSD
         SEFEGDQQEVELPQDISEGRGCGVGQKSNV
         RLHEIGPRLTLELVKIEEGIDEGEVLYHKH
         NAKTPDELIKLRAHMD
385-400  kkkqmkkrreqeseqr
401-477  VIRRLTIVKEQQDAEEAEVKAIRENAARKQ
         AAATGQVEEVENQKEKDREIAMNRERDLKR
         ANEEWGTSEASKRPRYE
478-608  dsrggfrggfrgrgedrggfrgrggdrggf
         rgrdrdgggfrgrsvdrggfrggggdrggf
         rgrssdrggdrggfrgrsgdrdggfrggfg
         grggggfrggdrggfrgrgggggfrggrgg
         drgggfrggrr
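As a rough illustration of how much of K09H9.6 the SEG run above marks as low complexity, the segment ranges can be summed. The sketch assumes that the lowercase segments are the low-complexity ones and that the end of the last segment (608) is the protein length; the variable names are hypothetical.

```python
# Residue ranges of the lowercase (assumed low-complexity) segments in the
# SEG output above, as (start, end), 1-based and inclusive.
low_complexity = [(2, 11), (33, 46), (87, 98), (385, 400), (478, 608)]

# Assumption: the last reported segment ends at the C-terminus.
PROTEIN_LENGTH = 608


def segment_length(start, end):
    """Number of residues in an inclusive 1-based range."""
    return end - start + 1


low_total = sum(segment_length(start, end) for start, end in low_complexity)
fraction = low_total / PROTEIN_LENGTH
longest = max(low_complexity, key=lambda seg: segment_length(*seg))

print(f"low-complexity residues: {low_total} of {PROTEIN_LENGTH} ({fraction:.1%})")
print(f"longest low-complexity segment: {longest[0]}-{longest[1]}")
```

Under these assumptions, most of the low-complexity residues come from the single C-terminal segment 478-608.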
IMPALA:
AAA AAA+ ATPase Module
Length = 298
Score = 35.0 bits (79), Expect = 3e-04
Identities = 32/193 (16%), Positives = 32/193 (16%), Gaps = 14/193 (7%)

Query:  95 RKPLVLSFHGYTGSGKNYVAEIIANNTFRLGLRSTFVQHIVATNDFPDKNKLEEYQVELR 154
Sbjct:  78 AQPKGVLLYGPPGTGKTLLARAVAHHTDC---------TFIRVSGSELVQKFIGEGARMV 128

Query: 155 NRILTTVQKCQRSIFIFDEADKLPEQLLGAIKPFLDYYSTISGVDFRRSIFILLSNKGGG 214
Sbjct: 129 RELFVMAREHAPSIIFMDEIDSIGSRLEGGSGGDSEVQRTMLELLNQLDGFEATKN---- 184

Query: 215 EIARITKEQYESGYPREQLRLEAFERELMNFSYNEKGGLQMSELISNHLIDHFVPFLPLQ 274
Sbjct: 185 -IKVIMATNRIDILDSALLRPGRIDRKIEFPPPNEEARLDILKIHSRKMNLTRGINLRKI 243

Query: 275 REHVRSCVGAYLR 287
Sbjct: 244 AELMPGASGAEVK 256

PFAM:
Model    Domain  seq-f seq-t    hmm-f hmm-t      score  E-value
AAA        1/1      99   287 ..     1   216 []   -25.3     0.15

AAA: domain 1 of 1, from 99 to 287: score -25.3, E = 0.15
              *->gvLLyGPPGtGKTlLAkavAkelg......vpfisisg......sel
                 + ++G G+GK +A+++A+ + + + ++ + i ++++ ++
CE_pir|T19    99 VLSFHGYTGSGKNYVAEIIANNTFrlglrsTFVQHIVAtndfpdKNK 145

                 vskyvGesekrvralfelArkslkkaaPspiiFIDEiDalapkRgdegdv
                 +++y e + r+ + ++ +s i ++DE+D+l +
CE_pir|T19   146 LEEYQVELRNRILTTVQKCQRS--------IFIFDEADKLPEQ------- 180

                 servvnqLLtemDLerigfekhylr..vsdvvDlsgviviaaTNrpdlld
                 LL + f ++y++ ++ vD+++ i+i+ +N + +
CE_pir|T19   181 -------LLGAIK----PFLDYYSTisG---VDFRRSIFILLSNKGGGEI 216

                 paLlrpGRfdrrievplPdeeerleIlkihlkkmplalc..qerselakd
                 + + ++ e ++P e+ rle +++ l ++ +++++ +sel ++
CE_pir|T19   217 ARITK-----EQYESGYPREQLRLEAFERELMNFSYNEKggLQMSELISN 261

                 vdldelakelArrtpgfsgadlaa..lcreAalralr<-*
                 + +d+ f+ +++++r + lr
CE_pir|T19   262 HLIDH-----------FVPFLPLQreHVRSCVGAYLR 287
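The alignment length, gap count and identity count reported by IMPALA above can be recomputed from the Query and Sbjct rows. The following is a minimal sketch: the rows are concatenated by hand, and the integer truncation used for the percentages is an assumption that happens to reproduce the reported 16% and 7%, not a documented property of the program.

```python
# Query and Sbjct rows of the IMPALA alignment above, concatenated in order.
query = (
    "RKPLVLSFHGYTGSGKNYVAEIIANNTFRLGLRSTFVQHIVATNDFPDKNKLEEYQVELR"
    "NRILTTVQKCQRSIFIFDEADKLPEQLLGAIKPFLDYYSTISGVDFRRSIFILLSNKGGG"
    "EIARITKEQYESGYPREQLRLEAFERELMNFSYNEKGGLQMSELISNHLIDHFVPFLPLQ"
    "REHVRSCVGAYLR"
)
sbjct = (
    "AQPKGVLLYGPPGTGKTLLARAVAHHTDC---------TFIRVSGSELVQKFIGEGARMV"
    "RELFVMAREHAPSIIFMDEIDSIGSRLEGGSGGDSEVQRTMLELLNQLDGFEATKN----"
    "-IKVIMATNRIDILDSALLRPGRIDRKIEFPPPNEEARLDILKIHSRKMNLTRGINLRKI"
    "AELMPGASGAEVK"
)
assert len(query) == len(sbjct), "aligned rows must have equal length"

length = len(query)                           # number of alignment columns
gaps = query.count("-") + sbjct.count("-")    # gap characters in either row
identities = sum(q == s and q != "-" for q, s in zip(query, sbjct))


def percent(count, total):
    """Integer percentage, truncated (assumed rounding convention)."""
    return 100 * count // total


print(f"Identities = {identities}/{length} ({percent(identities, length)}%)")
print(f"Gaps = {gaps}/{length} ({percent(gaps, length)}%)")
```

The gap-free residue counts also agree with the coordinates shown: 193 query residues for positions 95-287 and 179 subject residues for positions 78-256.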