analysis of sequence from BAC01259.1.fa ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ >BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. MASSAKEEAK PKPRLIVRLG VFLASHHILF SAVCCTAGII ALLFLPSLAK NTYLSENALI PGSANTLFST EDVQEANRFA KGIEAAIGES RGGTTEIPKF IAQQTKNLGA EVYYHEFLPD SKCFHPLKFF TSMTNNMAAK PNGTYTNFGI NTVGIIRAPR GDGKEAIVLV TPYNSQKVTP NELLSLALGF SVFSLLSRAA WLSKDIVWLS ADSQFGEYSA VSSWLNQYHN PMFLSHPVNL DTKIYGANQI LYKPDGTAEK AELMAFKRAG TMAAALIFKV GETRKYGDRD SVTMYAEASN GQMPNLDLLN VVHYLAVHRQ GFRVNVETFN SLLSSSWLRV IAEVFQNLGS LLRKINPDWK LDVTVPDYVE GTANLASSMY NQALGVPTGS HGAFRDYQVD AVSLEFAPAF HLKNENAKSS FLLRGGRLTE GVVRSVNNLL EKFHQSFFLY FLTAPSKFIS VGVYMIPFAL LLAPLPIVAA ALAGGSKTKG KLEDECKTKG NADDLQMEGG SWKWLKSARV LLIIQFWAVL VSLLPYYISQ IPGAMPIQYA VIWAVLSITI LIILYAMFGS PSRAGVEWKL LKATMITSIT IGMGLMSIIN FATAQLGALI LIPMCLFSRP LRAQLEMNFL PRTVLLASNI LLTVLGFPPA AFLIMKGLSK GSWTVDIVGD FWLWMEFLWE WSSATYLYVF LVHLPCWLLC IHVLLHPCYQ PESKMKQE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ sec.str. with predator > BAC01259.1 . . . . . 1 MASSAKEEAKPKPRLIVRLGVFLASHHILFSAVCCTAGIIALLFLPSLAK 50 _____________EEEEHHHHHHHHHHHHHHH_HHHHHHHHHH_______ . . . . . 51 NTYLSENALIPGSANTLFSTEDVQEANRFAKGIEAAIGESRGGTTEIPKF 100 _______EEE__________HHHHHHHHHHHHHHHHH_________HHHH . . . . . 101 IAQQTKNLGAEVYYHEFLPDSKCFHPLKFFTSMTNNMAAKPNGTYTNFGI 150 HHHHHHH___EEEEEEE_________EEE_____________________ . . . . . 151 NTVGIIRAPRGDGKEAIVLVTPYNSQKVTPNELLSLALGFSVFSLLSRAA 200 __EEEEEE______EEEEEE___________HHHHHHH__HHHHHHHHHH . . . . . 201 WLSKDIVWLSADSQFGEYSAVSSWLNQYHNPMFLSHPVNLDTKIYGANQI 250 HH__EEEEEE__________HHHHHHHH____________________EE . . . . . 251 LYKPDGTAEKAELMAFKRAGTMAAALIFKVGETRKYGDRDSVTMYAEASN 300 EE_____HHHHHHHHHHHHHHHHHHHHHHH___________EEEEE____ . . . . . 301 GQMPNLDLLNVVHYLAVHRQGFRVNVETFNSLLSSSWLRVIAEVFQNLGS 350 ________HHHHHHHHHHH____EEEE_______HHHHHHHHHHHHHHHH . . . . . 351 LLRKINPDWKLDVTVPDYVEGTANLASSMYNQALGVPTGSHGAFRDYQVD 400 H________EEEEE_________HHHHHHHHHEEE_____________EE . . . . . 401 AVSLEFAPAFHLKNENAKSSFLLRGGRLTEGVVRSVNNLLEKFHQSFFLY 450 EEEEEE____________EEEE____________HHHHHHHHHHHHHEEE . . . . . 451 FLTAPSKFISVGVYMIPFALLLAPLPIVAAALAGGSKTKGKLEDECKTKG 500 EE_____EEEEEEEEEEEEEEEEEHHHHHHHH________HHHHHHH___ . . . . . 501 NADDLQMEGGSWKWLKSARVLLIIQFWAVLVSLLPYYISQIPGAMPIQYA 550 ____________HHHHHHHHHHHHHHHHHHHH___EEEE______HHHHH . . . . . 551 VIWAVLSITILIILYAMFGSPSRAGVEWKLLKATMITSITIGMGLMSIIN 600 HHHHHHHHHHHHHHHHH_______HHHHHHHHHHHHHEEE___EEEEEEE . . . . . 601 FATAQLGALILIPMCLFSRPLRAQLEMNFLPRTVLLASNILLTVLGFPPA 650 EHHHHH___EEEE__EEEHHHHHHHHHH_____HHHHHHHH_________ . . . . . 651 AFLIMKGLSKGSWTVDIVGDFWLWMEFLWEWSSATYLYVFLVHLPCWLLC 700 HHHHHH_______EEEEEHHHHHHHHHHHHHHHHHHHEEEEEE__HHHHH . 701 IHVLLHPCYQPESKMKQE 718 HHHH______________ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ method : 1 alpha-contents : 44.1 % beta-contents : 30.4 % coil-contents : 25.5 % class : mixed method : 2 alpha-contents : 29.5 % beta-contents : 39.0 % coil-contents : 31.6 % class : mixed ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ GPI: learning from metazoa -13.76 -3.03 -0.05 -0.07 -4.00 0.00 -24.00 -0.79 -0.96 -3.64 -5.56 -12.00 -12.00 -28.00 0.00 0.00 -107.85 0.32 0.00 0.00 0.00 0.00 0.00 -16.00 -3.80 -4.41 -4.98 -5.76 -12.00 -12.00 -28.00 -12.00 0.00 -98.62 ID: BAC01259.1 AC: xxx Len: 718 1:I 683 Sc: -98.62 Pv: 9.893267e-01 NO_GPI_SITE GPI: learning from protozoa -29.90 -1.47 -2.15 -3.44 0.00 0.00 -20.00 -0.93 -1.52 -3.98 -17.70 -12.00 -12.00 -28.00 -12.00 0.00 -145.09 -28.20 -4.69 -4.13 -0.68 -4.00 0.00 -32.00 0.00 -0.18 -15.26 -18.61 -12.00 -12.00 0.00 0.00 0.00 -131.75 ID: BAC01259.1 AC: xxx Len: 718 1:I 709 Sc: -131.75 Pv: 9.999636e-01 NO_GPI_SITE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ # SignalP euk predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? BAC01259.1 0.940 50 Y 0.792 481 Y 0.985 526 Y 0.205 N # SignalP gram- predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? BAC01259.1 0.636 605 Y 0.613 480 Y 0.961 40 Y 0.240 N # SignalP gram+ predictions # name Cmax pos ? Ymax pos ? Smax pos ? Smean ? BAC01259.1 0.614 86 Y 0.539 482 Y 0.992 636 Y 0.296 N ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ low complexity regions: SEG 12 2.2 2.5 >BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. 1-468 MASSAKEEAKPKPRLIVRLGVFLASHHILF SAVCCTAGIIALLFLPSLAKNTYLSENALI PGSANTLFSTEDVQEANRFAKGIEAAIGES RGGTTEIPKFIAQQTKNLGAEVYYHEFLPD SKCFHPLKFFTSMTNNMAAKPNGTYTNFGI NTVGIIRAPRGDGKEAIVLVTPYNSQKVTP NELLSLALGFSVFSLLSRAAWLSKDIVWLS ADSQFGEYSAVSSWLNQYHNPMFLSHPVNL DTKIYGANQILYKPDGTAEKAELMAFKRAG TMAAALIFKVGETRKYGDRDSVTMYAEASN GQMPNLDLLNVVHYLAVHRQGFRVNVETFN SLLSSSWLRVIAEVFQNLGSLLRKINPDWK LDVTVPDYVEGTANLASSMYNQALGVPTGS HGAFRDYQVDAVSLEFAPAFHLKNENAKSS FLLRGGRLTEGVVRSVNNLLEKFHQSFFLY FLTAPSKFISVGVYMIPF alllaplpivaaala 469-483 484-718 GGSKTKGKLEDECKTKGNADDLQMEGGSWK WLKSARVLLIIQFWAVLVSLLPYYISQIPG AMPIQYAVIWAVLSITILIILYAMFGSPSR AGVEWKLLKATMITSITIGMGLMSIINFAT AQLGALILIPMCLFSRPLRAQLEMNFLPRT VLLASNILLTVLGFPPAAFLIMKGLSKGSW TVDIVGDFWLWMEFLWEWSSATYLYVFLVH LPCWLLCIHVLLHPCYQPESKMKQE low complexity regions: SEG 25 3.0 3.3 >BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. 1-468 MASSAKEEAKPKPRLIVRLGVFLASHHILF SAVCCTAGIIALLFLPSLAKNTYLSENALI PGSANTLFSTEDVQEANRFAKGIEAAIGES RGGTTEIPKFIAQQTKNLGAEVYYHEFLPD SKCFHPLKFFTSMTNNMAAKPNGTYTNFGI NTVGIIRAPRGDGKEAIVLVTPYNSQKVTP NELLSLALGFSVFSLLSRAAWLSKDIVWLS ADSQFGEYSAVSSWLNQYHNPMFLSHPVNL DTKIYGANQILYKPDGTAEKAELMAFKRAG TMAAALIFKVGETRKYGDRDSVTMYAEASN GQMPNLDLLNVVHYLAVHRQGFRVNVETFN SLLSSSWLRVIAEVFQNLGSLLRKINPDWK LDVTVPDYVEGTANLASSMYNQALGVPTGS HGAFRDYQVDAVSLEFAPAFHLKNENAKSS FLLRGGRLTEGVVRSVNNLLEKFHQSFFLY FLTAPSKFISVGVYMIPF alllaplpivaaala 469-483 484-718 GGSKTKGKLEDECKTKGNADDLQMEGGSWK WLKSARVLLIIQFWAVLVSLLPYYISQIPG AMPIQYAVIWAVLSITILIILYAMFGSPSR AGVEWKLLKATMITSITIGMGLMSIINFAT AQLGALILIPMCLFSRPLRAQLEMNFLPRT VLLASNILLTVLGFPPAAFLIMKGLSKGSW TVDIVGDFWLWMEFLWEWSSATYLYVFLVH LPCWLLCIHVLLHPCYQPESKMKQE low complexity regions: SEG 45 3.4 3.75 >BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. 1-509 MASSAKEEAKPKPRLIVRLGVFLASHHILF SAVCCTAGIIALLFLPSLAKNTYLSENALI PGSANTLFSTEDVQEANRFAKGIEAAIGES RGGTTEIPKFIAQQTKNLGAEVYYHEFLPD SKCFHPLKFFTSMTNNMAAKPNGTYTNFGI NTVGIIRAPRGDGKEAIVLVTPYNSQKVTP NELLSLALGFSVFSLLSRAAWLSKDIVWLS ADSQFGEYSAVSSWLNQYHNPMFLSHPVNL DTKIYGANQILYKPDGTAEKAELMAFKRAG TMAAALIFKVGETRKYGDRDSVTMYAEASN GQMPNLDLLNVVHYLAVHRQGFRVNVETFN SLLSSSWLRVIAEVFQNLGSLLRKINPDWK LDVTVPDYVEGTANLASSMYNQALGVPTGS HGAFRDYQVDAVSLEFAPAFHLKNENAKSS FLLRGGRLTEGVVRSVNNLLEKFHQSFFLY FLTAPSKFISVGVYMIPFALLLAPLPIVAA ALAGGSKTKGKLEDECKTKGNADDLQMEG gswkwlksarvlliiqfwavlvsllpyyis 510-576 qipgampiqyaviwavlsitiliilyamfg spsragv 577-718 EWKLLKATMITSITIGMGLMSIINFATAQL GALILIPMCLFSRPLRAQLEMNFLPRTVLL ASNILLTVLGFPPAAFLIMKGLSKGSWTVD IVGDFWLWMEFLWEWSSATYLYVFLVHLPC WLLCIHVLLHPCYQPESKMKQE low complexity regions: XNU # Score cutoff = 21, Search from offsets 1 to 4 # both members of each repeat flagged # lambda = 0.347, K = 0.200, H = 0.664 >BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. MASSAKEEAKPKPRLIVRLGVFLASHHILFSAVCCTAGIIALLFLPSLAKNTYLSENALI PGSANTLFSTEDVQEANRFAKGIEAAIGESRGGTTEIPKFIAQQTKNLGAEVYYHEFLPD SKCFHPLKFFTSMTNNMAAKPNGTYTNFGINTVGIIRAPRGDGKEAIVLVTPYNSQKVTP NELLSLALGFSVFSLLSRAAWLSKDIVWLSADSQFGEYSAVSSWLNQYHNPMFLSHPVNL DTKIYGANQILYKPDGTAEKAELMAFKRAGTMAAALIFKVGETRKYGDRDSVTMYAEASN GQMPNLDLLNVVHYLAVHRQGFRVNVETFNSLLSSSWLRVIAEVFQNLGSLLRKINPDWK LDVTVPDYVEGTANLASSMYNQALGVPTGSHGAFRDYQVDAVSLEFAPAFHLKNENAKSS FLLRGGRLTEGVVRSVNNLLEKFHQSFFLYFLTAPSKFISVGVYMIPFALLLAPLPIVAA ALAGGSKTKGKLEDECKTKGNADDLQMEGGSWKWLKSARVLLIIQFWAVLVSLLPYYISQ IPGAMPIQYAVIWAVLSITILIILYAMFGSPSRAGVEWKLLKATMITSITIGMGLMSIIN FATAQLGALILIPMCLFSRPLRAQLEMNFLPRTVLLASNILLTVLGFPPAAFLIMKGLSK GSWTVDIVGDFWLWMEFLWEWSSATYLYVFLVHLPCWLLCIHVLLHPCYQPESKMKQE 1 - 718 MASSAKEEAK PKPRLIVRLG VFLASHHILF SAVCCTAGII ALLFLPSLAK NTYLSENALI PGSANTLFST EDVQEANRFA KGIEAAIGES RGGTTEIPKF IAQQTKNLGA EVYYHEFLPD SKCFHPLKFF TSMTNNMAAK PNGTYTNFGI NTVGIIRAPR GDGKEAIVLV TPYNSQKVTP NELLSLALGF SVFSLLSRAA WLSKDIVWLS ADSQFGEYSA VSSWLNQYHN PMFLSHPVNL DTKIYGANQI LYKPDGTAEK AELMAFKRAG TMAAALIFKV GETRKYGDRD SVTMYAEASN GQMPNLDLLN VVHYLAVHRQ GFRVNVETFN SLLSSSWLRV IAEVFQNLGS LLRKINPDWK LDVTVPDYVE GTANLASSMY NQALGVPTGS HGAFRDYQVD AVSLEFAPAF HLKNENAKSS FLLRGGRLTE GVVRSVNNLL EKFHQSFFLY FLTAPSKFIS VGVYMIPFAL LLAPLPIVAA ALAGGSKTKG KLEDECKTKG NADDLQMEGG SWKWLKSARV LLIIQFWAVL VSLLPYYISQ IPGAMPIQYA VIWAVLSITI LIILYAMFGS PSRAGVEWKL LKATMITSIT IGMGLMSIIN FATAQLGALI LIPMCLFSRP LRAQLEMNFL PRTVLLASNI LLTVLGFPPA AFLIMKGLSK GSWTVDIVGD FWLWMEFLWE WSSATYLYVF LVHLPCWLLC IHVLLHPCYQ PESKMKQE low complexity regions: DUST >BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. MASSAKEEAKPKPRLIVRLGVFLASHHILFSAVCCTAGIIALLFLPSLAKNTYLSENALI PGSANTLFSTEDVQEANRFAKGIEAAIGESRGGTTEIPKFIAQQTKNLGAEVYYHEFLPD SKCFHPLKFFTSMTNNMAAKPNGTYTNFGINTVGIIRAPRGDGKEAIVLVTPYNSQKVTP NELLSLALGFSVFSLLSRAAWLSKDIVWLSADSQFGEYSAVSSWLNQYHNPMFLSHPVNL DTKIYGANQILYKPDGTAEKAELMAFKRAGTMAAALIFKVGETRKYGDRDSVTMYAEASN GQMPNLDLLNVVHYLAVHRQGFRVNVETFNSLLSSSWLRVIAEVFQNLGSLLRKINPDWK LDVTVPDYVEGTANLASSMYNQALGVPTGSHGAFRDYQVDAVSLEFAPAFHLKNENAKSS FLLRGGRLTEGVVRSVNNLLEKFHQSFFLYFLTAPSKFISVGVYMIPFALLLAPLPIVAA ALAGGSKTKGKLEDECKTKGNADDLQMEGGSWKWLKSARVLLIIQFWAVLVSLLPYYISQ IPGAMPIQYAVIWAVLSITILIILYAMFGSPSRAGVEWKLLKATMITSITIGMGLMSIIN FATAQLGALILIPMCLFSRPLRAQLEMNFLPRTVLLASNILLTVLGFPPAAFLIMKGLSK GSWTVDIVGDFWLWMEFLWEWSSATYLYVFLVHLPCWLLCIHVLLHPCYQPESKMKQE ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ coiled coil prediction for BAC01259.1 sequence: 718 amino acids, 0 residue(s) in coiled coil state . | . | . | . | . | . 60 MASSAKEEAK PKPRLIVRLG VFLASHHILF SAVCCTAGII ALLFLPSLAK NTYLSENALI ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 120 PGSANTLFST EDVQEANRFA KGIEAAIGES RGGTTEIPKF IAQQTKNLGA EVYYHEFLPD ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 180 SKCFHPLKFF TSMTNNMAAK PNGTYTNFGI NTVGIIRAPR GDGKEAIVLV TPYNSQKVTP ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 240 NELLSLALGF SVFSLLSRAA WLSKDIVWLS ADSQFGEYSA VSSWLNQYHN PMFLSHPVNL ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 300 DTKIYGANQI LYKPDGTAEK AELMAFKRAG TMAAALIFKV GETRKYGDRD SVTMYAEASN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 360 GQMPNLDLLN VVHYLAVHRQ GFRVNVETFN SLLSSSWLRV IAEVFQNLGS LLRKINPDWK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 420 LDVTVPDYVE GTANLASSMY NQALGVPTGS HGAFRDYQVD AVSLEFAPAF HLKNENAKSS ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 480 FLLRGGRLTE GVVRSVNNLL EKFHQSFFLY FLTAPSKFIS VGVYMIPFAL LLAPLPIVAA ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 540 ALAGGSKTKG KLEDECKTKG NADDLQMEGG SWKWLKSARV LLIIQFWAVL VSLLPYYISQ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 600 IPGAMPIQYA VIWAVLSITI LIILYAMFGS PSRAGVEWKL LKATMITSIT IGMGLMSIIN ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . 660 FATAQLGALI LIPMCLFSRP LRAQLEMNFL PRTVLLASNI LLTVLGFPPA AFLIMKGLSK ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 -w border ---------- ---------- ---------- ---------- ---------- ---------- * 21 M'95 -w register ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 M'95 +w polar ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 21 MTK -w class ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 28 M'95 -w signif. ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ * 14 M'95 -w local . | . | . | . | . | . GSWTVDIVGD FWLWMEFLWE WSSATYLYVF LVHLPCWLLC IHVLLHPCYQ PESKMKQE ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~ ---------- ---------- ---------- ---------- ---------- -------- ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ prediction of transmembrane regions with toppred2 *********************************** *TOPPREDM with eukaryotic function* *********************************** BAC01259.1.fa.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: BAC01259.1.fa.___inter___ (1 sequences) MASSAKEEAKPKPRLIVRLGVFLASHHILFSAVCCTAGIIALLFLPSLAK NTYLSENALIPGSANTLFSTEDVQEANRFAKGIEAAIGESRGGTTEIPKF IAQQTKNLGAEVYYHEFLPDSKCFHPLKFFTSMTNNMAAKPNGTYTNFGI NTVGIIRAPRGDGKEAIVLVTPYNSQKVTPNELLSLALGFSVFSLLSRAA WLSKDIVWLSADSQFGEYSAVSSWLNQYHNPMFLSHPVNLDTKIYGANQI LYKPDGTAEKAELMAFKRAGTMAAALIFKVGETRKYGDRDSVTMYAEASN GQMPNLDLLNVVHYLAVHRQGFRVNVETFNSLLSSSWLRVIAEVFQNLGS LLRKINPDWKLDVTVPDYVEGTANLASSMYNQALGVPTGSHGAFRDYQVD AVSLEFAPAFHLKNENAKSSFLLRGGRLTEGVVRSVNNLLEKFHQSFFLY FLTAPSKFISVGVYMIPFALLLAPLPIVAAALAGGSKTKGKLEDECKTKG NADDLQMEGGSWKWLKSARVLLIIQFWAVLVSLLPYYISQIPGAMPIQYA VIWAVLSITILIILYAMFGSPSRAGVEWKLLKATMITSITIGMGLMSIIN FATAQLGALILIPMCLFSRPLRAQLEMNFLPRTVLLASNILLTVLGFPPA AFLIMKGLSKGSWTVDIVGDFWLWMEFLWEWSSATYLYVFLVHLPCWLLC IHVLLHPCYQPESKMKQE (p)rokaryotic or (e)ukaryotic: e Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 1 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 28 48 2.147 Certain 2 182 202 1.201 Certain 3 463 483 1.990 Certain 4 519 539 1.627 Certain 5 549 569 2.278 Certain 6 583 603 1.919 Certain 7 635 655 1.662 Certain 8 685 705 1.703 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 7 8 Loop length 27 133 260 35 9 13 31 29 13 K+R profile 6.00 + 0.00 3.00 2.00 + 8.00 3.00 2.00 CYT-EXT prof - 1.68 - - - 0.71 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -2.00 Tm probability: 1.00 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 3.00 (NEG-POS)/(NEG+POS): -0.4286 NEG: 2.0000 POS: 5.0000 -> Orientation: N-in CYT-EXT difference: 0.96 -> Orientation: N-out ---------------------------------------------------------------------- "BAC01259" 718 28 48 #t 2.14688 182 202 #t 1.20104 463 483 #t 1.98958 519 539 #t 1.62708 549 569 #t 2.27813 583 603 #t 1.91875 635 655 #t 1.6625 685 705 #t 1.70312 ************************************ *TOPPREDM with prokaryotic function* ************************************ BAC01259.1.fa.___inter___ is a single sequence Using hydrophobicity file: /bio_software/2D/toppredm/lib/Engelman-Steitz.scale Using cyt/ext file: /bio_software/2D/toppredm/lib/Cyt-Ext.prok Using sequence file: BAC01259.1.fa.___inter___ (1 sequences) MASSAKEEAKPKPRLIVRLGVFLASHHILFSAVCCTAGIIALLFLPSLAK NTYLSENALIPGSANTLFSTEDVQEANRFAKGIEAAIGESRGGTTEIPKF IAQQTKNLGAEVYYHEFLPDSKCFHPLKFFTSMTNNMAAKPNGTYTNFGI NTVGIIRAPRGDGKEAIVLVTPYNSQKVTPNELLSLALGFSVFSLLSRAA WLSKDIVWLSADSQFGEYSAVSSWLNQYHNPMFLSHPVNLDTKIYGANQI LYKPDGTAEKAELMAFKRAGTMAAALIFKVGETRKYGDRDSVTMYAEASN GQMPNLDLLNVVHYLAVHRQGFRVNVETFNSLLSSSWLRVIAEVFQNLGS LLRKINPDWKLDVTVPDYVEGTANLASSMYNQALGVPTGSHGAFRDYQVD AVSLEFAPAFHLKNENAKSSFLLRGGRLTEGVVRSVNNLLEKFHQSFFLY FLTAPSKFISVGVYMIPFALLLAPLPIVAAALAGGSKTKGKLEDECKTKG NADDLQMEGGSWKWLKSARVLLIIQFWAVLVSLLPYYISQIPGAMPIQYA VIWAVLSITILIILYAMFGSPSRAGVEWKLLKATMITSITIGMGLMSIIN FATAQLGALILIPMCLFSRPLRAQLEMNFLPRTVLLASNILLTVLGFPPA AFLIMKGLSKGSWTVDIVGDFWLWMEFLWEWSSATYLYVFLVHLPCWLLC IHVLLHPCYQPESKMKQE (p)rokaryotic or (e)ukaryotic: p Charge-pair energy: 0 Length of full window (odd number!): 21 Length of core window (odd number!): 11 Number of residues to add to each end of helix: 1 Critical length: 60 Upper cutoff for candidates: 1 Lower cutoff for candidates: 0.6 Total of 1 structures are to be tested Candidate membrane-spanning segments: Helix Begin End Score Certainity 1 28 48 2.147 Certain 2 182 202 1.201 Certain 3 463 483 1.990 Certain 4 519 539 1.627 Certain 5 549 569 2.278 Certain 6 583 603 1.919 Certain 7 635 655 1.662 Certain 8 685 705 1.703 Certain ---------------------------------------------------------------------- Structure 1 Transmembrane segments included in this structure: Segment 1 2 3 4 5 6 7 8 Loop length 27 133 260 35 9 13 31 29 13 K+R profile 6.00 + 0.00 3.00 2.00 + 8.00 3.00 2.00 CYT-EXT prof - 1.68 - - - 0.71 - - - For CYT-EXT profile neg. values indicate cytoplasmic preference. K+R difference: -2.00 Tm probability: 1.00 -> Orientation: N-out Charge-difference over N-terminal Tm (+-15 residues): 3.00 (NEG-POS)/(NEG+POS): -0.4286 NEG: 2.0000 POS: 5.0000 -> Orientation: N-in CYT-EXT difference: 0.96 -> Orientation: N-out ---------------------------------------------------------------------- "BAC01259" 718 28 48 #t 2.14688 182 202 #t 1.20104 463 483 #t 1.98958 519 539 #t 1.62708 549 569 #t 2.27813 583 603 #t 1.91875 635 655 #t 1.6625 685 705 #t 1.70312 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ SAPS. Version of April 11, 1996. Date run: Tue Jan 28 11:24:29 2003 File: /people/b_eisen/BAC01259.1.fa.___saps___ ID BAC01259.1 DE putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. number of residues: 718; molecular weight: 79.4 kdal 1 MASSAKEEAK PKPRLIVRLG VFLASHHILF SAVCCTAGII ALLFLPSLAK NTYLSENALI 61 PGSANTLFST EDVQEANRFA KGIEAAIGES RGGTTEIPKF IAQQTKNLGA EVYYHEFLPD 121 SKCFHPLKFF TSMTNNMAAK PNGTYTNFGI NTVGIIRAPR GDGKEAIVLV TPYNSQKVTP 181 NELLSLALGF SVFSLLSRAA WLSKDIVWLS ADSQFGEYSA VSSWLNQYHN PMFLSHPVNL 241 DTKIYGANQI LYKPDGTAEK AELMAFKRAG TMAAALIFKV GETRKYGDRD SVTMYAEASN 301 GQMPNLDLLN VVHYLAVHRQ GFRVNVETFN SLLSSSWLRV IAEVFQNLGS LLRKINPDWK 361 LDVTVPDYVE GTANLASSMY NQALGVPTGS HGAFRDYQVD AVSLEFAPAF HLKNENAKSS 421 FLLRGGRLTE GVVRSVNNLL EKFHQSFFLY FLTAPSKFIS VGVYMIPFAL LLAPLPIVAA 481 ALAGGSKTKG KLEDECKTKG NADDLQMEGG SWKWLKSARV LLIIQFWAVL VSLLPYYISQ 541 IPGAMPIQYA VIWAVLSITI LIILYAMFGS PSRAGVEWKL LKATMITSIT IGMGLMSIIN 601 FATAQLGALI LIPMCLFSRP LRAQLEMNFL PRTVLLASNI LLTVLGFPPA AFLIMKGLSK 661 GSWTVDIVGD FWLWMEFLWE WSSATYLYVF LVHLPCWLLC IHVLLHPCYQ PESKMKQE -------------------------------------------------------------------------------- COMPOSITIONAL ANALYSIS (extremes relative to: swp23s) A : 69( 9.6%); C : 8( 1.1%); D : 20( 2.8%); E : 33( 4.6%); F : 40( 5.6%) G : 47( 6.5%); H : 14( 1.9%); I : 43( 6.0%); K : 38( 5.3%); L : 93(13.0%) M : 21( 2.9%); N : 33( 4.6%); P : 35( 4.9%); Q : 21( 2.9%); R : 23( 3.2%) S : 57( 7.9%); T : 36( 5.0%); V : 47( 6.5%); W : 16( 2.2%); Y : 24( 3.3%) KR : 61 ( 8.5%); ED : 53 ( 7.4%); AGP : 151 ( 21.0%); KRED : 114 ( 15.9%); KR-ED : 8 ( 1.1%); FIKMNY : 199 ( 27.7%); LVIFM : 244 ( 34.0%); ST : 93 ( 13.0%). -------------------------------------------------------------------------------- CHARGE DISTRIBUTIONAL ANALYSIS 1 00000+--0+ 0+0+000+00 0000000000 0000000000 000000000+ 00000-0000 61 0000000000 --00-00+00 +00-0000-0 +0000-00+0 00000+0000 -0000-000- 121 0+00000+00 000000000+ 0000000000 000000+00+ 0-0+-00000 000000+000 181 0-00000000 0000000+00 000+-00000 0-0000-000 0000000000 0000000000 241 -0+0000000 00+0-000-+ 0-0000++00 00000000+0 0-0++00-+- 000000-000 301 000000-000 00000000+0 00+000-000 00000000+0 00-0000000 00++000-0+ 361 0-0000-00- 0000000000 0000000000 0000+-000- 0000-00000 00+0-00+00 421 000+00+00- 000+000000 -+00000000 000000+000 0000000000 0000000000 481 000000+0+0 +0---0+0+0 00--000-00 00+00+00+0 0000000000 0000000000 541 0000000000 0000000000 0000000000 00+000-0+0 0+00000000 0000000000 601 0000000000 00000000+0 0+000-0000 0+00000000 0000000000 00000+000+ 661 00000-000- 00000-000- 0000000000 0000000000 0000000000 0-0+0+0- A. CHARGE CLUSTERS. Positive charge clusters (cmin = 8/30 or 11/45 or 13/60): none Negative charge clusters (cmin = 7/30 or 10/45 or 12/60): none Mixed charge clusters (cmin = 12/30 or 16/45 or 20/60): 1) From 487 to 519: KTKGKLEDECKTKGNADDLQMEGGSWKWLKSAR +0+0+0---0+0+000--000-0000+00+00+ quartile: 3; size: 33, +count: 8, -count: 6, 0count: 19; t-value: 4.17 * G: 4 (12.1%); K: 7 (21.2%); B. HIGH SCORING (UN)CHARGED SEGMENTS. There are no high scoring positive charge segments. There are no high scoring negative charge segments. There are no high scoring mixed charge segments. There are no high scoring uncharged segments. C. CHARGE RUNS AND PATTERNS. pattern (+)| (-)| (*)| (0)| (+0)| (-0)| (*0)|(+00)|(-00)|(*00)| (H.)|(H..)| lmin0 4 | 4 | 6 | 54 | 9 | 9 | 12 | 11 | 11 | 15 | 7 | 8 | lmin1 6 | 6 | 8 | 65 | 11 | 11 | 14 | 14 | 13 | 18 | 8 | 10 | lmin2 7 | 7 | 9 | 73 | 13 | 12 | 16 | 16 | 15 | 20 | 9 | 12 | (Significance level: 0.010000; Minimal displayed length: 6) (*00) 15(0,0,0); at 72- 86: DVQEANRFAKGIEAA (1. quartile) -00-00+00+00-00 Run count statistics: + runs >= 3: 0 - runs >= 3: 1, at 493; * runs >= 4: 0 0 runs >= 36: 2, at 520; 583; -------------------------------------------------------------------------------- DISTRIBUTION OF OTHER AMINO ACID TYPES 1. HIGH SCORING SEGMENTS. __________________________________ High scoring hydrophobic segments: 2.00 (LVIFM) 1.00 (AGYCW) 0.00 (BZX) -2.00 (PH) -4.00 (STNQ) -8.00 (KEDR) Expected score/letter: -1.318 M_0.01= 36.32; M_0.05= 30.03 1) From 520 to 569: length= 50, score=33.00 * 520 VLLIIQFWAV LVSLLPYYIS QIPGAMPIQY AVIWAVLSIT ILIILYAMFG L: 8(16.0%); A: 5(10.0%); V: 5(10.0%); I: 10(20.0%); ____________________________________ High scoring transmembrane segments: 5.00 (LVIF) 2.00 (AGM) 0.00 (BZX) -1.00 (YCW) -2.00 (ST) -6.00 (P) -8.00 (H) -10.00 (NQ) -16.00 (KR) -17.00 (ED) Expected score/letter: -2.206 M_0.01= 92.95; M_0.05= 76.11; M_0.30= 56.07 1) From 19 to 45: length= 27, score=58.00 (pocket at 25 to 27: length= 3, score=-18.00) 19 LGVFLA |SHH| I LFSAVCCTAG IIALLFL L: 6(22.2%); A: 4(14.8%); I: 3(11.1%); F: 3(11.1%); 2) From 458 to 485: length= 28, score=64.00 458 FISVGVYMIP FALLLAPLPI VAAALAGG L: 5(17.9%); A: 6(21.4%); G: 3(10.7%); V: 3(10.7%); I: 3(10.7%); P: 3(10.7%); 3) From 520 to 569: length= 50, score=81.00 * (pocket at 535 to 549: length= 15, score=-22.00) 520 VLLIIQFWAV LVSLL |PYYIS QIPGAMPIQY| AVIWAVLSIT ILIILYAMFG L: 8(16.0%); A: 5(10.0%); V: 5(10.0%); I: 10(20.0%); 2. SPACINGS OF C. H2N-33-C-C-87-C-372-C-118-C-80-C-3-C-7-C-10-COOH 2*. SPACINGS OF C and H. (additional deluxe function for ALEX) H2N-25-H-H-6-C-C-79-H-7-C-1-H-103-H-6-H-76-H-4-H-72-H-19-H-32-H-51-C-118-C-77-H-2-C-3-C-1-H-3-H-1-C-10-COOH -------------------------------------------------------------------------------- REPETITIVE STRUCTURES. A. SEPARATED, TANDEM, AND PERIODIC REPEATS: amino acid alphabet. Repeat core block length: 5 B. SEPARATED AND TANDEM REPEATS: 11-letter reduced alphabet. (i= LVIF; += KR; -= ED; s= AG; o= ST; n= NQ; a= YW; p= P; h= H; m= M; c= C) Repeat core block length: 9 -------------------------------------------------------------------------------- MULTIPLETS. A. AMINO ACID ALPHABET. 1. Total number of amino acid multiplets: 52 (Expected range: 23-- 63) 2. Histogram of spacings between consecutive amino acid multiplets: (1-5) 21 (6-10) 9 (11-20) 12 (>=21) 11 3. Clusters of amino acid multiplets (cmin = 12/30 or 16/45 or 19/60): none B. CHARGE ALPHABET. 1. Total number of charge multiplets: 7 (Expected range: 0-- 17) 3 +plets (f+: 8.5%), 4 -plets (f-: 7.4%) Total number of charge altplets: 7 (Critical number: 20) 2. Histogram of spacings between consecutive charge multiplets: (1-5) 0 (6-10) 2 (11-20) 1 (>=21) 5 -------------------------------------------------------------------------------- PERIODICITY ANALYSIS. A. AMINO ACID ALPHABET (core: 4; !-core: 5) Location Period Element Copies Core Errors 185- 208 6 S..... 4 4 0 188- 215 7 L...... 4 4 0 261- 276 4 A... 4 4 0 393- 424 8 A....... 4 4 0 606- 625 5 L.... 4 4 0 B. CHARGE ALPHABET ({+= KR; -= ED; 0}; core: 5; !-core: 6) and HYDROPHOBICITY ALPHABET ({*= KRED; i= LVIF; 0}; core: 6; !-core: 9) Location Period Element Copies Core Errors 487- 500 2 *0 7 7 /0/2/ 552- 569 3 i.0 6 6 /0/./2/ 576- 625 5 i.0.. 10 10 ! /0/./2/././ 625- 684 10 i*..0.0.0. 6 6 /0/2/././1/./2/./2/./ -------------------------------------------------------------------------------- SPACING ANALYSIS. Location (Quartile) Spacing Rank P-value Interpretation 519- 573 (4.) *( 54)* 1 of 115 0.0068 large 1. maximal spacing 582- 619 (4.) *( 37)* 2 of 115 0.0081 large 2. maximal spacing ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Pfam_ls (from /local/index/hmmer) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /local/index/hmmer/Pfam_ls Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- Gaa1 Gaa1-like, GPI transamidase component 43.1 1.4e-10 1 YCF9 YCF9 -31.4 71 1 DUF624 Protein of unknown function, DUF624 -36.2 53 1 COX2_TM Cytochrome C oxidase subunit II, tran -41.4 18 1 CDP-OH_P_transf CDP-alcohol phosphatidyltransferase -41.8 34 1 DedA DedA family -42.3 26 1 DoxD DoxD-like family -42.5 68 1 MtN3_slv MtN3/saliva family -42.7 47 1 MgtE Divalent cation transporter -60.8 30 1 SirB Invasion gene expression up-regulator -65.9 63 1 PhaG_MnhG_YufB Na+/H+ antiporter subunit -66.0 59 1 DUF389 Domain of unknown function (DUF389) -74.2 32 1 ATP-synt_A ATP synthase A chain -95.0 77 1 Peptidase_M28 Peptidase family M28 -99.5 33 1 CTP_transf_1 Cytidylyltransferase family -102.5 66 1 DUF125 Integral membrane protein DUF125 -104.7 19 1 UPF0032 MttB family UPF0032 -110.4 48 1 Competence Competence protein -113.0 19 1 UbiA UbiA prenyltransferase family -113.8 88 1 TerC Integral membrane protein TerC family -117.6 72 1 Pept_tRNA_hydro Peptidyl-tRNA hydrolase -119.1 94 1 Ribonuclease_BN Ribonuclease BN-like family -128.6 15 1 UPF0118 Domain of unknown function DUF20 -158.5 60 1 DUF554 Protein of unknown function (DUF554) -174.5 91 1 Adenyl_transf Streptomycin adenylyltransferase -191.3 6.6 1 oxidored_q1 NADH-Ubiquinone/plastoquinone (comple -191.6 92 1 ABC-3 ABC 3 transport family -214.4 70 1 sugar_tr Sugar (and other) transporter -217.4 46 1 FecCD FecCD transport family -231.2 31 1 NADHdh NADH dehydrogenase -235.0 68 1 Cyto_ox_2 Cytochrome oxidase subunit II -252.8 23 1 MVIN Virulence factor MVIN -267.9 28 1 DUF409 Protein of unknown function (DUF409) -283.5 27 1 DUF639 Plant protein of unknown function (DU -464.2 98 1 HCO3_cotransp HCO3- transporter family -588.2 94 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Pept_tRNA_hydro 1/1 12 125 .. 1 193 [] -119.1 94 Peptidase_M28 1/1 134 340 .. 1 282 [] -99.5 33 Adenyl_transf 1/1 179 382 .. 1 294 [] -191.3 6.6 ATP-synt_A 1/1 444 568 .. 1 179 [] -95.0 77 CDP-OH_P_transf 1/1 447 571 .. 1 164 [] -41.8 34 MtN3_slv 1/1 447 573 .. 1 89 [] -42.7 47 DUF639 1/1 81 585 .. 1 747 [] -464.2 98 DedA 1/1 436 588 .. 1 167 [] -42.3 26 DUF125 1/1 455 595 .. 1 184 [] -104.7 19 UbiA 1/1 364 599 .. 1 290 [] -113.8 88 TerC 1/1 429 600 .. 1 274 [] -117.6 72 MgtE 1/1 465 602 .. 1 137 [] -60.8 30 YCF9 1/1 546 604 .. 1 59 [] -31.4 71 DUF624 1/1 547 606 .. 1 77 [] -36.2 53 ABC-3 1/1 435 612 .. 1 267 [] -214.4 70 Ribonuclease_BN 1/1 361 619 .. 1 273 [] -128.6 15 COX2_TM 1/1 562 621 .. 1 89 [] -41.4 18 DUF554 1/1 467 621 .. 1 250 [] -174.5 91 Cyto_ox_2 1/1 331 622 .. 1 363 [] -252.8 23 CTP_transf_1 1/1 455 625 .. 1 341 [] -102.5 66 DUF409 1/1 324 633 .. 1 581 [] -283.5 27 HCO3_cotransp 1/1 159 641 .. 1 887 [] -588.2 94 DUF389 1/1 461 646 .. 1 169 [] -74.2 32 SirB 1/1 543 650 .. 1 127 [] -65.9 63 FecCD 1/1 408 656 .. 1 311 [] -231.2 31 DoxD 1/1 553 657 .. 1 135 [] -42.5 68 UPF0118 1/1 271 662 .. 1 340 [] -158.5 60 PhaG_MnhG_YufB 1/1 585 674 .. 1 120 [] -66.0 59 UPF0032 1/1 522 676 .. 1 213 [] -110.4 48 Competence 1/1 428 680 .. 1 292 [] -113.0 19 Gaa1 1/1 147 684 .. 1 546 [] 43.1 1.4e-10 sugar_tr 1/1 293 688 .. 1 488 [] -217.4 46 MVIN 1/1 373 693 .. 1 483 [] -267.9 28 oxidored_q1 1/1 502 693 .. 1 319 [] -191.6 92 NADHdh 1/1 463 704 .. 1 331 [] -235.0 68 Alignments of top-scoring domains: Pept_tRNA_hydro: domain 1 of 1, from 12 to 125: score -119.1, E = 94 *->tikLiVGLGNPGkqYaeTRHNaGFmvlDlLasrlglslreekrffgl +++LiV rlg++l + l BAC01259.1 12 KPRLIV--------------------------RLGVFLASHH---IL 29 ggkvlvsgkkHCviLlKPrTYMNlSGkaVlalasfYkikpeeilVVhDdL +++v + + + Ll ++++la++ +++ e+ l+ BAC01259.1 30 FSAVCCTAGI--IALL-----------FLPSLAKNTYLS-ENALI----- 60 DLPlGkiRLKqgGgaGRGHNGLKSiishLGntnnFnR...LRiGIGrpNp +G+a+ + L S+ + n F+++ + +iG +r+ + BAC01259.1 61 -----------PGSAN-T---LFSTEDVQE-ANRFAKgieAAIGESRG-G 93 gsndvaefVLskFspaErpllekaldkaiealemiieghgmnklmnrfn< + ++++k++ + ++ l + h++++ + f+ BAC01259.1 94 T-----------------TEIPKFIAQQTKNLGAEVYYHEFLPDSKCFH 125 -* BAC01259.1 - - Peptidase_M28: domain 1 of 1, from 134 to 340: score -99.5, E = 33 *->klkldvhteleernhkiqNvvgtikG.GseepdeyVviGAHrDSwge + ++ ++ + +++n + +N vg+i+ + ++ + +V+++ +++S + BAC01259.1 134 TNNMAAKPNGTYTN-FGINTVGIIRApRGDGKEAIVLVT-PYNSQKV 178 gpnaInnGAlDngSGtAvLLElArvlk..dlpkrggWKPrRsIrFasWgA +p n LL lA +++ +l++r +W + + I+ +s + BAC01259.1 179 TP---NE-----------LLSLALGFSvfSLLSRAAW-LSKDIVWLSADS 213 EEfGLvGStey..aeelskalkkkavay..iNlDmigsgnytldvlftai +fG ey+ + ++ ++ + +++ +++NlD ++++ +a BAC01259.1 214 -QFG-----EYsaVSSWLNQYHNPMFLShpVNLD--------TKIY-GA- 247 tplldslvqpvskevaapleggrsGrksLYdswikltvpsfvvkvlnaie + l++ + + k a+ + ++++ + BAC01259.1 248 NQILYKPDGTAEK--AELMA---------------------FKRAGT--- 271 rlgdgrSDhtpFlaaGGIPavnfssganeekteergypwyHTakveDTcy ++ ++ G + +++ +r+ T ++ + BAC01259.1 272 ----M--AAALIFKVG-------ETRKYG----DRDSV---T--MYAEAS 299 hsptdkiekilfddpllkrlaavaalagalvleladdpvlpf<-* ++ +++ l + +l+++++ ++ +++ l ++ l+ BAC01259.1 300 NGQMPNLDL-LNVVHYLAVHRQGFRVNVETFNSLLSSSWLRV 340 Adenyl_transf: domain 1 of 1, from 179 to 382: score -191.3, E = 6.6 *->MRtEkEmLDvvLefAekdervRiVaLeGSRTNenipKDeFQDYDIvY t E L + L f+ V SR+ + + DIv BAC01259.1 179 --TPNELLSLALGFS--------VFSLLSRAAWLSK-------DIVW 208 vVeD..iepFIsddsWLkkFGepImmQePEDmelGiGFppdLgkrysYlM D++ ++ sWL++ +p + P ++ +++ ++ BAC01259.1 209 LSADsqFGEYSAVSSWLNQYHNPMFLSHPVNLDT----K---IYGANQIL 251 LFeDGnKiDLTLipkkelqryvqEKELPEdsDgLvkvLiDKDhlikqeiv + DG T k el+ + + Li K + + +++ BAC01259.1 252 YKPDG-----TA-EKAELMAFK-------RAGTMAAALIFKVGET-RKYG 287 PtdsdYwiKkPserEFddccNEFWwVstYVaKGlcRdEIlYAkdHfntIv ds ++s + ++ ++ +V+ H + BAC01259.1 288 DRDSVTMYAEASNGQMPNL------DLLNVV-------------HYLAVH 318 RpEYLLrmiaWhIasergFnlstGKnyKffkkYLpqeeWakleaTysmsd R+ F++ + + + +++s s BAC01259.1 319 RQG---------------FRVNV----------------ETFNSLLSSSW 337 yekiWeSLFlccdLFreyskeVaekleYkYPKeydeenItkYleglyek< ++ i e + L r++ + kl + P +y e+ + + +y+ BAC01259.1 338 LRVIAEVFQNLGSLLRKINPD--WKLDVTVP-DYV-EGTANLASSMYNQ 382 -* BAC01259.1 - - ATP-synt_A: domain 1 of 1, from 444 to 568: score -95.0, E = 77 *->gglfapliitlFlfilisNllGLPidllPysftnllgpeglwkspTs ++ f+++++t + + ++ ++P+++ BAC01259.1 444 HQSFFLYFLTAPSKFISVGVY-----MIPFALL-------------- 471 dlslTlslA.lplwlatliyGiknkg.................lkffskl lA+lp+++a l G k kg+ +++ +++++ ++ + +++ +k BAC01259.1 472 -------LApLPIVAAALAGGSKTKGkledecktkgnaddlqmEGGSWKW 514 vPqgtPlalvpflvpiEliSlfarPlsLglRLfgNilAGhlLltLLasll +++ + +l++i + ++L+sll BAC01259.1 515 ---LKS---ARVLLIIQ------------------------FWAVLVSLL 534 pvlmsissilgilgllgpllivllltiLelfVaaIQAYVFtiLtilYlse p + is i g p++ +++++L++ tiL+ilY + BAC01259.1 535 P--YYISQIPGAM----PIQYAVIWAVLSI----------TILIILYAMF 568 <-* BAC01259.1 - - CDP-OH_P_transf: domain 1 of 1, from 447 to 571: score -41.8, E = 34 *->llLfllaallDalDGylARktgnqvSrfGafLDplaDklsdgaalva ++L++l a + S+f +s+g++++ BAC01259.1 447 FFLYFLTA-------------P---SKF----------ISVGVYMIP 467 layagpdrylllpgwltldlllvilarfilllalrlarfyvg........ +a lll+ l +++la+++ + + +++ ++++ BAC01259.1 468 FA-----------------LLLAPLPIVAAALAGGSKTKGKLedecktkg 500 .............ggkiktgeqlliillllllgigppnsklsglglanwl + ++ ++++ +++k +++ +++ ++++l + +++ ++++ BAC01259.1 501 naddlqmeggswkWLKSARVLLIIQFWAVLVSLLPY--------YISQIP 542 lalglvlaallfilqmgviaailtlisvleyllaaykl<-* a+ + +a++ +a+l++ +++++ ++ + BAC01259.1 543 GAMPIQYAVI---------WAVLSITILIILYAMFGSP 571 MtN3_slv: domain 1 of 1, from 447 to 573: score -42.7, E = 47 *->fvLGlvcv...ifsvavFlsPLsil...................... f+L ++ +++++sv+v+ P + l + + ++++++++ + BAC01259.1 447 FFLYFLTApskFISVGVYMIPFALLlaplpivaaalaggsktkgkle 493 ..........................frkiikkKSveglpflpflaglLs ++ +++++ ++ + ++++ + ++ +++ ii+ + v ++lp+ + + BAC01259.1 494 decktkgnaddlqmeggswkwlksarVLLIIQFWAV-LVSLLPYYISQIP 542 sslWllYGllknDaffiiipNliGcvlgliylilFliYppkkk<-* + + + Y+++ +++ ++++ +il++++ ++ + BAC01259.1 543 GAMPIQYAVI----WAVLSITIL--------IILYAMFGSPSR 573 DUF639: domain 1 of 1, from 81 to 585: score -464.2, E = 98 *->ekkeellilrdrspkatrkkiPkGFLSliANvVVeRCSkiLgitadd +e + ++ ++ +iPk +iA + + a+ BAC01259.1 81 KGIEAA----IGESRGGTTEIPK----FIAQ-------QTKNLGAEV 112 LqdeFevevkpsLsqgsTyaRnFvEyCcFraLSrvseevlerLk...Dke +eF ++ s + cF L ++ ++ ++ + BAC01259.1 113 YYHEF---LPDS-K-------------CFHPLKF-FTSMTNNMAakpNGT 144 FrRLtFDMMLAWeqPdvkseesykeavGkesedkriqatlSeeeDdiSLF + + P + + ea+ + q+ e + L BAC01259.1 145 YTNFGINTVGIIRAPRGDGK----EAIVLVTP-YNSQKVTPNELLSLALG 189 YSdmalLlVdeeksVGeEAFvRIAPaiPliADvItvhNLFeaLTssTGhr +S lL+ + + + ++ l AD + + a s + BAC01259.1 190 FSVFSLLS--RAAWLSKD-------IVWLSADSQFGEY--SAVSSWLNQY 228 GLhfevYDkYlkeLe.KiiKklksqstpsaidlelaksEii.LemdGtva h + ++ L++Ki + + p d+ k E +Gt+a BAC01259.1 229 --HNPMFLSHPVNLDtKIYGANQILYKP---DGTAEKAELMaFKRAGTMA 273 kqPVLkHvgisaWPG.kLTLTdkALYFEavgIvgyedplRyDLseDlKqv + k vg++ G++ T +A Ea + p Dl v BAC01259.1 274 AALIFK-VGETRKYGdRDSVTMYA---EASN---GQMP-----NLDLLNV 311 iKPelTGPLGarLFDKAvsYeSidleEpvVlEFpElkGetRRDyWLaIik + + G+r v +e F l ++ WL +i BAC01259.1 312 VHYLAVHRQGFR-----VNVET----------FNSLLSSS----WLRVIA 342 EvllvHkFlRkfGPGEGDKSLYnveGPlKKqreEilarAiLGIiRlqAlr Ev f n + l BAC01259.1 343 EV---------F---------QNLGS-L---------------------- 351 EmlrvsdddyKnLLiFsLleqvPgGDlVLEt..LAVeisGGPLLtkrstt ++++++ d K L vP D+V t +LA ++ BAC01259.1 352 -LRKINP-DWK------LDVTVP--DYVEGTanLASSMY-------NQAL 384 RsdiarasssiLkWPsnessealdlvsdlDGsVyLKRWmRSPSWGstkee ++ +s + + + + ++ + +k e BAC01259.1 385 GVPT--GSHGAFR-----DYQVDAVSLEF-APAF-----------HLKNE 415 sedkesslvVGlvlkkelvVgdlapLErAvkqSRkkykvvEkAQATidev + +l+ G l + + r v+ +k BAC01259.1 416 NAKSSFLLRGG-----RL----TEGVVRSVNNLLEKFHQ----------- 445 kveGIdtNvAVmKELlLPliktateiekLayWedPyksvvFllLasyiIy L Fl s I BAC01259.1 446 -------------SFFL----------------------YFLTAPSKFIS 460 RgWlgyvlavsLlf..vAivMllLKGLrRqfgrkgKlekavkVkapPskN g +y ++ +Ll+ ++ iv + L G + + gKle++ k k + BAC01259.1 461 VG--VYMIPFALLLapLPIVAAALAGGSKTK---GKLEDECKTKGNADDL 505 tvE..qllavQdaiseLEsliQkVNVvLLKlRalvlSllPQatsevAiam +E+++ ++ a L + + a + SllP s++ am BAC01259.1 506 QMEggSWKWLKSARVLL----------IIQFWAVLVSLLPYYISQIPGAM 545 .lVvAtvlAvVPfKYllavvfVelFTReseVfRkaSvdrlnRRlREWWfs ++ A++ Av l++ l + + + +a v EW BAC01259.1 546 pIQYAVIWAVLSITILII-----LYAMFGS-PSRAGV--------EW--- 578 vPAAPVivlrakn<-* +l+a BAC01259.1 579 ------KLLKATM 585 DedA: domain 1 of 1, from 436 to 588: score -42.3, E = 26 *->favlsvgfieagliigpiiPgdillilfGvlagagilkfpifllall +l+ f +++++ ++ ++ +i++Gv + +l ++ +l ++ BAC01259.1 436 VNNLLEKFHQSFFLY--FLTAPSKFISVGVYMIPFALLLA-PLPIVA 479 apliggliGdavsywigrrfgnkllvrqskathqekwlak.akksfeahg a+l+g g + ++k l + k + ++ l+++ ++ + + BAC01259.1 480 AALAG-----------GSKTKGK-LEDECKTKGNADDLQMeGGSWKWLKS 517 lpviiiarFlpglrtlvPtvA....Gvvalpyapfvalfnfigallwvll +v++i F + l +l+P+ +++ +G++ ya++ a ++++++ ++l BAC01259.1 518 ARVLLIIQFWAVLVSLLPYYIsqipGAMPIQYAVIWA---VLSITILIIL 564 ntllGayaggavpafllhlsiqiig<-* + +G+ + v +ll ++ +i++ BAC01259.1 565 YAMFGSPSRAGVEWKLL-KATMITS 588 DUF125: domain 1 of 1, from 455 to 595: score -104.7, E = 19 *->meRYfvrGfiDGvLsvLGVvlGASGsaGAGGLevdarlIIaAGlgGG + ++ G + + L + + +++aA l+GG BAC01259.1 455 PSKFISVGVY---------MIPFAL------LLAPLPIVAAA-LAGG 485 vAlGiSNvfGAFtAE.RaEeerekrekEKslLveeGkLrdsiIykkaRr. S G ++ E + + E + L+ ++ + BAC01259.1 486 -----SKTKGKLEDEcKTKGNADDLQMEGGSWKW---LKSARVLLIIQFw 527 rvymSglIdGiStiiGSvlPVlPFviAFfvifdletAlivaValtlasLf +v +S l iS i G+ P i ++v i+aV l + +L BAC01259.1 528 AVLVSLLPYYISQIPGAM----P--IQYAV--------IWAV-LSITILI 562 iLGvvlGkiSkENvliSslkmVvaGvvvAvlslllekgs<-* iL + G S+ v lk ++ + s+ ++ g BAC01259.1 563 ILYAMFGSPSRAGVEWKLLK------ATMITSITIGMGL 595 UbiA: domain 1 of 1, from 364 to 599: score -113.8, E = 88 *->tllvlapvlaglalaaggvntgeadllllllallgfflaraagnviN t++ ++ + a la+++++ l + + g BAC01259.1 364 TVPDYVEGTANLASSMYN------QALGVPTGSHG------------ 392 DyfDrdiDaineRrisTpnRPlpsGrisprealtfalllla.lglalall + D+ +Da+ + p+ +++ a+ +ll + +l ++ + BAC01259.1 393 AFRDYQVDAVSLE-------FAPAFHLKNENAKSSFLLRGGrLTEGVVRS 435 lnnla..........fllallglalavlYsypYtrllKRytplgtvvlga nnl+++ +++ fl+a+ ++ + +Y +p + +++ ++ BAC01259.1 436 VNNLLekfhqsfflyFLTAPSKFISVGVYMIP-----FALLLAPLPIV-- 478 agalpplmGwvAvtgqltpslsvvailLflalflwtaphdiakaledved +++l+G+ + g +l ++ t + ++ + +e + BAC01259.1 479 ---AAALAGGSKTKG----KLEDECK---------TKGNADDLQME--G- 509 DrkaGlkslpvvlGeekakkiallllavavalllllgll...gglgllgi ++++ + a++++ + +++ +++ll +++ + +g++++ + BAC01259.1 510 --------GSWKWL-KSARVLLIIQFWAVLVSLLPYYISqipGAMPIQYA 550 lylivvlllgalllvlaiklvrremdpekiekarlllTfvaslilllvlf + ++v+ + ++++l +a+++ + + + +k a+ i+++ ++ BAC01259.1 551 VIWAVLSITILIIL-YAMFGSPS--RAGVEWKL-----LKATMITSITIG 592 aalllaf<-* ++l + BAC01259.1 593 MGLMSII 599 TerC: domain 1 of 1, from 429 to 600: score -117.6, E = 72 *->dpeaaiaiAkmwlallTlillEkvLgiDNaiviailvgfFhLPeeqR + ++ +++ + L e+ + BAC01259.1 429 TEGV-------VRSV----------------------N--NLLEKFH 444 kralflGlagAlvlRilllfsgawLlslfqpllygvvmfAaGqaisgldL + ++ +l++ ++ ++ ++ ++l+L BAC01259.1 445 QSFFLY----------FLTAPSKFIS--VGVYMIP----------FALLL 472 l.llggklflikkkeseelennlilatkeiheaiskdlegdrffekGtle +l ++ +l+++ + k + e +k +d ++e+ BAC01259.1 473 ApLPIVAAALAGGSKT---------KGKLEDECKTKGNADDLQMEG---G 510 ngkkarlftplfiaiiqIvlaDlvFSlD..SViAifGitqdvffvvitav + k+ + + +++ I + + Sl + ++ i G+ ++ vi+av BAC01259.1 511 SWKW---LK-SARVLLIIQFWAVLVSLLpyYISQIPGAMPIQY-AVIWAV 555 ilsvlvmrfaafliadllekfpyLkylaaviLgfiGvklilegladfidd ls+ +++ ++ ++++ ++ ++ BAC01259.1 556 -LSITILIILY--------------------------AMFGSPSR-AGVE 577 vhvpkyVvVgypysvslaiaFavlvealni<-* + +k + + + i+++ +++++ BAC01259.1 578 WKLLK----ATMI---TSITIGMGLMSIIN 600 MgtE: domain 1 of 1, from 465 to 602: score -60.8, E = 30 *->fiPlLiGlgGNvGtqlssrlsrgLalGevss................ +iP + l+ + + +++ + G+++++ +++++ ++ + +++ BAC01259.1 465 MIPFALLLA-PLPIVAAALAGGSKTKGKLEDecktkgnaddlqmegg 510 kkrvlrvlakelltsillgvvlslallfvagilgggsgvideegeallfl ++l+ l +++++ +v++l +i +++ +a+ + BAC01259.1 511 SWKWLKSARVLLIIQFWAVLVSLLP----YYISQIP--------GAMPIQ 548 ivai..slliislvalllgvlipilfkklglDPdnvsgPlITTlaDittl + +i +l i++l++l+++ +p + g+ + + +IT+++ +++l BAC01259.1 549 YAVIwaVLSITILIILYAMFGSP---SRAGVEWKLLKATMITSITIGMGL 595 llIyllia<-* + ++a BAC01259.1 596 M-SIINFA 602 YCF9: domain 1 of 1, from 546 to 604: score -31.4, E = 71 *->afQlaVlALIllSfvLVvgVPVvlASPg....gWersKsliysGagl +Q aV +l ++L++ + + SP + + +W K + + + + BAC01259.1 546 PIQYAVIWAVLSITILII-LYAMFGSPSragvEWKLLKATMITSITI 591 WtgLViVvgiLnslvv<-* gL +i n+ + BAC01259.1 592 GMGL---MSIINFATA 604 DUF624: domain 1 of 1, from 547 to 606: score -36.2, E = 53 *->laylNLLWllftLlGLiVfGlmPATaALfavlRkwlqgekDvpifkt ++ ++W +++ ++Li++ A+f+ + + + k BAC01259.1 547 -IQYAVIWAVLSITILIIL------YAMFGSPSRAGVEWK------- 579 FwqtYKqeFvkaNllGlfflligliLllnL<-* + K + ++G ++ i+ + ++ L BAC01259.1 580 ---LLKATMITSITIGMGLMSIINFATAQL 606 ABC-3: domain 1 of 1, from 435 to 612: score -214.4, E = 70 *->qyefmqrAllasilvglacgi.....LGsFlVLRRqSLmGDAiSHav + ++++ + s ++ +++++ ++G++ BAC01259.1 435 SVNNLLEKFHQSFFLYFLTAPskfisVGVYM---------------- 465 LpGVALAffLginkSleipliGAflfgliaAvaigylkrnsrlkeDtaiG + +A++L+ p+++A+l+g ++ + g l+ + ++k+ + BAC01259.1 466 ---IPFALLLAPL-----PIVAAALAG--GSKTKGKLEDECKTKGNA--- 502 IvfssflAlGlllislikgsnaaskvdLdhyLFGniLgisqqDliqiaii + + + + ++ L + ++ BAC01259.1 503 --------------DDLQ-----MEGGSWKWL------------KSARVL 521 taiiLlllllfwkeLllitFDpdlAkviGlpvnflkllLliLlaltiVva +i + ++l+ Ll + ++ +p++ + ++l++ti BAC01259.1 522 LIIQFWAVLV---SLLPYYISQIPGA---MPIQ--YAVIWAVLSITI--- 560 lqaVGvILViAlLitPAatArlltkslesmlliAsaiGvvssvaGlllSY I+ A++ P+ + + k l+ +++++ iG+ + BAC01259.1 561 -----LIILYAMFGSPSRAGVEW-KLLKATMITSITIGMGLM-------- 596 yfdtatGpvIVLiatllFlisflfa<-* +I+ at + l +++ + BAC01259.1 597 --------SIINFAT-AQLGALILI 612 Ribonuclease_BN: domain 1 of 1, from 361 to 619: score -128.6, E = 15 *->neddisllA...AaLaYytLLSLFPlLlvllallaglfpiafaavle + ++++ +++A La +++ +++ + + BAC01259.1 361 LDVTVPDYVegtANLA---------------SSMYNQALGVPTGSHG 392 qlkdfipPsslaplisdvvenllnqpnggllsggRtevtavGllvalwtA +++d+ ++++l + l n+ ++ + BAC01259.1 393 AFRDYQV--DAVSLEFAPAFHLKNENAKSSFL---------------LRG 425 snginalqkaLNkaydvhhVeesrpsfiglrdlrsilfsfglylalltll ++ + + +++N + ++ f++++ + s++ s+g+y++ +ll BAC01259.1 426 GRLTEGVVRSVNNLLE---KFHQS-FFLYFLTAPSKFISVGVYMIPFALL 471 llslfaslaisaliaklvglsg..............eflttvltllrwpv l +l+ +a++a k g +++ +++++ ++ + e++++ + + ++ BAC01259.1 472 LAPLPIVAAALAGGSKTKGKLEdecktkgnaddlqmEGGSWKWLKSARVL 521 svlllfllfalL..YrvlPnv.klkwrdvlpGAlfaavlwelgsylfgl. +++ ++++ L +Y++ + ++ +++++A+++ +++++ +fg++ BAC01259.1 522 LIIQFWAVLVSLlpYYISQIPgAMPIQYAVIWAVLSITILIILYAMFGSp 571 ...YvsyfgnysstYGslGaviiilLlWiylsaliillGAelnavlseyk ++ v + +++ s+ ++l+ i++ a + l +l ++ + BAC01259.1 572 sraGVEWKLLKATMITSITI-GMGLMSIINF-ATAQLGALILIPMCLFSR 619 <-* BAC01259.1 - - COX2_TM: domain 1 of 1, from 562 to 621: score -41.4, E = 18 *->matwwglnfQDsASPlmEqlieFHDhtlmiLilItilVsyilvsllf +++ SP ++ + l+ +++It++ + + ++ ++ BAC01259.1 562 IILYAMF-----GSPSRAGVEW----KLLKATMITSITIGMGLMSII 599 nFnrrkskltnrylleGqtIEiIWTilPaiILilIAlPSLrL<-* n + t+ l a+ILi ++l+S L BAC01259.1 600 N------FATAQ--------------LGALILIPMCLFSRPL 621 DUF554: domain 1 of 1, from 467 to 621: score -174.5, E = 91 *->liGtliNalaVvlGsliGll.fkkrlPerikdtLmqvlGLAvlgiGI ++ l+ l +v+++l G+ + k++l ++ k+ BAC01259.1 467 PFALLLAPLPIVAAALAGGSkTKGKLEDECKTK-------------- 499 smatvsllqsknfllvilsLViGgviGEilnLEkrlnklGdkleKPivkr + ++ l+ E++ k +++ + BAC01259.1 500 -------GNADD-----------------LQMEGGSWKWLKSAR----VL 521 frGskkglanesFaegF.VTatLL.FcvGpmgIlGalneGLTGDhsvLlt + +++ V +LL++ ++ I Ga+ + BAC01259.1 522 L-----------IIQFWaVLVSLLpYYISQ--IPGAMPIQ---------- 548 KslLDGftAiiLAstLGiGVafsAIpvllyQGliaLfAkqieslvp.ilt A+i A V+ ++I ++ly a f + ++v +l+ BAC01259.1 549 -------YAVIWA------VLSITILIILY----AMFGSPSRAGVEwKLL 581 tdsfIaeftatGGlLIlaiGlnlLLGGLGmkikkfkVgNlLPALllvpvl +I+ +t iG l+ +i++f ++ L AL+l+p BAC01259.1 582 KATMITSIT---------IGMGLM------SIINFATAQ-LGALILIPM- 614 vwlvykl<-* + + l BAC01259.1 615 CLFSRPL 621 Cyto_ox_2: domain 1 of 1, from 331 to 622: score -252.8, E = 23 *->elLplvWfvligvllfgYvvlDGFDlGVGmllpflakd......... lL W +i+ + +G ll+ + d + + + ++ BAC01259.1 331 SLLSSSWLRVIAEVFQN----------LGSLLRKINPDwkldvtvpd 367 ..eeERRillNsIGPVWDGNEVWLvlaGGALFAAFPlaYAtllSglYlPl e G+A +A S +Y + BAC01259.1 368 yvE------------------------GTANLA----------SSMYNQA 383 ilvLvg.......LiFRGVaFEyR..gKiedakWkkvWDwaffiGSlvpa ++v g+++ +++ + +V++E+ + + ++++ k +S+ BAC01259.1 384 LGVPTGshgafrdYQVDAVSLEFApaFHLKNENAK---------SSF--- 421 lllGvafGnllqGlPFlvdadlr..............tsYaGsswdlLnR ll G G+l +G+ v++ l + +++ + + + s+ + BAC01259.1 422 LLRG---GRLTEGVVRSVNNLLEkfhqsfflyfltapSKFI-SVGVYMI- 466 PfaLLcGlvlvslyalhGatflalKTeGeLqerarkl..........Ary PfaLL + + +++ al G++ KT G+L ++ ++ ++ ++ + +++ + BAC01259.1 467 PFALLLAPLPIVAAALAGGS----KTKGKLEDECKTKgnaddlqmegGSW 512 lafvtLvavllvglwllyGiDGyvlvs.iDtpatsaplakrVaveigaWw ++ ++ l++ +w vlvs + ++ ++p+a BAC01259.1 513 KWLKSARVLLIIQFW-------AVLVSlLPYYISQIPGAMP--------- 546 fnfprmpillalpvLgvvafllllvalrrgrygwaFiltlllialailga + +++a+ ++++++l + + + + + +l +i+++++g BAC01259.1 547 ---IQYAVIWAVLSITILIILYAMFGSPSRAGVEWKLLKATMITSITIGM 593 gislfPnvmPSsldpaysLTiwnAassplTLkiMLvialiflPivLgYti g + i n a ++l +ali++P+ L BAC01259.1 594 GL----------------MSIINFATAQL-------GALILIPMCL---- 616 wsYwVFRGKis<-* F+ ++ BAC01259.1 617 -----FSRPLR 622 CTP_transf_1: domain 1 of 1, from 455 to 625: score -102.5, E = 66 *->lkkRiitaivliliflillllgglslyfartalflllialivilalw k i +++ +i+++l+l l+ ++a++ + + + BAC01259.1 455 PSKFISVGVYMIPFALLLAPLP--------------IVAAALAGGSK 487 Elirllrlkfrsydlllplflallwalllllllllylfflfallevvlll +l ++ ++ + + ++ + + BAC01259.1 488 TKGKLEDECKTK---------------------GNADDLQMEGGS----- 511 ailllllvvwfqyhrwilfillvigfpffvlilerrlkrlqfglgaltyf +k+l+ + +l++ BAC01259.1 512 ------------------------------------WKWLKSARVLLII- 524 piilllvsfllfpllflqslfli.nrd......dGliwillliivvwaaD ++ ++++ll ++i+ +++ + ++ iw++l+i ++++ BAC01259.1 525 -QFWAVLVSLLP--------YYIsQIPgampiqYAVIWAVLSITILIIL- 564 igAYfvGklFGKtKrlaPkiSPnKTwEGfiGGlvgavlvgllfslllglp Y +FG + + +G+ + l+ a+ ++++ BAC01259.1 565 ---YA---MFGSP-----------SRAGVEWKLLKATMITSITIGMG--- 594 lsficpsqdltnfsldCpnlFpqYlPvfkflkIlPifwhllllglllsli l +++ + BAC01259.1 595 -----------------------------------------LMSIINFAT 603 svfGDLvESafKRdfgiKDSGklIPGHGGiLDRfDsllfaapvfylflli +++G L +l++++++ + BAC01259.1 604 AQLGAL-----------------------------ILIPMCLFSRPLRAQ 624 f<-* + BAC01259.1 625 L 625 DUF409: domain 1 of 1, from 324 to 633: score -283.5, E = 27 *->srkkkiNSYLAiRmtRSGltkyavlsRllvLLltiLynslalpests +++ NS L + v+ + + L +L + p+ ++ BAC01259.1 324 VNVETFNSLLS-------SSWLRVIAEVFQNLGSLLRK--INPDWKL 361 evfnppcslykedsssslvdslierlLgnkllhWDav..YFlkNiiAenG +v p + l+ s+ + Lg + a ++Y ++ A+ + BAC01259.1 362 DVTVPDYV----EGTANLASSMYNQALGVPTGSHGAFrdYQVD---AVSL 404 KylYEqeyAFspLwPffvrllakelleplvvlLslrscllvvlfaLsGil e+A P f +l +++ lL+ + + v+ +++ BAC01259.1 405 ------EFA-----PAF--HLK-NENAKSSFLLRGGRLTEGVVRSVN--- 437 lFilAavaLfqltkvilkdrkasfyAsLlFCfsPAaiFlssiYSEsLFAl +l ++sf l++++ + F s++ BAC01259.1 438 --------------NLLEKFHQSF---FLYFLTAPSKFISVG-------- 462 fsfvgmlelekgrSVPVLGqFsisavyLFalatlSarSngllslgfitls vy ++a l + + ++ + + + s BAC01259.1 463 -------------------------VYMIPFALL-LAPLPIVAAALAGGS 486 llgifFifsLlelnkerklvkqlvalfLscllillpflyfQYYgPYklFC + + + + ++k q+ ++ + l +l Q+ + BAC01259.1 487 KTKGKLE-D-ECKTKGNADDLQMEGGSWKWLKSARVLLIIQFWA------ 528 lgrtrrniPehlVdlAVdkkYllaeqGdEvvpWCkkslPlSiFiTKTSlY BAC01259.1 - -------------------------------------------------- - sYiQshYWnDRGVGFLKYytlkqiPNFLLAlPviilllwslfyYmkshpe ++++ +L Yy qiP A P+ ++w+ + BAC01259.1 529 -----VLVS-----LLPYY-ISQIPG---AMPIQYAVIWAVLSITI---- 560 lavslgLtlssfnkklekRLYslKDAvEPsvKtstnEGnHDiRqRKPssK + l + f + +++ BAC01259.1 561 -LIIL---YAMFGSPSRA-------------------------------- 574 kDltGtKvAPEKsGyLSadvFpfVvhaalLvligcffmHVQVltRflSSa + ++ a +++ + m + BAC01259.1 575 -GVE------------------WKLLKATMITSITIGM---------G-- 594 lPllYWFaAdllvktdqEPLLRsaKtkkwkplgdDsPPGqKvkRnPivgl + + + t +++l + BAC01259.1 595 -------LMSIIN----------FATAQLGAL-----------------I 610 LfvWltCaPvtRYiLGYFLtYillgtlLFsnFLPpt<-* L+ + + +l + L nFLP t BAC01259.1 611 LIPMCL-------------FSRPLRAQLEMNFLPRT 633 HCO3_cotransp: domain 1 of 1, from 159 to 641: score -588.2, E = 94 *->dveeggerwgkPHVatLslrSLleLrrciakGtvlLDLqatsLpeIa ++g+ ++ + V +++ S ++ + L + a BAC01259.1 159 ---PRGDGKE-AIVLVTPYNS--------------QKVTPNELLSLA 187 nFLETkvvdPdliksgqEkikeqdRenllraLLlkhsHqnekkstnslda +v++ + ++ ++ + +++ +++ s++ BAC01259.1 188 L--GFSVFS----LLSR---AAWLSKDIVWLSADSQFGEYSAVSSWLNQY 228 snpqtiptvrsqssiGkllsahhpnnlgsleehPe...........rglP np ++++ + + + k +a + + +P+++ ++ + ++ BAC01259.1 229 HNPMFLSHPVNLDT--KIYGANQ------ILYKPDgtaekaelmafKRAG 270 tshsgdlemssflevkddGvvselstvdksKvdlkflkKiPedaEAsiVL t++ ++ +++e+ + + +v+ +aEAs+ BAC01259.1 271 TMA--AALIFKVGETRK---YGDRDSVTM-------------YAEASNGQ 302 vGevdfLeqCpivAFVRLseAvnLggllevPvPvRFlFlLLGPsgekgTk +d L++ + +A R + vn + BAC01259.1 303 MPNLDLLNVVHYLAVHRQGFRVNVE------------------------- 327 dYhEiGRaiATLMsDevFhdvaYkAkdrddLlsgIdeFLdgviVlPPgev ++ +L+s ++ +a ++ +Ll I+ + + P v BAC01259.1 328 -------TFNSLLSSSWLRVIAEVFQNLGSLLRKINPDWKLDVTVP-DYV 369 dpeNLVRSilieplknlqsqelrkRiehpsdvrpgsgatkkaapheel.l +++ + +++ q+l+ p +++ + + +a+ e BAC01259.1 370 EGT-------ANLASSMYNQALGV----PTGSHGAFRDYQVDAVSLEFaP 408 eidgggaegdddpLqrTgrlFGGLvkDikRRaphYlSDyrDAlsGHKtvp + + + + L+r grl G+v+ + + + ++ s BAC01259.1 409 AFHLKNENAKSSFLLRGGRLTEGVVRSVNNLLEKFH------QS------ 446 QcLAaViFiYFAaLsPaITFGGLLGekTeglmgVsElllstAvqGilFsL +F+YF + + ++s v i F+L BAC01259.1 447 ------FFLYF--------------------LTAPSKFISVGVYMIPFAL 470 fggQPLLilGfTGPLLVFEkalFkFCkdndldYLvgRvwvGlWlvflvil + l+ l i+ BAC01259.1 471 L------------------------------------------LAPLPIV 478 lVAtegSfLVkfiTRfTEEiFsfLISLIFIyEafeKLvkifeehplicny ++A+ g + k + + c + BAC01259.1 479 AAALAGGS------------------------------KTKGKLEDECKT 498 nydsleadscactepkgktsvnpdnatLvikstdlcaivdgidwlelngs + +ad + BAC01259.1 499 K---GNADDLQ--------------------------------------- 506 eckkmsGvftGslcRHHGpYvPntaLLsliLmfGTfflamfLrkFknSrY m G + + k+ r BAC01259.1 507 -------------------------------MEGGSW-----KWLKSARV 520 fPgkvRrlisDFGavpisIliMvlvDfligdDvytqKLnVPsgfkvtrpt + li F ++l+ +l+ ++i ++P + BAC01259.1 521 L------LIIQF----WAVLV-SLLPYYIS--------QIPG-------A 544 aRGWfIpPlGensPFPaWmmlaAliPALLvfILIFmdqQITtlIVnrker P+ ++ +W++ L+ +ILI BAC01259.1 545 ------MPIQYA---VIWAV-------LSITILI---------------- 562 KLKKGsGfHLDLLvVgvlggvcSlfGLPW..lvAATVrSitHinALtves ++ ++ g S G+ W+ l A+ Sit BAC01259.1 563 -------------ILYAMFGSPSRAGVEWklLKATMITSIT--------- 590 easaPGyeqpkIveVREQRVtGllvalLvGlSifmgpPyILkfIPmaVsD + G L+ + f+++ L++ +++ BAC01259.1 591 --IGMG---------------------LMSIINFATA--QLGALILIP-- 613 LDLfGvFLYMGVtSLsGiQlfdRllLLfmPpKyyPdtiYirrVptrkvHL + L+ P +++ + ++ r ++ BAC01259.1 614 -----------------------MCLFSRPLRAQLEMNFLPRTVLLA--- 637 FTai<-* + ++ BAC01259.1 638 SNIL 641 DUF389: domain 1 of 1, from 461 to 646: score -74.2, E = 32 *->cGlvedsavllIGAMlIaPLlGPimgaavG..............lVv +G+++ ++l l+aPL Pi +aa++++++++++ +++ ++ BAC01259.1 461 VGVYMIPFAL-----LLAPL--PIVAAALAggsktkgkledeckTKG 500 gdrkL........alrgaknlLvglalaivvsaifslfvd...kapisHl ++L+ ++++ ++l++a++lL+ + a++vs ++ +++ +++ pi ++ BAC01259.1 501 NADDLqmeggswkWLKSARVLLIIQFWAVLVSLLPYYISQipgAMPIQYA 550 lelthleilsrtspRERLRPdflsld.................livCAll + +ls+t + +l++ +++++ + + + + ++i+ + BAC01259.1 551 VIWA---VLSITIL------IILYAMfgspsragvewkllkatMIT-SIT 590 aGiAGslslC.qasgvsgsLVGVAIavaLlPPAavvGlll.Aiadlelav G G +s+ + a + g+L+ l+P +++ l+A++ ++ BAC01259.1 591 IGM-GLMSIInFATAQLGALI--------LIPMCLFSRPLrAQLEMNFLP 631 gslvLfliNlaalvva<-* + vL+++N+++ v++ BAC01259.1 632 RT-VLLASNILLTVLG 646 SirB: domain 1 of 1, from 543 to 650: score -65.9, E = 63 *->myliLkylHlififlSvlLLvIRfvLqlknk.nwreakflKILPHln ++ y i + lS+ L+I +++ +++ e k+lK+ BAC01259.1 543 GAMPIQYA-VIWAVLSITILIILYAMFGSPSrAGVEWKLLKA----- 583 DTLL..LlSGigLmlithfsPFsaaapWLteKfllvllYIvLGfialsar T+ +++ G gLm i f t+ + ++l + +s r BAC01259.1 584 -TMItsITIGMGLMSIINFA---------TAQLGALIL---IPMCLFS-R 619 RrkSqtkfsqafllalfwlAcivfLAttkvayL<-* ++ fl + lA + L + ++ BAC01259.1 620 PLR--AQLEMNFLPRTVLLASNILLTVLGFPPA 650 FecCD: domain 1 of 1, from 408 to 656: score -231.2, E = 31 *->GalsispadvlqalfgggtegeievdeliiwdltlrRLPRvLlAlLV a ++ ++++ g+ e +++++ BAC01259.1 408 PAFHLKNENAKSSFLLRGGRL----TEGVVRSV-------------- 436 GAaLAVaGAilQgltRNPLAsPgilGinsGAslgvvlaivlfpgg...ls N L + + + ++++++ + ++ +s BAC01259.1 437 ----------------NNLLE----------KFHQSFFLYFLTAPskfIS 460 isalyllpsfAfaGaliaallVyllawkgrng.................. + + y++p+ A++ a + ++ l++ ++ +g+ +++ +++++ ++ + + BAC01259.1 461 V-GVYMIPF-ALLLAPLPIVAAALAGGSKTKGkledecktkgnaddlqme 508 ......lspvrLiLaGialsalfsAlttlllllsddlqdqqalfWltGSl +++ + l +r++L+ i + a++ +l +++ +++ + BAC01259.1 509 ggswkwLKSARVLLI-IQFWAVLVSLLPYYISQIPGAMP----------- 546 sgrnWedvklalpilliglplalllarqLnvLsLGddtAkgLGvnvervR ++ + ++++l i+++++++a++ G++ BAC01259.1 547 --IQYAVIWAVLSITILIILYAMF------------------GSPSRAGV 576 llllllvvlLtGaaVAvAGpIgFVGLivPHiaRrLvGt.dhrwLLPaSAL + ll ++++t+ ++ + G ++ i+ + + L+P++ + BAC01259.1 577 EWKLLKATMITSITIGM-GLMS--------IINFATAQlGALILIPMCLF 617 lGAi.LLllADllARtlfaPiElPvGivTAliGaPyFlYLLrr<-* ++++ l +l Rt+++ + + ++T ++G P +L+++ BAC01259.1 618 SRPLrAQLEMNFLPRTVLLASNI---LLT-VLGFPPAAFLIMK 656 DoxD: domain 1 of 1, from 553 to 657: score -42.5, E = 68 *->iglLllRlllGlvFlaaGlqKlfgwfggaGldgtsgyfeyalGlppP ++ L + +l+ l +fg++ s++ + ++ l BAC01259.1 553 WAVLSITILI----------ILYAMFGSP-----SRAGV-EWKL--- 580 tllawlagllElvgGlLlllGllTPRlaAlvlavfmlvAiflvHwpaWsp l a+ + + ++ Gl+ ++ ++T l+Al+l+ ++l+ +p BAC01259.1 581 -LKATMITSITIGMGLMSIINFATAQLGALILIPMCLF-----SRPL--- 621 tdsgfflgangyelallll.agflaLalaGaGrllSLDr<-* + l +n+ + +ll++ ++l+++ + ++l + BAC01259.1 622 ---RAQLEMNFLPRTVLLAsNILLTVLGFPPAAFLIMKG 657 UPF0118: domain 1 of 1, from 271 to 662: score -158.5, E = 60 *->lllilifllli..........lafipfinvietllvplliAlvlayl ++ +++ ++++++ +++++++ + ++ +p l l BAC01259.1 271 TMAAALIFKVGetrkygdrdsVTMYAEAS---NGQMPNLD------L 308 lnPv.vrfLqkkrgikrslaillvlllflvalvllgvllipllinqltqL ln v++ ++ g++ + ++ ll+ ++ v+ v+ l++L BAC01259.1 309 LNVVhYLAV-HRQGFRVNVETFNSLLSSSWLRVIAEVFQN------LGSL 351 iksl.Ptg.......................................... ++ ++P+++ + + ++ +++ + ++ ++ + ++++++ ++ + + BAC01259.1 352 LRKInPDWkldvtvpdyvegtanlassmynqalgvptgshgafrdyqvda 401 .......dyidslqnwlnelpeslpelealdasvviqqlnsslsdilsni + + +++ + +n + + + +l + + +s+ + l+++ BAC01259.1 402 vslefapAFHLKNENAKSSFLLRGGRLT--------EGVVRSVNNLLEKF 443 lssi...lnsllsllasltglllqlilvLv...llfffLldgeklrqgii +s + ++ s ++s++ +++ + l+L + +++ L++g k + ++ BAC01259.1 444 HQSFflyFLTAPSKFISVGVYMIPFALLLAplpIVAAALAGGSKTKGKLE 493 sllPkryrervrailrelndtlggylrgqvivaliigvlvfigll...il +++ +++++ e + +++l++ ++ +i++ +v+++ll+ i BAC01259.1 494 D--ECKTKGNADDLQMEGGS--WKWLKSARVLLIIQFWAVLVSLLpyyIS 539 gv......pyAlllAllvglanlIPyi...GpviiliPiaiialltggGi ++++ + +yA ++A+l+ +I y+ + p + + + +l + BAC01259.1 540 QIpgampiQYAVIWAVLSITILIILYAmfgSPSRAGVEW--KLLKATM-- 585 iwaalivlivvllvqqiedniLrPklmgkr...........lglhPlvil i+ + i + ++ ++++ + + +l+ + +++ + + +++ P ++l BAC01259.1 586 ITSITIGMGLMSIINFATAQLGALILIPMClfsrplraqleMNFLPRTVL 635 lsliaGgslfGlvGlilapPltavlkaildayr<-* l+ + + ++G+ pP++ ++ l BAC01259.1 636 LASNILLTVLGF------PPAAFLIMKGLSKGS 662 PhaG_MnhG_YufB: domain 1 of 1, from 585 to 674: score -66.0, E = 59 *->Mtelilaaaliglilv.lIGalfsllGalGllRFPDvYtRLHAATKa M++ i ig +l+++I ++ lGal l+ BAC01259.1 585 MITSIT----IGMGLMsIINFATAQLGALILI--------------- 612 tTlGvisilLGvfLiflarlvelrsPYlavltklilailFilLTn...Pv +++l+ L ++ +e +l +++l++ ilLT + P BAC01259.1 613 -----PMCLFSRPLRAQ---LEM-----NFLPRTVLLASNILLTVlgfPP 649 gaHllarAAYlsGvppweksvvDkyke<-* +a l++++ ls+ + v D + + BAC01259.1 650 AAFLIMKG--LSKGSWTVDIVGDFWLW 674 UPF0032: domain 1 of 1, from 522 to 676: score -110.4, E = 48 *->kRllyiliafllafiacfyfvkeiyelLqrPllglppgstfiatspt l++ ++a+l+ ++ ++ + p g+ i+ +++ BAC01259.1 522 --LIIQFWAVLVSLLPYYISQ-------------IP-GAMPIQYAVI 552 EafftyiklsfivgivlssPvilYQiWlFiaPGLYekERkvilpllvlSs ++++ +++ + ++++ F P++ E k++++ +++S+ BAC01259.1 553 ---WAVLSITILIILYAM----------FGSPSRAGVEWKLLKATMITSI 589 aivLFliGlfFaYyvlfPialgFl.lsfgatstnllvvepllsideYldF iG+ + + +++F ++++ga l+ p+ ++++l+ BAC01259.1 590 -----TIGMGL----MS--IINFAtAQLGA-----LILIPMCLFSRPLRA 623 ilrLlfsfGvaFeiPviiilLlklgivtyetLkkaRryiivlffvigall l + f + +lL +++++t++ + a +f++ l BAC01259.1 624 QLEMNFLP--------RTVLLASNILLTVLGFPPA------AFLIMKGLS 659 TPsFFpDvlsQillaip<-* + s+ +D++ ++ l BAC01259.1 660 KGSWTVDIVGDFWLWME 676 Competence: domain 1 of 1, from 428 to 680: score -113.0, E = 19 *->ALvlGdrsglskelweafqrlGlaHLlAISGlHvglvagllffllrr L+ G + +++ l+ +H + l+fl ++ BAC01259.1 428 -LTEGVVRSVNNLLEK---------------FHQ---SFFLYFLTAP 455 lglrfrprparlpalplkwakllglafllfYlalLaGfspSvlRAllmla + + +g+ + +f +lLa+ ++++ A+l+++ BAC01259.1 456 SKF-----------------ISVGVYMIPFA-LLLAP--LPIVAAALAGG 485 lkvlla.......................llfrrrlsglqvLalslllil k ++ +++ +++++ ++ + ++++ + l++ r l +q +a ++l+ BAC01259.1 486 SKTKGKledecktkgnaddlqmeggswkwLKSARVLLIIQFWAVLVSLL- 534 lldPlallslGFwLSflAvgaLvaWa.svflllLlwkfaqrlpgrqlvvl P+ + ++ + + v+Wa + +l+++ + p+r+ v++ BAC01259.1 535 ---PYYISQIPGAMPIQYA---VIWAvLSITILIILYAMFGSPSRAGVEW 578 kwllsllaasllaqlltaplllllFgqlsligvlaNllavPlislllvPl k l++ +++s + +++ ++ + +q l l+++P + l+ Pl BAC01259.1 579 KLLKATMITSITIGMGLMSIINFATAQ------LGALILIPMC-LFSRPL 621 llllllllllppglglpllslwllllwilkllllllavpnaawvv.vaap + l +++lp ++ll++ +ll++l ++p+a+++ ++ ++ BAC01259.1 622 RAQL-EMNFLP----------RTVLLASNILLTVLGFPPAAFLIMkGLSK 660 llllllallslllllllrll<-* +++ ++ ++l+ +++ BAC01259.1 661 GSWTVDIVGDFWLWMEFLWE 680 Gaa1: domain 1 of 1, from 147 to 684: score 43.1, E = 1.4e-10 *->vnGeNvYGiLRAPRgDgTEalvLaVPwgrssdetnisagvaLllALa G N Gi+RAPRgDg Ea+vL+ P+ + n + aL+ + BAC01259.1 147 NFGINTVGIIRAPRGDGKEAIVLVTPYNSQKVTPNELLSLALGFSVF 193 dyfrrqvyWAKDIifvitDGGEKNDSiEkpalgleAwLeaYhdiealssv +r +KDI+ + DS + + ++wL Yh +s BAC01259.1 194 SLLSRAAWLSKDIVWLSA------DSQFGEYSAVSSWLNQYHN-PMFLSH 236 k...........sitvePd......eLqa..raGSiqAAvvLelsstevk + + +++ + + i ++Pd++ ++ eL a +raG AA+ + t BAC01259.1 237 PvnldtkiyganQILYKPDgtaekaELMAfkRAGTMAAALIFKVGETRKY 286 se..vveVqyeGLNGqLPNLDLfNliqrIaek.................. ++++ v e+ NGq PNLDL+N + a ++++ + + ++ ++ +++ BAC01259.1 287 GDrdSVTMYAEASNGQMPNLDLLNVVHYLAVHrqgfrvnvetfnsllsss 336 ......eGllssyklklq......pkdrhSnsgfeyssRLkvLllglltQ + e ++ + l++ +++ ++d + ++ e + L ++ Q BAC01259.1 337 wlrviaE-VFQNLGSLLRkinpdwKLDVTVPDYVEGT---ANLASSMYNQ 382 AsssvtgaHgNElFsrYRidaLTlgAnRRaikak...lkfsydlvavGLL A++++tg+Hg +F Y +da+ l a + k+++ k+s+ l + G BAC01259.1 383 ALGVPTGSHG--AFRDYQVDAVSL-EFAPAFHLKnenAKSSF-LLRGG-- 426 kaiEgmfRsLNNLLErLHQSFFfYlLlddlrFvSIgsYmPslviLvaalv + Eg+ Rs NNLLE++HQSFF+Y+L ++++F+S g Ym++ L+a+l BAC01259.1 427 RLTEGVVRSVNNLLEKFHQSFFLYFLTAPSKFISVGVYMIPFALLLAPLP 476 LsAYeeWinlheanieLekpagWLRlvelevwlvsllepskvelaFlsIa + A + ++ Le + + + +++ + k l a BAC01259.1 477 IVAAALAGG-SKTKGKLEDECK-TKGNADDLQMEG--GSWK----WLKSA 518 ayylvstiigltcFvLliLqqkfvgiefssaelt.esvllllSllsigll +l+ + +++ +L+ + + +++++ + ++ +lS+ + l+ BAC01259.1 519 RVLLIIQFWAVLVSLLP----YYISQIPGAMPIQyAVIWAVLSI--TILI 562 lPnntfvVVstqlPdr.Gflllklvlllalalrlsv..iallNfslglvv + + P r G + +l ++++ + +++ + ++ Nf+ + BAC01259.1 563 IL---YAMF--GSPSRaGVEWKLLKATMITSITIGMglMSIINFATAQLG 607 avllvPLgialelkpeisrvelrldlllAtlqlisltaillvvlfltnlf a +l+P + l sr + +ql ++ + v+l+ + BAC01259.1 608 ALILIP--MCLF-----SR--------PLRAQLEMNFLPRTVLLAS---N 639 fplvpfgkLlakewelffaalakgvleelvlgewiFfvlSigflplWlli ++l+ +g + ++++l + l k +++ v +g++ lW ++ BAC01259.1 640 ILLTVLG--FPPAAFLIMKGLSK----------GSWTVDIVGDFWLWMEF 677 waktfek<-* + + BAC01259.1 678 LWEWSSA 684 sugar_tr: domain 1 of 1, from 293 to 688: score -217.4, E = 46 *->valvaalgGgflfGyDtgviggflalidflfrfglltssgalaslvg + +a G + D+ + +la+ + fr + t ++ l s BAC01259.1 293 TMYAEASNG-QMPNLDLLNVVHYLAVHRQGFRVNVETFNSLLSS--- 335 ystvltglvvsifflGrliGslfaGklgdrfGRkksllialvlfviGall ++ i+ + +Gsl+ d+ l +++ +v G++ BAC01259.1 336 ------SWLRVIAEVFQNLGSLLRKINPDWK-----LDVTVPDYVEGTAN 374 sgaapgytTiGlwafyllivGRvlvGlgvGgasvlvPmYisEiA..Pkal + + ++ G ++ ++R + +v ++ + + ++ +E+A++ l BAC01259.1 375 LASSMYNQALG-VPTGSHGAFRDYQVDAVSLEFAPAFHLKNENAksSFLL 423 R.GalgslyqlaitiGilv.....AaiiglglnktnndsalnswgWRipl R+G+l+ + ++ l+++ ++++++ + ++ + i++ BAC01259.1 424 RgGRLT--EGVVRSVNNLLekfhqSFFLYFLTAPSKF----------ISV 461 glqlvpalllligllflPESPRwLvekgkleeArevLaklrgvedvdqei g+ ++p +lll+ l+ + a l g ++++ ++ BAC01259.1 462 GVYMIPFALLLAPLPIVA-------------------AALAGGSKTKGKL 492 qeikaeleatvseekagkaswgelfrgrtrpkvrqrllmgvmlqafqQlt +++ ++ + + + ++ sw+ l++ r+l+++ +a++ BAC01259.1 493 EDECKTKGNADDLQ-MEGGSWKWLKSA--------RVLLIIQFWAVL--- 530 GiNaifYYsptifksvGvsdsvasllvtiivgvvNfvfTfvaLiflvDrf p++ ++ ++++ + v +v ++++ ++ a++ r+ BAC01259.1 531 -----VSLLPYYISQIPGAMPIQ-YAVIWAVLSITILIILYAMFGSPSRA 574 GRRpllllG..aagmaicflilgasigvallllnkpkdpsskaagivaiv G + ll + ++ +i++ +++ + ++ ++ +++ BAC01259.1 575 GVEWKLLKAtmITSITIGMGLMS----IINFATAQ------------LGA 608 fillfiafFalgwGpipwvilsElFPtkvRskalalataanwlanfiigf +il+ ++F+ p+ + +++ +++ la + + + + BAC01259.1 609 LILIPMCLFS---RPLRAQL-----EMNFLPRTVLLASNILLTVLGFPPA 650 lfpyitgaiglalggyvflvfagllvlfilfvfffvPETkGrtLEeieel +f ++ g ++ g ++ + + +++ + f+ E +l BAC01259.1 651 AFLIMKGLSK---GSWTVDIVGDFWLWME-----FLWEWSS-----ATYL 687 f<-* + BAC01259.1 688 Y 688 MVIN: domain 1 of 1, from 373 to 693: score -267.9, E = 28 *->iAayfGAgllaDaFnvAFriPN....llRrLfaiEGafssAFvPvfa A+l+++ +n A +P+++++ +R + a s F P f BAC01259.1 373 ------ANLASSMYNQALGVPTgshgAFRDYQV--DAVSLEFAPAF- 410 elksaqdkdeaaeFvrkvstllilvlllvtllgilaapwvirllapgfad +lk +++++++F+ + +l +v+ +v l BAC01259.1 411 HLK---NENAKSSFLLRGGRLTEGVVRSVNNL---------------L-- 440 aekfsLtvsllritfPy.lllvsLsavfgavLNarkkFfapafsPvllNi ekf + +l +t+P++ + v+++ + +a+L a + BAC01259.1 441 -EKFHQSFFLYFLTAPSkFISVGVYMIPFALLLAP--------------L 475 vvIltllflanyfgrepiyevkvtiwleaellLaiGvliGGvlQlLvqlp + +++l ++ +e k + + BAC01259.1 476 PIVAAALAGGSKTKGKLEDECKTK----G------------------NAD 503 flrkaglldftklkprfnfrdkgvkrflklalptllgvsvsQlnllidta l ++g + + + ++ + +l ++++l vs l +i BAC01259.1 504 DLQMEGGS--------WKWLKS-ARVLLIIQFWAVL---VSLLPYYI--- 538 lASfaqlkeGsisylyYAdriyqLPlGiFgvsvstvlLPrl.Srsaksgd S ++G+ + +YA+ ++++++++++l + ++S s + + BAC01259.1 539 --S---QIPGAMPI-QYAVI-----WAVLSITILIILYAMFgSPSRAGVE 577 wdefrdlldqairltllltiPasfgllvLSdpIvsvLyerGaFsaedvta w+ +++++ ++i++ + l+ s+++ ++++ BAC01259.1 578 WKLLKATMITSITIGMGLM---------------SIIN-------FATAQ 605 tasvLaayalGLipyaLvklLsrvFYAredtktPfkislisavlnillsl + + Lip+ L +sr A+ + + f ++ ++ nill BAC01259.1 606 LGALI------LIPMCL---FSRPLRAQLEMN--FLPRTVLLASNILLT- 643 vayllllpplgvvGlAlAtslsawinlvfLyyllrkrlvgDghsargikt +l++pp +a fL+++ g s+ + + BAC01259.1 644 ---VLGFPP------------AA-----FLIMK--------GLSKGSWTV 665 flasgvlvvltalmsgvilllssltqgewvvgslllilvgvl<-* ++ + +l+++ l++ w+ ++l ++v BAC01259.1 666 DIVG-----------DFWLWMEFLWE--WSSATYLYV-FLVH 693 oxidored_q1: domain 1 of 1, from 502 to 693: score -191.6, E = 92 *->SsnLllmylgwEllllpsylLigfwgtsprsleAglkyflytalgSl +++L + g+s++ l+++ +++++ BAC01259.1 502 ADDL------------------QMEGGSWKWLKSARVLLIIQ----- 525 fLLfgilyiysltGNffslnFdllfkfdfgmpnfglnntmynsnllllll ++++l+ ++ BAC01259.1 526 --FWAVLVSLLPYY------------------------------------ 537 llllvaflvKspqfPfHfWLPdamegpppvsslilAatlvkaGlylLlRl ++q+P g++p++ ++ a+l + +l++L+ + BAC01259.1 538 ----------ISQIP----------GAMPIQYAVIWAVLSITILIILYAM 567 spllfnkpsnlisyillilgilsmllgsliaLnQtDiKkliAYSSishmG + + + + + ++++s+ +i+ mG BAC01259.1 568 FGSPSRAG-------VEWKLLKATMITSI---------------TIG-MG 594 ymllalgsgtnsiigitgailhlltHalfsalLFllagsiihrmgsenvh +m ++ ++ +++ l l+ +lfs L +++ ++ + BAC01259.1 595 LMSIINFAT------AQLGALILIPMCLFSRPLRAQLEMNFLPR------ 632 trdlrnlgglsnsmPilalllliflIllslmGlPpllGFisK........ ++ll Ill +G+Pp + +i K+ ++++ + BAC01259.1 633 -----------------TVLLASN-ILLTVLGFPPAAFLIMKglskgswt 664 .fii..Lesllsskdfgslflalllvitsllsa<-* +i+++++++++ ++ ++ ++ ++++l BAC01259.1 665 vDIVgdFWLWMEF----LWEWSSATYLYVFLVH 693 NADHdh: domain 1 of 1, from 463 to 704: score -235.0, E = 68 *->limillniLlliipvllaVAFLtllERKvlgymQqRkGPNvVGplGl + mi++ +Ll +p+++a+ l+ kG BAC01259.1 463 VYMIPFALLLAPLPIVAAA----------LAGGSKTKGK-------- 491 LQpfaDgiKLfiKEpvvPstSspvlFllaPvlaltlaLlaWav..lPfdy + D K + + + ++ + l + + l++ + a +v+ lP y BAC01259.1 492 ---LEDECKTKGNADDLQMEGGSWKWLKSARVLLIIQFWAVLVslLP--Y 536 gfvlad..lnlGvLfiLalSSLaVYgiLiSGWaSNSKYallGA.LRAvAQ ++ +++ + ++ +i a+ S+ iL Ya+ G++ RA BAC01259.1 537 YISQIPgaMPIQYAVIWAVLSITILIIL---------YAMFGSpSRAG-- 575 tISYEvsL..aLiLLsiillaGslSNSfnlsdivnaQehdyGlLlWlllp +E L +a + si + G ++++++ +aQ + BAC01259.1 576 ---VEWKLlkATMITSITIGMGL----MSIINFATAQLG----------- 607 lfplflmFfistLAEtNRaPFDLpEGEsEELVsGfnvEYSggpFaLfFla +++l m+++s p ++ + E + Fl BAC01259.1 608 ALILIPMCLFSR-----------P--LRA------QLE-------MNFLP 631 EYaniilmslLtsvLFLGGwlfsiplpil........vllpllgiiifil + ++ ++ L +vL + + f i + ++++ + + v +++l+ BAC01259.1 632 R-TVLLASNILLTVLGFPPAAFLIMKGLSkgswtvdiVGDFWLW------ 674 KtlllsflfiwvRasyPRfRYDQLMrLgWKnfLPlsLall.lltasllit + f+w +s + +L ++L l++++ + ++ BAC01259.1 675 ------MEFLWEWSS--------------ATYLYVFLVHLpCWLLCIHVL 704 <-* BAC01259.1 - - // Start with Pfam_fs (from /local/index/hmmer) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /local/index/hmmer/Pfam_fs Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- Gaa1 Gaa1-like, GPI transamidase component 125.8 6.2e-36 3 Sdh_cyt Succinate dehydrogenase cytochrome b 3.3 17 1 BTK BTK motif 3.0 52 1 DUF403 Bacterial domain of unknown function 2.6 9.2 1 Rep-A_N Replication factor-A protein 1, N-ter 2.1 21 1 PH PH domain 1.8 44 1 CBM_20 Starch binding domain 1.6 55 1 AfsA_repeat A-factor biosynthesis repeat 1.2 85 1 DSS1_SEM1 DSS1/SEM1 family 1.1 69 1 FMN_bind FMN-binding domain 0.6 85 1 Cation_ATPase_C Cation transporting ATPase, C-terminu 0.5 40 1 Zip ZIP Zinc transporter 0.5 47 1 Porin_2 Porin subfamily 0.3 27 1 DUF140 Domain of unknown function DUF140 0.3 57 1 CG-1 CG-1 domain 0.3 52 1 Corona_S2 Coronavirus S2 glycoprotein 0.1 20 1 CitMHS Citrate transporter -0.6 79 1 FecCD FecCD transport family -0.7 92 1 cytochrome_b_N Cytochrome b(N-terminal)/b6/petB -0.9 88 1 EcoRI Restriction endonuclease EcoRI -1.3 96 1 Vps35 Vacuolar protein sorting-associated p -2.1 66 1 ICL Isocitrate lyase family -3.9 90 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- Vps35 1/1 36 83 .. 572 638 .. -2.1 66 ICL 1/1 72 86 .. 393 407 .. -3.9 90 PH 1/1 67 88 .. 64 85 .] 1.8 44 Corona_S2 1/1 132 150 .. 1 19 [. 0.1 20 Gaa1 1/3 148 229 .. 1 89 [. 26.9 3.3e-07 Cation_ATPase_C 1/1 274 291 .. 179 196 .] 0.5 40 Gaa1 2/3 297 312 .. 137 152 .. 7.3 0.16 CG-1 1/1 312 318 .. 120 126 .] 0.3 52 cytochrome_b_N 1/1 308 321 .. 209 222 .] -0.9 88 EcoRI 1/1 338 350 .. 247 260 .] -1.3 96 CBM_20 1/1 356 367 .. 48 59 .. 1.6 55 AfsA_repeat 1/1 383 394 .. 75 86 .] 1.2 85 Rep-A_N 1/1 427 438 .. 1 12 [. 2.1 21 FMN_bind 1/1 429 443 .. 94 108 .] 0.6 85 BTK 1/1 437 449 .. 1 14 [. 3.0 52 DUF403 1/1 430 450 .. 303 323 .] 2.6 9.2 Gaa1 3/3 379 479 .. 193 296 .. 94.0 1.1e-26 CitMHS 1/1 526 547 .. 463 484 .] -0.6 79 FecCD 1/1 543 559 .. 1 17 [. -0.7 92 Sdh_cyt 1/1 553 565 .. 113 125 .] 3.3 17 Zip 1/1 589 600 .. 352 363 .] 0.5 47 DSS1_SEM1 1/1 617 628 .. 59 70 .] 1.1 69 Porin_2 1/1 627 638 .. 1 10 [. 0.3 27 DUF140 1/1 634 647 .. 253 266 .] 0.3 57 Alignments of top-scoring domains: Vps35: domain 1 of 1, from 36 to 83: score -2.1, E = 66 *->lrksmAayivqsilknassGaGGGGWkPtvvvtvDseskLhPPpsnt + +++A++ ++s +kn t se L P nt BAC01259.1 36 TAGIIALLFLPSLAKN----------------TYLSENALIPGSANT 66 livtaDkLqVdsLfeLispL<-* l++t+D V+++ +++++ BAC01259.1 67 LFSTED---VQEANRFAKGI 83 ICL: domain 1 of 1, from 72 to 86: score -3.9, E = 90 *->DyeqAkeFAeGVkak<-* D+++A +FA+G+ a+ BAC01259.1 72 DVQEANRFAKGIEAA 86 PH: domain 1 of 1, from 67 to 88: score 1.8, E = 44 *->llqaeseeerkeWvkaiqsair<-* l+++e+ +e+++ k i++ai BAC01259.1 67 LFSTEDVQEANRFAKGIEAAIG 88 Corona_S2: domain 1 of 1, from 132 to 150: score 0.1, E = 20 *->sndtsvctepvltYSsfgv<-* s +++ + p tY +fg+ BAC01259.1 132 SMTNNMAAKPNGTYTNFGI 150 Gaa1: domain 1 of 3, from 148 to 229: score 26.9, E = 3.3e-07 *->vnGeNvYGiLRAPRgDgTEalvLaVPwgrssdetnisagvaLllALa G N Gi+RAPRgDg Ea+vL+ P+ + n + aL+ + BAC01259.1 148 -FGINTVGIIRAPRGDGKEAIVLVTPYNSQKVTPNELLSLALGFSVF 193 dyfrrqvyWAKDIifvitDGGEKNDSiEkpalgleAwLeaYh<-* +r +KDI+ + DS + + ++wL Yh BAC01259.1 194 SLLSRAAWLSKDIVWLSA------DSQFGEYSAVSSWLNQYH 229 Cation_ATPase_C: domain 1 of 1, from 274 to 291: score 0.5, E = 40 *->sflvfildElrKfikRrr<-* ++l+f ++E+rK+ R BAC01259.1 274 AALIFKVGETRKYGDRDS 291 Gaa1: domain 2 of 3, from 297 to 312: score 7.3, E = 0.16 *->eGLNGqLPNLDLfNli<-* e+ NGq PNLDL+N + BAC01259.1 297 EASNGQMPNLDLLNVV 312 CG-1: domain 1 of 1, from 312 to 318: score 0.3, E = 52 *->VHYleVk<-* VHYl V+ BAC01259.1 312 VHYLAVH 318 cytochrome_b_N: domain 1 of 1, from 308 to 321: score -0.9, E = 88 *->illllHLlfLHetG<-* +l ++H l+ H +G BAC01259.1 308 LLNVVHYLAVHRQG 321 EcoRI: domain 1 of 1, from 338 to 350: score -1.3, E = 96 *->LRvladDLdenLts<-* LRv+a+ ++nL s BAC01259.1 338 LRVIAE-VFQNLGS 350 CBM_20: domain 1 of 1, from 356 to 367: score 1.6, E = 55 *->YPtWygdvsLPA<-* +P+W++dv++P+ BAC01259.1 356 NPDWKLDVTVPD 367 AfsA_repeat: domain 1 of 1, from 383 to 394: score 1.2, E = 85 *->apgvPlgyhfll<-* a gvP+g h+++ BAC01259.1 383 ALGVPTGSHGAF 394 Rep-A_N: domain 1 of 1, from 427 to 438: score 2.1, E = 21 *->sLTpGAIaailn<-* +LT+G+++ ++n BAC01259.1 427 RLTEGVVRSVNN 438 FMN_bind: domain 1 of 1, from 429 to 443: score 0.6, E = 85 *->sravvksvkraLgka<-* + +vv+sv + L+k+ BAC01259.1 429 TEGVVRSVNNLLEKF 443 BTK: domain 1 of 1, from 437 to 449: score 3.0, E = 52 *->NnnllqkYHPsfwv<-* nnll k+H sf++ BAC01259.1 437 -NNLLEKFHQSFFL 449 DUF403: domain 1 of 1, from 430 to 450: score 2.6, E = 9.2 *->aelitevrrvGdaIaqeYFva<-* +++++ v+ + ++ +q++F + BAC01259.1 430 EGVVRSVNNLLEKFHQSFFLY 450 Gaa1: domain 3 of 3, from 379 to 479: score 94.0, E = 1.1e-26 *->lltQAsssvtgaHgNElFsrYRidaLTlgAnRRaikak...lkfsyd ++ QA++++tg+Hg +F Y +da+ l a + k+++ k+s+ BAC01259.1 379 MYNQALGVPTGSHG--AFRDYQVDAVSL-EFAPAFHLKnenAKSSF- 421 lvavGLLkaiEgmfRsLNNLLErLHQSFFfYlLlddlrFvSIgsYmPslv l + G + Eg+ Rs NNLLE++HQSFF+Y+L ++++F+S g Ym++ BAC01259.1 422 LLRGG--RLTEGVVRSVNNLLEKFHQSFFLYFLTAPSKFISVGVYMIPFA 469 iLvaalvLsA<-* L+a+l + A BAC01259.1 470 LLLAPLPIVA 479 CitMHS: domain 1 of 1, from 526 to 547: score -0.6, E = 79 *->KWAvltslVilaiAiLmGiipl<-* +WAvl+sl i+ + G++p+ BAC01259.1 526 FWAVLVSLLPYYISQIPGAMPI 547 FecCD: domain 1 of 1, from 543 to 559: score -0.7, E = 92 *->Galsispadvlqalfgg<-* Ga++i++a+++++l + BAC01259.1 543 GAMPIQYAVIWAVLSIT 559 Sdh_cyt: domain 1 of 1, from 553 to 565: score 3.3, E = 17 *->yivlvlsvvLall<-* ++vl+++++++l+ BAC01259.1 553 WAVLSITILIILY 565 Zip: domain 1 of 1, from 589 to 600: score 0.5, E = 47 *->lllGfalMllia<-* + +G++lM++i+ BAC01259.1 589 ITIGMGLMSIIN 600 DSS1_SEM1: domain 1 of 1, from 617 to 628: score 1.1, E = 69 *->FSnQLrAELekk<-* FS+ LrA Le BAC01259.1 617 FSRPLRAQLEMN 628 Porin_2: domain 1 of 1, from 627 to 638: score 0.3, E = 27 *->MNi..ksvLLGS<-* MN+ +++vLL+S BAC01259.1 627 MNFlpRTVLLAS 638 DUF140: domain 1 of 1, from 634 to 647: score 0.3, E = 57 *->vifildfvlTaimf<-* v+++++++lT ++f BAC01259.1 634 VLLASNILLTVLGF 647 // Start with Repeat Library (from /local/index/hmmer) hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /local/index/hmmer/adrade-repeats.hmm Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Prosite --------------------------------------------------------- | ppsearch (c) 1994 EMBL Data Library | | based on MacPattern (c) 1990-1994 R. Fuchs | --------------------------------------------------------- PROSITE pattern search started: Tue Jan 28 11:28:29 2003 Sequence file: BAC01259.1.fa ---------------------------------------- Sequence BAC01259.1 (718 residues): Matching pattern PS00001 ASN_GLYCOSYLATION: 142: NGTY Total matches: 1 Matching pattern PS00005 PKC_PHOSPHO_SITE: 4: SAK 175: SQK 283: TRK 511: SWK 517: SAR Total matches: 5 Matching pattern PS00006 CK2_PHOSPHO_SITE: 4: SAKE 69: STED 179: TPNE 364: TVPD Total matches: 4 Matching pattern PS00008 MYRISTYL: 20: GVFLAS 62: GSANTL 82: GIEAAI 143: GTYTNF 270: GTMAAA 301: GQMPNL 385: GVPTGS 389: GSHGAF 425: GGRLTE 431: GVVRSV 484: GGSKTK 657: GLSKGS Total matches: 12 Matching pattern PS00016 RGD: 160: RGD Total matches: 1 Matching pattern PS00029 LEUCINE_ZIPPER: 188: LGFSVFSLLSRAAWLSKDIVWL Total matches: 1 Total no of hits in this sequence: 24 ======================================== 1329 pattern(s) searched in 1 sequence(s), 718 residues. Total no of hits in all sequences: 24. Search time: 00:00 min ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with Profile Search ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ Start with motif search against own library ***** bioMotif : Version V41a DB, 1999 Nov 11 ***** SeqTyp=2 : PROTEIN search; >APC D-Box is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >ER-GOLGI-traffic signal is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >INTRA-SIGNAL-M minimal SH3 binding is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >INTRA-SIGNAL-M deubiquitinating enzyme SH3 domain binding motif (Kato, 2000) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >INTRA-SIGNAL-M minimal class I consensus-SH3 binding motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >INTRA-SIGNAL-M minimal class II consensus-SH3 binding motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >INTRA-SIGNAL-M exact 14-3-3 binding consensus (Muslin 1996 Cell 84 889) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >INTRA-SIGNAL-M 14-3-3 binding motif in RAF and others (Muslin 1996 Cell 84 889) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >INTRA-SIGNAL-M WW domain binding motif in formins (Bedford 1997) is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >INTRA-SIGNAL-M PY motif for WW domain is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >TM-CYTOPLASMIC-M di-hydrophobic endocytosis motifs for internalized transmembrane proteins is the MOTIF name >BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. ;LENGTH=718; DIRECT_SEQUENCE n 1 solutions m %_E 165-165 %_XXXL 166-169 %_V 170-170 f >STATISTICS Total : 1 solutions in 1 sequences, 718 units; out of 1 sequences, 718 units >TM-CYTOPLASMIC-M tyrosine-based endocytosis motif for internalized transmembrane proteins is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >TM-EXTRACELL-M Endocytosis signal for internalized transmembrane proteins is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >EXTRACELL-M minimal furin protease cleavage site motif is the MOTIF name >BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. ;LENGTH=718; DIRECT_SEQUENCE n 3 solutions m %_RXXR 157-160 f m %_RXXR 424-427 f m %_RXXR 619-622 f >STATISTICS Total : 3 solutions in 1 sequences, 718 units; out of 1 sequences, 718 units >EXTRACELL-M extended furin protease cleavage site motif is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >EXTRACELL-M zinc binding motif in MMPs is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >EXTRACELL-M g alpha binding go loco is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS PDX-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS QKI-5 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS HCDA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS SV40 LrgT experimentally determined is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS H2B experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS v-Rel experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Amida experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Amida experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS RanBP3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Pho4p experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS DNAhelicaseQ1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS LEF-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS TCF-1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR p53-NLS1 NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS hum-Ku70 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS GAL4 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS act/inh betaA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS BDV-P experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS TR2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS THOV NP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS polyomaVP1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS HIV-1 Tat experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS HIV-1 Rev experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Rex experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS SRY experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS SRY experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS SOX9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS SOX9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS NS5A experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS DNAse EBV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS DNAse EBV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS adenovE1a experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS ystDNApolalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS hVDR experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS CPV capsid experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS hGlu.cort.experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS cFOS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS cJUN experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS hDNApolalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS hDNAtopoII experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS hDNAtopoII experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS hBLM experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS hARNT experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS influenzaNP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS influenzaNP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS p54 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS hProTalpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Tst1/Oct6 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS protHsc9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS protHsci experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS protHsc3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Ta alpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Pax-QNR experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Hunt.Dis.pro experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS MyoD experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS MyoD experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS opaque2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS CTP experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS HCV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS HCV experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS p110RB1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS VirD2-Nterm experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS VirD2-Cterm experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Nucloplasmin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Nucleolin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS ICP-8 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Nab2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS M9 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS lscMyc experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS humKprotein experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS FluA experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Mat-alpha experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS polyoma Lrg-T experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS polyoma Lrg-T experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS SV40 VP1 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS SV40 VP2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS polyoma VP2 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS c-myb experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS N-myc experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS p53 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS c-erb-A experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS yeast SKI3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS L29 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS L29 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS Max experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS L3 experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >NUCLEAR NLS dyskerin experimentally determined NLS is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >PDZ domain binding motif science 278_2075_pawson is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units >WW domain binding motif science 278_2075_pawson is the MOTIF name >STATISTICS Total : 0 solutions in 0 sequences, 0 units; out of 1 sequences, 718 units ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~ ~~~ Start with HMM-search search against own library hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /local/index/hmmer/own-hmm.lib Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- gaa1-glob 48.4 2.3e-12 1 gaa1-glob-1-short 39.4 6.6e-08 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- gaa1-glob-1-short 1/1 280 452 .. 1 152 [] 39.4 6.6e-08 gaa1-glob 1/1 46 456 .. 1 357 [] 48.4 2.3e-12 Alignments of top-scoring domains: gaa1-glob-1-short: domain 1 of 1, from 280 to 452: score 39.4, E = 6.6e-08 *->lvLeLStlevkGseivdidteGLNGqLPNLDlfNgIaqiimakegll v e ++ + v e+ NGq PNLDl+N + + g++ BAC01259.1 280 -VGETRKYGDR--DSVTMYAEASNGQMPNLDLLNVVHYLAVHRQGFR 323 vsl.....................qGklrhqdrssds........afasr v ++ ++ +++ + + +++G l ++ +++ + + + + + + BAC01259.1 324 VNVetfnsllssswlrviaevfqnLGSLLRK--INPDwkldvtvpDYVEG 371 lkvlllmllsqAlstvtgpHglfprYgidaLTvrsnRlklh......das l+ ++ qAl+++tg+Hg f Y +da ++ ++++ +++++ s BAC01259.1 372 TANLASSMYNQALGVPTGSHGAFRDYQVDAVSLEF--APAFhlknenAKS 419 svfPhsdlPvafGqaiEgifRsLNNLLErlHHQSFFfYll<-* s + l + G++ Eg+ Rs NNLLE++H QSFF+Y+l BAC01259.1 420 S----FLL--RGGRLTEGVVRSVNNLLEKFH-QSFFLYFL 452 gaa1-glob: domain 1 of 1, from 46 to 456: score 48.4, E = 2.3e-12 *->dertrrtYiSENALgPglVy....sefrfdkskiarqllrefkelrk tY+SENAL Pg +++ ++e +++ a + ++ e r BAC01259.1 46 PSLAKNTYLSENALIPGSANtlfsTEDVQEANRFAKGIEAAIGESRG 92 kssslpsaligsvleelGlevhtpkf................kqkypfnd ++ i+ + lG ev+ ++f ++++ ++ + ++ + + + BAC01259.1 93 -GTTEIPKFIAQQTKNLGAEVYYHEFlpdskcfhplkfftsmTNNMAAKP 141 pkeeRYmvnGeNvYGilRAPRgdgTEalvlaVpygnsdkeynDisasVsL + + G N Gi+RAPRgdg Ea+vl+ py + n++ s +L BAC01259.1 142 NGTY--TNFGINTVGIIRAPRGDGKEAIVLVTPYNSQKVTPNEL-LSLAL 188 llALadyfrrqkyWAKDIifvfvdGGEKNDSiEkpalGleAwLeaYhD.. + + +r ++KDI+ + DS + + ++wL Yh + BAC01259.1 189 GFSVFSLLSRAAWLSKDIVWLSA------DSQFGEYSAVSSWLNQYHNpm 232 ..............ieallslssiyiesteLqa..raGsiqAAlvLeLSs +++ + +++ + + l + + e eL a +raG AAl + BAC01259.1 233 flshpvnldtkiygANQILYKPDGTAEKAELMAfkRAGTMAAALIFKVGE 282 levkG.sevveiqteGLNGqLPNLDlfNgIaqiimakegllvsl...... + G+ + v e+ NGq PNLDl+N + + g++v ++ ++ BAC01259.1 283 TRKYGdRDSVTMYAEASNGQMPNLDLLNVVHYLAVHRQGFRVNVetfnsl 332 ...............qGklrhqdrssds........aflsrlkvlllmll +++ + + +++G l ++ +++ + + + + + + l+ ++ BAC01259.1 333 lssswlrviaevfqnLGSLLRK--INPDwkldvtvpDYVEGTANLASSMY 380 sqAlssvtgpHglfprYgidaLTvrsnRlklh......daskvfPhsdlP qAl+++tg+Hg f Y +da ++ ++++ +++++ s+ + l BAC01259.1 381 NQALGVPTGSHGAFRDYQVDAVSLEF--APAFhlknenAKSS----FLL- 423 vafGqaiEgifRsLNNLLErlHQSFFfYllvdll<-* + G++ Eg+ Rs NNLLE++HQSFF+Y+l ++ BAC01259.1 424 -RGGRLTEGVVRSVNNLLEKFHQSFFLYFLTAPS 456 // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /local/index/hmmer/own-hmm-f.lib Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- gaa1-glob 97.5 3.7e-27 4 gaa1-glob-1-short 73.3 1.3e-21 2 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- gaa1-glob 1/4 49 66 .. 1 21 [. 4.6 1.5 gaa1-glob 2/4 149 229 .. 87 174 .. 21.4 2.3e-05 gaa1-glob 3/4 297 320 .. 223 247 .. 8.2 0.14 gaa1-glob-1-short 1/2 297 320 .. 21 45 .. 10.5 0.03 gaa1-glob-1-short 2/2 375 452 .. 72 152 .] 64.8 5.3e-19 gaa1-glob 4/4 379 456 .. 278 357 .] 66.0 3.9e-18 Alignments of top-scoring domains: gaa1-glob: domain 1 of 4, from 49 to 66: score 4.6, E = 1.5 *->dertrrtYiSENALgPglVys<-* tY+SENAL Pg +++ BAC01259.1 49 ---AKNTYLSENALIPGSANT 66 gaa1-glob: domain 2 of 4, from 149 to 229: score 21.4, E = 2.3e-05 *->GeNvYGilRAPRgdgTEalvlaVpygnsdkeynDisasVsLllALad G N Gi+RAPRgdg Ea+vl+ py + n++ s +L+ + BAC01259.1 149 GINTVGIIRAPRGDGKEAIVLVTPYNSQKVTPNEL-LSLALGFSVFS 194 yfrrqkyWAKDIifvfvdGGEKNDSiEkpalGleAwLeaYh<-* +r ++KDI+ + DS + + ++wL Yh BAC01259.1 195 LLSRAAWLSKDIVWLSA------DSQFGEYSAVSSWLNQYH 229 gaa1-glob: domain 3 of 4, from 297 to 320: score 8.2, E = 0.14 *->eGLNGqLPNLDlfNgIaqiimakeg<-* e+ NGq PNLDl+N + ++ + + BAC01259.1 297 EASNGQMPNLDLLN-VVHYLAVHRQ 320 gaa1-glob-1-short: domain 1 of 2, from 297 to 320: score 10.5, E = 0.03 *->eGLNGqLPNLDlfNgIaqiimakeg<-* e+ NGq PNLDl+N + ++ + + BAC01259.1 297 EASNGQMPNLDLLN-VVHYLAVHRQ 320 gaa1-glob-1-short: domain 2 of 2, from 375 to 452: score 64.8, E = 5.3e-19 *->lllmllsqAlstvtgpHglfprYgidaLTvrsnRlklh......das l+ ++ qAl+++tg+Hg f Y +da ++ ++++ +++++ s BAC01259.1 375 LASSMYNQALGVPTGSHGAFRDYQVDAVSLEF--APAFhlknenAKS 419 svfPhsdlPvafGqaiEgifRsLNNLLErlHHQSFFfYll<-* s + l + G++ Eg+ Rs NNLLE++H QSFF+Y+l BAC01259.1 420 S----FLL--RGGRLTEGVVRSVNNLLEKFH-QSFFLYFL 452 gaa1-glob: domain 4 of 4, from 379 to 456: score 66.0, E = 3.9e-18 *->llsqAlssvtgpHglfprYgidaLTvrsnRlklh......daskvfP ++ qAl+++tg+Hg f Y +da ++ ++++ +++++ s+ BAC01259.1 379 MYNQALGVPTGSHGAFRDYQVDAVSLEF--APAFhlknenAKSS--- 420 hsdlPvafGqaiEgifRsLNNLLErlHQSFFfYllvdll<-* + l + G++ Eg+ Rs NNLLE++HQSFF+Y+l ++ BAC01259.1 421 -FLL--RGGRLTEGVVRSVNNLLEKFHQSFFLYFLTAPS 456 // ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ L. Aravind's signalling DB+ PSSM from other authors IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. (718 letters) Searching..................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value DHHC Novel zinc finger domain with DHHC signature 31 0.004 14-3-3 14-3-3 protein alpha Helical domain 23 1.7 PH Pleckstrin homology domain (lipid and protein interactio... 22 3.6 INSL Insulinase like Metallo protease domain 21 3.7 CALMO Calmodulin like EF-hand domains 21 4.8 VWA Von Willebrand factor A domain 21 4.8 ARM Armadillo repeat 21 6.0 ACET NH2 acetyltransferase domain 21 6.2 ACYC Adenylyl/Guanylyl cyclase domain 21 6.3 LRR Leucine rich repeats 20 9.7 >DHHC Novel zinc finger domain with DHHC signature Length = 217 Score = 31.4 bits (70), Expect = 0.004 Identities = 17/74 (22%), Positives = 17/74 (22%), Gaps = 12/74 (16%) Query: 509 GGSWKWLKSARVLLIIQF------WAVLVSLLPYYISQIPGAMPIQYAVIWAVLSITILI 562 Sbjct: 43 GWSWPPHPLQIVAWLLYLFFAVIGFGILVPLLPHH------WVPAGYACMGAIFAGHLVV 96 Query: 563 ILYAMFGSPSRAGV 576 Sbjct: 97 HLTAVSIDPADDNV 110 Score = 20.1 bits (41), Expect = 9.0 Identities = 3/28 (10%), Positives = 3/28 (10%) Query: 446 SFFLYFLTAPSKFISVGVYMIPFALLLA 473 Sbjct: 66 GFGILVPLLPHHWVPAGYACMGAIFAGH 93 >14-3-3 14-3-3 protein alpha Helical domain Length = 270 Score = 22.6 bits (48), Expect = 1.7 Identities = 8/10 (80%), Positives = 8/10 (80%) Query: 184 LSLALGFSVF 193 Sbjct: 173 LGLALNFSVF 182 >PH Pleckstrin homology domain (lipid and protein interaction domain) Length = 138 Score = 21.5 bits (45), Expect = 3.6 Identities = 4/44 (9%), Positives = 4/44 (9%), Gaps = 3/44 (6%) Query: 109 GAEVYYHEFLPDSKCFHPLKFFTSMTNNMAAKPNGTYTNFGINT 152 Sbjct: 62 NLSIREVDDPRKPNCF---ELYIPNNKGQLIKACKTEADGRVVE 102 >INSL Insulinase like Metallo protease domain Length = 433 Score = 21.4 bits (45), Expect = 3.7 Identities = 11/81 (13%), Positives = 11/81 (13%), Gaps = 6/81 (7%) Query: 331 SLLSSSWLRVIAEVFQNLGSLLRKINPDW------KLDVTVPDYVEGTANLASSMYNQAL 384 Sbjct: 324 TFPPENYEKVKKRVFELLKETYENLTDEQVEEAKSRIINSRLFEEERVENDAFDIGYSYT 383 Query: 385 GVPTGSHGAFRDYQVDAVSLE 405 Sbjct: 384 VVRDLDFYRFFDKNLSRVRRV 404 >CALMO Calmodulin like EF-hand domains Length = 147 Score = 21.2 bits (44), Expect = 4.8 Identities = 14/98 (14%), Positives = 14/98 (14%), Gaps = 9/98 (9%) Query: 254 PDGTAEKAELMAFKRAGT--MAAALIFKVGETRKYGDRDSVTMYAEASNGQMPNLDLLNV 311 Sbjct: 24 NNGSISSSELATVMRSLGLSPSEAEVNDLMNEIDVDGNHQIE-FSEFLALMSRQLKSNDS 82 Query: 312 VHYLAVHRQGFRVNVETFNSLLSSSWLRVIAEVFQNLG 349 Sbjct: 83 EQE---LLEAFKVFDKNGDGLISAAELK---HVLTSIG 114 >VWA Von Willebrand factor A domain Length = 255 Score = 21.0 bits (44), Expect = 4.8 Identities = 2/37 (5%), Positives = 2/37 (5%), Gaps = 2/37 (5%) Query: 142 NGTYTNFGINTV--GIIRAPRGDGKEAIVLVTPYNSQ 176 Sbjct: 136 GNPSLQNALEMARGLLLPVPAHCTREVLIVFGSLSTT 172 >ARM Armadillo repeat Length = 532 Score = 21.0 bits (44), Expect = 6.0 Identities = 7/31 (22%), Positives = 7/31 (22%), Gaps = 5/31 (16%) Query: 529 VLVSLLPYYISQIPGAMPIQYAVIWAVLSIT 559 Sbjct: 330 CLANLL-----TQNHKKSIKKEACWTISNIT 355 >ACET NH2 acetyltransferase domain Length = 173 Score = 20.8 bits (43), Expect = 6.2 Identities = 15/100 (15%), Positives = 15/100 (15%), Gaps = 12/100 (12%) Query: 87 IGESRGGTTEIPKFI---AQQTKNLGAEVYYH--EFLPDSKCFHPLKFFTSMTNNMAAKP 141 Sbjct: 75 NRPGPASHVANASFATHPDARGHGIARELVGHAKDWARAQGFRA-MQFNFVVSTNADA-- 131 Query: 142 NGTYTNFGINTVGIIRA----PRGDGKEAIVLVTPYNSQK 177 Sbjct: 132 VHSWQKAGFDIVGRLPAAFLHPRHGYVDALVMFHDLTEGK 171 >ACYC Adenylyl/Guanylyl cyclase domain Length = 244 Score = 20.6 bits (43), Expect = 6.3 Identities = 3/25 (12%), Positives = 3/25 (12%), Gaps = 2/25 (8%) Query: 367 DYVEGTANLASSMYNQALGVPTGSH 391 Sbjct: 158 TAIGDTVNAAFRL--ESATKQAHFD 180 >LRR Leucine rich repeats Length = 339 Score = 20.3 bits (42), Expect = 9.7 Identities = 10/37 (27%), Positives = 10/37 (27%) Query: 277 IFKVGETRKYGDRDSVTMYAEASNGQMPNLDLLNVVH 313 Sbjct: 171 LFKLRNLRKLNLSGNKIEKLNMTEGEWENLETLNMSH 207 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 105 Number of sequences better than 10.0: 10 Number of calls to ALIGN: 11 Length of query: 718 Total length of test sequences: 20182 Effective length of test sequences: 16435.0 Effective search space size: 11223562.9 Initial X dropoff for ALIGN: 25.0 bits Y. Wolf's SCOP PSSM IMPALA version 1.1 [20-December-1999] Reference: Alejandro A. Schaffer, Yuri I. Wolf, Chris P. Ponting, Eugene V. Koonin, L. Aravind, Stephen F. Altschul (1999), "IMPALA: Matching a Protein Sequence Against a Collection of "PSI-BLAST-Constructed Position-Specific Score Matrices", Bioinformatics 15:1000-1011. Query= BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. (718 letters) Searching.................................................done Results from profile search Score E Sequences producing significant alignments: (bits) Value gi|1076126 [3..311] beta/alpha (TIM)-barrel 26 1.6 gi|1350557 [4..271] P-loop containing nucleotide triphosphat... 26 2.7 gi|1170141 [21..455] ConA-like lectins/glucanases 25 6.2 gi|2828820 [148..443] Periplasmic binding protein-like II 24 7.1 gi|544020 [86..429] Nitrogenase iron-molybdenum protein, alp... 24 8.2 gi|128242 [2..526] Nitrogenase iron-molybdenum protein, alph... 24 9.2 gi|1717962 [19..274] Purine and uridine phosphorylases 24 9.7 >gi|1076126 [3..311] beta/alpha (TIM)-barrel Length = 309 Score = 26.4 bits (58), Expect = 1.6 Identities = 14/92 (15%), Positives = 14/92 (15%), Gaps = 9/92 (9%) Query: 76 ANRFAKGI----EAAIGESRGGTTEIPKFIAQQTKNLGAEVYYHEFLPDSKCFHPLKFFT 131 Sbjct: 217 PKNFIFGFRATPEETYGDILGYTIEDFIQLVDKIIEIGKISYLAIASWGHDIY-LNKVRS 275 Query: 132 SMTNNMAAKPNGTYTNFGIN----TVGIIRAP 159 Sbjct: 276 NTKYKGQLVNKVIYDIYKNKLPIISSGGINTP 307 >gi|1350557 [4..271] P-loop containing nucleotide triphosphate hydrolases Length = 268 Score = 25.8 bits (55), Expect = 2.7 Identities = 8/66 (12%), Positives = 8/66 (12%) Query: 237 PVNLDTKIYGANQILYKPDGTAEKAELMAFKRAGTMAAALIFKVGETRKYGDRDSVTMYA 296 Sbjct: 101 VGYARKLGVRTDDLLLSQPDTGEQALEIAEMLVRSGAIDVLVVDSVAALVPKAELEGEMG 160 Query: 297 EASNGQ 302 Sbjct: 161 DAHMGV 166 >gi|1170141 [21..455] ConA-like lectins/glucanases Length = 435 Score = 24.5 bits (53), Expect = 6.2 Identities = 10/35 (28%), Positives = 10/35 (28%) Query: 120 DSKCFHPLKFFTSMTNNMAAKPNGTYTNFGINTVG 154 Sbjct: 170 DAQCPRDVKFINGVANSEGWKPSDSDVNAGVGNLG 204 >gi|2828820 [148..443] Periplasmic binding protein-like II Length = 296 Score = 24.2 bits (52), Expect = 7.1 Identities = 10/60 (16%), Positives = 10/60 (16%) Query: 344 VFQNLGSLLRKINPDWKLDVTVPDYVEGTANLASSMYNQALGVPTGSHGAFRDYQVDAVS 403 Sbjct: 81 VYSNGGSLGEFKDGKWVPTLNKPENVEALQFMVDLIHKYKISPPNTYTEMTEEPVRLMFQ 140 >gi|544020 [86..429] Nitrogenase iron-molybdenum protein, alpha and beta chains Length = 344 Score = 24.1 bits (52), Expect = 8.2 Identities = 16/112 (14%), Positives = 16/112 (14%), Gaps = 16/112 (14%) Query: 52 TYLSENALIPGSANTLFSTEDVQEANR-FAK--GIEAAIGESRGGTTEIPKFIAQQTKNL 108 Sbjct: 146 MYFEKEFGMPYISTIPMGAVDMAECIRQIQRYVNTLAHISSSK--EVDYEPYIDGQTRFV 203 Query: 109 GAEVY-----YHEFLPDSKCF------HPLKFFTSMTNNMAAKPNGTYTNFG 149 Sbjct: 204 SQAAWFSRSIDCQNLTGKETVVFGDATHAASITKIPAREMGIRVSYTGTYCE 255 >gi|128242 [2..526] Nitrogenase iron-molybdenum protein, alpha and beta chains Length = 525 Score = 23.7 bits (51), Expect = 9.2 Identities = 11/106 (10%), Positives = 11/106 (10%), Gaps = 10/106 (9%) Query: 52 TYLSENALIPGSANTLFSTEDVQEANRFAKGIEAAIGESRGGTTEIPKFIAQQTKNLGAE 111 Sbjct: 270 EMMETKYGIPWIKCNFIGVDGIVE---TLRDMAKCFDDP-ELTKRTEEVIAEEIAAIQDD 325 Query: 112 V-YYHEFLPDSKCF---HPLKFF--TSMTNNMAAKPNGTYTNFGIN 151 Sbjct: 326 LDYFKEKLQGKTACLYVGGSRSHTYMNMLKSFGVDSLVAGFEFAHR 371 >gi|1717962 [19..274] Purine and uridine phosphorylases Length = 256 Score = 23.7 bits (51), Expect = 9.7 Identities = 10/79 (12%), Positives = 10/79 (12%), Gaps = 4/79 (5%) Query: 347 NLGSLLRKINPDWKLDVTVPDYVEGTANLASSMYNQALGVPTGSHGAFRDYQVDAVSLEF 406 Sbjct: 151 RYDTSGRERNSRSHIVRTLSILYQQQTYDTYSRVERFK----GSMEEWQAMGVMNYEMES 206 Query: 407 APAFHLKNENAKSSFLLRG 425 Sbjct: 207 ATVLTMCASQGLRAGMVAG 225 Underlying Matrix: BLOSUM62 Number of sequences tested against query: 1187 Number of sequences better than 10.0: 7 Number of calls to ALIGN: 7 Length of query: 718 Total length of test sequences: 256703 Effective length of test sequences: 207231.0 Effective search space size: 140243779.7 Initial X dropoff for ALIGN: 25.0 bits ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ calculation of internal repeats with prospero ***** PROSPERO v1.3 Tue Jan 28 11:29:13 2003 ***** Copyright 2000, Richard Mott, Wellcome Trust Centre for Human Genetics, University of Oxford For help see http://www.well.ox.ac.uk/ariadne For usage use -help using gap penalty 11+1k using matrix BLOSUM62 printing all alignments with eval < 0.100000 using sequence1 BAC01259.1 using self-comparison > 1 BAC01259.1 len 718 from 509 to 604 vs BAC01259.1 len 718 from 575 to 650 score 53 eval 2.097396e-02 identity 31.58% K 2.933681e-02 L 2.543004e-01 H 1.220680e+00 alpha 8.609429e-02 509 GGSWKWLKSARVLLIIQFWAVLVSLLPYYISQIPGAMPIQYAVIWAVLSITILIILYAMF 568 BAC01259.1 | || || | :: | |:|:: : :|: || :::| :| 575 GVEWKLLK-ATMITSITIGMGLMSIINFATAQL-GA---------------LILIPMCLF 617 BAC01259.1 569 GSPSRAGVEWKLLKATMITSITIGMGLMSIINFATA 604 BAC01259.1 | || :| | |:: : | |:::: | | 618 SRPLRAQLEMNFLPRTVLLASNI---LLTVLGFPPA 650 BAC01259.1 ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ ~~~~~ TIGRFAM WARNING: tigrfam is not part of the automated update hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/tigrfam/tigrfam.hmm Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- TIGR00183 prok_nadp_idh: isocitrate dehydrogenase (NA -0.4 30 1 TIGR00085 secA: preprotein translocase, SecA subunit -2.3 43 1 TIGR00847 ccoS: cbb3-type cytochrome oxidase maturati -17.6 72 1 TIGR00023 TIGR00023: conserved hypothetical protein T -120.1 14 1 TIGR00916 2A0604s01: protein-export membrane protein -136.2 58 1 TIGR00893 2A0114: d-galactonate transporter -237.4 53 1 TIGR00886 2A0108: nitrate transporter -254.5 36 1 TIGR00901 2A0125: AmpG-related permease -263.3 66 1 TIGR00786 dctM: TRAP dicarboxylate transporter- DctM -334.7 76 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- TIGR00847 1/1 459 506 .. 1 53 [] -17.6 72 TIGR00085 1/1 560 572 .. 1 13 [. -2.3 43 TIGR00916 1/1 492 609 .. 1 247 [] -136.2 58 TIGR00901 1/1 321 612 .. 1 531 [] -263.3 66 TIGR00886 1/1 337 623 .. 1 472 [] -254.5 36 TIGR00893 1/1 308 677 .. 1 427 [] -237.4 53 TIGR00786 1/1 383 695 .. 1 421 [] -334.7 76 TIGR00183 1/1 631 695 .. 417 483 .] -0.4 30 TIGR00023 1/1 553 718 .] 1 243 [] -120.1 14 Alignments of top-scoring domains: TIGR00847: domain 1 of 1, from 459 to 506: score -17.6, E = 72 *->MeiLtiLiPiSllLGgvGLvAFLWSlkSGQYDDleGaaeRILdkgda ++ +++iP +llL+ + vA ++ S +G +e k+++ BAC01259.1 459 ISVGVYMIPFALLLAPLPIVAAALAGGSK----TKGKLEDEC-KTKG 500 ddepkq<-* + q BAC01259.1 501 NADDLQ 506 TIGR00085: domain 1 of 1, from 560 to 572: score -2.3, E = 43 *->lkaiLkkifGspn<-* ++ iL ++fGsp+ BAC01259.1 560 ILIILYAMFGSPS 572 TIGR00916: domain 1 of 1, from 492 to 609: score -136.2, E = 58 *->LdeeGakiFadfTakniGtqkreslaivldnakvisapvvgeaiqpi L +e + na + + + BAC01259.1 492 LEDE---------------------CKTKGNADDLQMEGGS------ 511 tGGsgqItGnFtieeAqdLAllLRsGaLPapikileertiGPslGaelir ++ BAC01259.1 512 ---------------------------------------------WKWLK 516 aGilAlliaLvlVllYmllrY..ewrgai....AaiallvhDLvililav ++ + l+i + +Vl+ +l +Y ++ ga + + A+i ++++ ++++ + BAC01259.1 517 SARVLLIIQFWAVLVSLLPYYisQIPGAMpiqyAVIWAVLS--ITILIIL 564 lslfgGatltLpgIAGllliIGySVddtVVIFdRiREelrdiKykgrtlr +++fg +p+ AG E BAC01259.1 565 YAMFG-----SPSRAGV-------------------EWKL---------- 580 eainlgfnqtLsriidTnvTTLlAalaLfvfGgGaikkGFAltlliGvia L ++++T++T + ++++ f++ +++ BAC01259.1 581 ----------LKATMITSITIGMGLMSIINFATAQLG------------- 607 gtySsI<-* ++ BAC01259.1 608 ----AL 609 TIGR00901: domain 1 of 1, from 321 to 612: score -263.3, E = 66 *->GlPlmL.vgntL..pvWLrsknVslktIGffSlvglPYs.lKfLWSP G+ + ++++n+L ++ WLr+ ++ + + Sl+ ++K BAC01259.1 321 GFRVNVeTFNSLlsSSWLRV--IAEVFQNLGSLLRKINPdWK----- 360 llDtyylpflGkllGRRrSWlvlTQvlLlllLlilSfldPrllskTDDrs lD + ++ + L++ ++ ++ +P + BAC01259.1 361 -LDVTVPDYVE------------GTANLASSMYNQALGVP--------TG 389 tdLpllaglafLiaFfSATQDIvlDAwrleiLsdeelGygstiyivGYRv ++ A D +DA +le +++ l ++ ++ +R BAC01259.1 390 SH--------------GAFRDYQVDAVSLEFAPAFHLKNENAKSSFLLRG 425 GmLLagslaLv..LasalfaNkylRfiPsneglvtlwgviFvwtalllLp G L g+ v++L +++ + +++++ta p BAC01259.1 426 GRLTEGVVRSVnnLLEKFHQ----------------SFFLYFLTA----P 455 gllvtLFLakEpqedvsvpaktyaiadmnqLLSVLLLLiLLiSLPAmitA + + + ++ +p BAC01259.1 456 SKFISV-------GVYMIP------------------------------- 467 LLDRAWPRAGLYALLmGiCLSPWGRqqiRPVRELLAtVRRPLLVAARGKE BAC01259.1 - -------------------------------------------------- - VPLFDFlknAVSVmVLiiLLVtVtAmCRAYYSGAWPRGtLFkliiknyln BAC01259.1 - -------------------------------------------------- - yfkrlleQavlkplkeFfqrknviNLAYqalllLllivLYKLGDsa.... + ++++ + +a l + + +KL D ++++ BAC01259.1 468 -------------FALLLAPLPIV----AAALAGGSKTKGKLEDECktkg 500 .atvLtt....lFlidGmGfskeeiAlVaKvlglLgailGgliGKrGilm +a+ L+ ++++ + + s+ ++ l++ + ++L ++l+ +i+ ++ BAC01259.1 501 nADDLQMeggsWKWLK----SARVL-LIIQFWAVLVSLLPYYIS---QIP 542 qrlnifyalllfGivqaltnalfvwLasnGhhdaitPSHDViSLVmALek i ya + +v+ +t+++ ++ a +G PS + BAC01259.1 543 GAMPIQYAVIW--AVLSITILI-ILYAMFG-----SPSR---------AG 575 ellmlfltitleavtgGlgtvAfvAFlsklsnpkFgATQyaLLsslSal< + ++l+ + ++t G+g +++ F AT a L +l ++ BAC01259.1 576 VEWKLLKATMITSITIGMGLMSIINF----------AT--AQLGALILI 612 -* BAC01259.1 - - TIGR00886: domain 1 of 1, from 337 to 623: score -254.5, E = 36 *->RnLffSwfgFflsFlvWfafspLavqiikGkddlgLStaQlgnlvav ++ ++++ v ++++L+ i+ +d+ L+ v v BAC01259.1 337 ---WLRVIAE-----VFQNLGSLLRKIN---PDWKLD-------VTV 365 pvlagavlRiilGfLvDkfGPRktttlsllllaIPcllaglavqdpstsy p + ++ + ++ ++ G +P+ g + +y BAC01259.1 366 PDYVEGTANLASSMYNQALG-------------VPTGSHGAFR-----DY 397 svLlllrlfiGiaGgsFascmpwiSfFFPkkiqGtAlGLaAGwGNmGggv +v a+++ ++ +F k++ + L +G + gv BAC01259.1 398 QV--------------DAVSLEFAPAFHLKNENAKSSFLLR-GGRLTEGV 432 aqfvmPAlfaiiaslikgafgGlPqtFdahlawgwafvivpalilllial + v l+ + ++++ +f+ P F +++ +++ p+ +ll+++ BAC01259.1 433 VRSVNN-LLEKFHQSFFLYFLTAPSKF-----ISVGVYMIPFALLLAPLP 476 liffvvadtppgkWSeRhksARGlevttlkddyfllimnlpataklsvks ++++++a + k ++ ++++ +l+ ++ BAC01259.1 477 IVAAALAGGSKTK-----------GKLEDECKTKGNADDLQMEGGSW--- 512 arlskiaflvaalkvvvepllgtealatyniwvlnaiieslslkeqlkvf k + BAC01259.1 513 -----------------------------------------------KWL 515 rdkhTWilallYsvTFGsFlgvssifamvFllfkdqfglskvqAGayasl +++++ + + +v ++s ++ ++ ++g +++ +ya + BAC01259.1 516 KSARVLLIIQFWAV-------LVSLLPY---YISQIPG---AMPIQYAVI 552 ggllGllaRPlGGllSDrlGgrfgtraRkllmsflgvamga...alvvlg + l + ++ +++l ++ g +++a v + BAC01259.1 553 WAVL----------------S-------ITILIILYAMFGSpsrAGVEWK 579 lvgpgssgsLavfivlfvalfffvgaGnGstFalvPhifrktaskvkaeG l + ++ s++++++l++ ++f + + + + l+P++ BAC01259.1 580 LLKATMITSITIGMGLMSIINFATAQLGALI--LIPMCLF---------- 617 gdeeAmrnarratGavsGliGAgGnlGG<-* + l++ BAC01259.1 618 ----------------------SRPLRA 623 TIGR00893: domain 1 of 1, from 308 to 677: score -237.4, E = 53 *->LvtvinYLDRanlSfAaptGlqeDLGlsaaqygyvfsAFsigYvlgq L+ v+ YL ++ ++ + + ++ s+ s+ v+ + BAC01259.1 308 LLNVVHYLA--------VH--RQGFRVNVETFNSLLSS-SWLRVIAE 343 fPggl..lLDRi.GHarktlavaiVlWgvftllqafaggFstvtayvsly + ++l++lL++i++ ++ ++v + g + l+ + + + + l BAC01259.1 344 VFQNLgsLLRKInPDWKLDVTVPDYVEGTANLASS-M-------YNQALG 385 iLRvLlGaAGLEAplfPgiikivasWFPakeRatavsifnsaqylggiig + + Ga+ + + a F k + +s+ + + +l ++ BAC01259.1 386 VPTGSHGAF--RDYQVDAVSLEFAPAFHLKNENAKSSFLLRGGRLTEGVV 433 gPlvgwilvhfssmgmkGWqwvFi.ieGilgiilgvlwl...kfikdkpq + + +l++f+ q+ F+ + +++ +++v ++ ++ + p BAC01259.1 434 RSVNN-LLEKFH-------QSFFLyFLTAPSKFISVGVYmipFALLLAPL 475 kakwlteeekyivvggllaeqddskgkgpsepKkyq..ikelLkdrrvwg +++ ++ +g l++ + k kg+ + + ++ ++ Lk rv++ BAC01259.1 476 -PIVAAALAGGSKTKGKLED--ECKTKGNADDLQMEggSWKWLKSARVLL 522 lalgqflvnillyffltWfPtyLkqerglsileaGflaslPgivgfvGmv + +++ + +ll P y+ q g +++ +++++++ i ++ ++ BAC01259.1 523 IIQFWAVLVSLL-------PYYISQIPGAMPIQYAVIWAVLSITILIILY 565 L.gGilSDlllrrgarFWkslvfArktaiiaglvlsllmfatnslvnGmp + +G S ++++ l+ A+ + i+ + lm ++n ++ + BAC01259.1 566 AmFGSPSRAGVEWK------LLKATMITSIT--IGMGLMSIIN-FAT--A 604 pwaalalvaLaffgaglLgailwaviSdvlpgniagltgGlinslGYnlg ++al l+ f+ ++ + + ++lp+ +++ l + lG BAC01259.1 605 QLGALILIPMCLFSR----PLRAQLEMNFLPRTVLLASNILLTVLG---- 646 givgPiviGyiaattGsfagallvvaalaligalsvLllV<-* + ++ ++++ +Gs++ +++ g + ++ BAC01259.1 647 -FPPAAFLIMKGLSKGSWT--------VDIVGDFWLWMEF 677 TIGR00786: domain 1 of 1, from 383 to 695: score -334.7, E = 76 *->GvPVafaLgisallglmflldrgdvdti.al.AmlqklfggtdsFtL aLg+ +++ +f ++d++++++++A k ++ +sF BAC01259.1 383 ------ALGVPTGSHGAFRDYQVDAVSLeFApAFHLKNENAKSSF-- 421 LAiPFFiLAGnllnrGGlarRLlnfakalVGHlrGGLghvnvvasalFaA ll +G+l++ ++ + l+ + BAC01259.1 422 -----------LLRGGRLTEGVVRSVNNLLEKFH---------------- 444 vSGSsvAtaaAlGslliPamtkaGYPkafaaGviaaSGiIGlLIPPSIvm s ++ +t+ +k +++Gv m BAC01259.1 445 ------------QSFFLYFLTAP--SKFISVGV---------------YM 465 IiYGvasiagvSIakLFiAGilPGlLlalslMltiwfvAkrlgypraead I +++ ++P a++l++ + k + + BAC01259.1 466 IPFALL--------------LAPLPIVAAALAGGSKTKGKLEDECKTKGN 501 aEVRrkaslrq.....rlrafrks....iwaLlLPvliIGGifsGi..FT a ++++++ + l+++r++ ++wa+l v ++ ++s i+++ BAC01259.1 502 A----DDLQMEggswkWLKSARVLliiqFWAVL--VSLLPYYISQIpgAM 545 PTEAAavaavYAlilstfvyreL...tlkslcdvlleslrttsm.....V P + A++ av+++ + ++y +++++++ + + ll ++++ts++ + + BAC01259.1 546 PIQYAVIWAVLSITILIILYAMFgspSRAGVEWKLLKATMITSItigmgL 595 llIi..agatvfswlltheqiPqrladllaaaisspivfLiiinilLliv + Ii+ a+a++ ++ l+ +r + + + p+++L+++nilL + BAC01259.1 596 MSIInfATAQLGALILIPMCLFSRPLRAQLEMNFLPRTVLLASNILLTVL 645 GmfmDltpaiLiltPillPvAehlGIDPVhFGvvfvlNleiGllTPPvGt G+++ a Li+ BAC01259.1 646 GFPPA---AFLIM------------------------------------- 655 nLFvasgvakmsltTGevtrallPFLlaqflvLl...lvtyfPalslflP ++l+ G t+ ++ + + + +L + ++ ty + +lP BAC01259.1 656 ----------KGLSKGSWTVDIVGDFWLWMEFLWewsSATYLYVFLVHLP 695 <-* BAC01259.1 - - TIGR00183: domain 1 of 1, from 631 to 695: score -0.4, E = 30 *->PgsvilsgelllehlGWkeaadlikkglekaiaskvvtydfarlmdg P +v+l++ +ll lG+ aa li+kgl k + + df m+ BAC01259.1 631 PRTVLLASNILLTVLGFPPAAFLIMKGLSKGSWTVDIVGDFWLWMEF 677 kvakelkcsefgealvenld<-* e + + + + +l BAC01259.1 678 --LWEWSSATYLYVFLVHLP 695 TIGR00023: domain 1 of 1, from 553 to 718: score -120.1, E = 14 *->islLivfllliaYLi.GSIpsaylvgKilkGiDiRehGSgNpGATNv + L + l+i Y + GS + a + K+lk + + S +G BAC01259.1 553 WAVLSITILIILYAMfGSPSRAGVEWKLLK---ATMITSITIG---- 592 lRvlqskGvsnakkaAllVlifDilKGmlAvalsfllglfdllqgLvaav G+ + ++ + + +l ++ + l++ +l L BAC01259.1 593 ---M---GL--MSIINFATAQLGAL-----ILIPMCLFSRPLRAQLEM-- 627 yqkvYyLtylsciAAvLGHifPiFfkFkGGKgVATsgGslllislwlfli + F+ +ll+ +++l++ BAC01259.1 628 -N---------------------FLPRT----------VLLASNILLTVL 645 ml..avWllvtlltkyvSLsSivGvGtalvlafyvlwlklpylyfFksdP ++++a +l+ l+k G+ +v ++ +wl+ ++l BAC01259.1 646 GFppAAFLIMKGLSK----------GSWTVDIVGDFWLWMEFL------- 678 lkaIlyqngwyipvtlllwYWPLTilviyrHraNIqRLLr...gtEpKvt +y+ v+l+ L + ++ I+ LL++ E+K + BAC01259.1 679 ---WEWSSATYLYVFLVH----LPCWLLC-----IHVLLHpcyQPESKMK 716 qk<-* q BAC01259.1 717 QE 718 // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/tigrfam/tigrfam.hmm-f Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- TIGR00899 2A0120: Sugar Efflux Transporter 4.3 1.7 1 TIGR01103 fliP_bact: flagellar biosynthetic protein F 0.9 44 1 TIGR00710 efflux_Bcr_CflA: drug resistance transporte -0.1 66 1 TIGR00183 prok_nadp_idh: isocitrate dehydrogenase (NA -0.4 35 1 TIGR00346 azlC: branched-chain amino acid transporter -0.6 79 1 TIGR00400 mgtE: Mg2+ transporter (mgtE) -0.6 39 1 TIGR00123 cbiM: cobalamin biosynthesis protein CbiM -0.7 88 1 TIGR00085 secA: preprotein translocase, SecA subunit -2.3 39 1 TIGR00728 OPT_sfam: oligopeptide transporters, OPT su -2.5 85 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- TIGR00899 1/1 30 59 .. 161 190 .. 4.3 1.7 TIGR00710 1/1 520 539 .. 381 400 .] -0.1 66 TIGR00123 1/1 538 562 .. 1 25 [. -0.7 88 TIGR00085 1/1 560 572 .. 1 13 [. -2.3 39 TIGR00728 1/1 571 606 .. 750 787 .] -2.5 85 TIGR00400 1/1 580 609 .. 431 460 .] -0.6 39 TIGR00346 1/1 577 626 .. 173 222 .] -0.6 79 TIGR01103 1/1 641 655 .. 1 15 [. 0.9 44 TIGR00183 1/1 631 695 .. 417 483 .] -0.4 35 Alignments of top-scoring domains: TIGR00899: domain 1 of 1, from 30 to 59: score 4.3, E = 1.7 *->aAvafvlcgvLvwllLPsvprgapgatttl<-* ++ + ++g+++ l+LPs++++ +++++l BAC01259.1 30 FSAVCCTAGIIALLFLPSLAKNTYLSENAL 59 TIGR00710: domain 1 of 1, from 520 to 539: score -0.1, E = 66 *->mvlsclvlavisvLafyyls<-* ++l++ + av ++L+ yy+s BAC01259.1 520 VLLIIQFWAVLVSLLPYYIS 539 TIGR00123: domain 1 of 1, from 538 to 562: score -0.7, E = 88 *->lhimeGflPpeWcllWwllslpvlv<-* + + G +P +++++W +ls+ +l+ BAC01259.1 538 ISQIPGAMPIQYAVIWAVLSITILI 562 TIGR00085: domain 1 of 1, from 560 to 572: score -2.3, E = 39 *->lkaiLkkifGspn<-* ++ iL ++fGsp+ BAC01259.1 560 ILIILYAMFGSPS 572 TIGR00728: domain 1 of 1, from 571 to 606: score -2.5, E = 85 *->nkrawwkwekynyvlaagldaGealagvliaaflclgl<-* + ra +w+ + + + + +G +l+++++ f ++l BAC01259.1 571 PSRAGVEWKLLKATMITSITIGMGLMSIIN--FATAQL 606 TIGR00400: domain 1 of 1, from 580 to 609: score -0.6, E = 39 *->llsgPlittladvlglliyfgiakwllgsl<-* ll + +it++ +gl+ ++ a++ lg+l BAC01259.1 580 LLKATMITSITIGMGLMSIINFATAQLGAL 609 TIGR00346: domain 1 of 1, from 577 to 626: score -0.6, E = 79 *->qwlkgknhksallGlglavvclllfGgeyfllpallgillvltllrk +w+ k + + +g+ +++++ f + ++ l+ + l lr BAC01259.1 577 EWKLLKATMITSITIGMGLMSIINFATAQLGALILIPMCLFSRPLRA 623 ple<-* +le BAC01259.1 624 QLE 626 TIGR01103: domain 1 of 1, from 641 to 655: score 0.9, E = 44 *->LLTvLSlaPaILilM<-* LLTvL + Pa+ + M BAC01259.1 641 LLTVLGFPPAAFLIM 655 TIGR00183: domain 1 of 1, from 631 to 695: score -0.4, E = 35 *->PgsvilsgelllehlGWkeaadlikkglekaiaskvvtydfarlmdg P +v+l++ +ll lG+ aa li+kgl k + + df m+ BAC01259.1 631 PRTVLLASNILLTVLGFPPAAFLIMKGLSKGSWTVDIVGDFWLWMEF 677 kvakelkcsefgealvenld<-* e + + + + +l BAC01259.1 678 --LWEWSSATYLYVFLVHLP 695 // SMART WARNING: smart is not part of the automated update hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/iprscan/data/smart.HMMs Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- [no hits above thresholds] Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- [no hits above thresholds] Alignments of top-scoring domains: [no hits above thresholds] // COG WARNING: cogs is not part of the automated update hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/cogs/cogs.hmm Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- COG3094 -67.5 42 1 COG0239 -76.9 85 1 COG0842 -97.3 60 1 COG2245 -101.9 52 1 COG1280 -110.4 59 1 COG0624 -122.9 87 1 COG1808 -124.9 16 1 COG0670 -132.8 84 1 COG2386 -136.4 38 1 COG3366 -176.7 70 1 COG0628 -189.6 91 1 COG1270 -194.5 27 1 COG1175 -198.4 48 1 COG1177 -199.9 42 1 COG0395 -206.0 64 1 COG2244 -211.8 57 1 COG0609 -226.7 62 1 COG0534 -253.0 24 1 COG1007 -329.6 32 1 COG1178 -338.6 91 1 COG0201 -339.7 84 1 COG2233 -353.2 40 1 COG1593 -377.6 29 1 COG1955 -389.1 60 1 COG2056 -418.3 57 1 COG1113 -423.7 43 1 COG1115 -431.2 57 1 COG0370 -532.2 97 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- COG0624 1/1 71 439 .. 1 433 [] -122.9 87 COG0201 1/1 335 597 .. 1 540 [] -339.7 84 COG1280 1/1 441 606 .. 1 214 [] -110.4 59 COG0370 1/1 254 612 .. 1 777 [] -532.2 97 COG0628 1/1 257 623 .. 1 381 [] -189.6 91 COG2386 1/1 400 626 .. 1 238 [] -136.4 38 COG1177 1/1 454 627 .. 1 286 [] -199.9 42 COG1115 1/1 445 631 .. 1 507 [] -431.2 57 COG2233 1/1 265 633 .. 1 509 [] -353.2 40 COG3366 1/1 368 646 .. 1 327 [] -176.7 70 COG3094 1/1 524 651 .. 1 139 [] -67.5 42 COG1593 1/1 119 651 .. 1 638 [] -377.6 29 COG2056 1/1 322 653 .. 1 463 [] -418.3 57 COG0609 1/1 303 661 .. 1 346 [] -226.7 62 COG0842 1/1 303 662 .. 1 365 [] -97.3 60 COG1270 1/1 451 662 .. 1 342 [] -194.5 27 COG0395 1/1 449 662 .. 1 304 [] -206.0 64 COG0239 1/1 554 663 .. 1 134 [] -76.9 85 COG1808 1/1 280 664 .. 1 402 [] -124.9 16 COG1178 1/1 168 665 .. 1 615 [] -338.6 91 COG1175 1/1 453 665 .. 1 314 [] -198.4 48 COG1113 1/1 379 677 .. 1 481 [] -423.7 43 COG0670 1/1 497 685 .. 1 262 [] -132.8 84 COG2245 1/1 498 687 .. 1 197 [] -101.9 52 COG0534 1/1 326 693 .. 1 482 [] -253.0 24 COG2244 1/1 219 694 .. 1 510 [] -211.8 57 COG1955 1/1 294 713 .. 1 612 [] -389.1 60 COG1007 1/1 296 716 .. 1 513 [] -329.6 32 Alignments of top-scoring domains: COG0624: domain 1 of 1, from 71 to 439: score -122.9, E = 87 *->eevlelLkeLisipsvsgpkeagdpevleaaeylkelledlGfevei e+v e + i++ g++ +g + e+ ++++ + + lG ev+ BAC01259.1 71 EDVQEANRFAKGIEAAIGESRGG---TTEIPKFIAQQTKNLGAEVYY 114 devgngk.................................nlvalrggeg +e+ + + ++ + ++ +++ +++++ ++ + ++ + + a rg+ BAC01259.1 115 HEFLPDSkcfhplkfftsmtnnmaakpngtytnfgintvgIIRAPRGD-- 162 pgpptslllagHiDVVPpggeggelWstdPFeltederdgGrlYGRGaaD g++ ++l V P ++ +t+ l + G+ BAC01259.1 163 -GKEA-IVL-----VTPYN----SQKVTPNELLS--LAL-GF-------- 190 mKGglaamllAleallaggpelpgnvillfvsDEEggsalrldalGarhl +++++l + +l + i+ + +D g +++++ ++ ++ BAC01259.1 191 ---SVFSLLSRAAWLSK--------DIVWLSADSQFGEYSAVSSWLNQYH 229 aerladrflrpdyvivgEpptglqggrgiakqievpdvdlhkGrrlngdr + + + d i g +l++++g +k++ BAC01259.1 230 NPMFLSHPVNLDTKIYGA-NQILYKPDGT----------AEKAE------ 262 gilv.........klevkGk...............qgHvstpe.lgrNai +++++ ++ ++ G++++ +++++ + +++++++p+ N + BAC01259.1 263 LMAFkragtmaaaLIFKVGEtrkygdrdsvtmyaeASNGQMPNlDLLNVV 312 ekaaealgelaelladagsvaneilrpgpgaegfdsllripspltvtvig +++a +++ +++ + + t+++ BAC01259.1 313 HYLAVHRQ-----GFRVN--------------------------VETFNS 331 iggG.avNviPnycegeaevkfdiRllpgedleevleelrall.....ev + ++++ vi+ ev + l ll++ ++++ BAC01259.1 332 LLSSsWLRVIA----------------------EVFQNLGSLLrkinpDW 359 elevl.sgapfetdpddelvvalaralaellGlepkvvtsgggtDarffa l v+ + ++++++ + + + ++lG +++ + f+ BAC01259.1 360 KLDVTvP--DYVEGTAN----LASSMYNQALGVPTGSHG--------AFR 395 rlgippavvfggggdpddgdlaHspnEyveledlekgvkvlarllerlae ++ +v ++ + H +nE+ + + l +g ++ + ++ + BAC01259.1 396 DY----QVDAVSLE---FAPAFHLKNENAKSSFLLRGGRLTEGVVRSVNN 438 q<-* BAC01259.1 439 L 439 COG0201: domain 1 of 1, from 335 to 597: score -339.7, E = 84 *->mgvkdklkpmferlpavfrpkkgghveLrrKllfTllaLilYRiGsf +++++++ +f+ l + +r + BAC01259.1 335 SSWLRVIAEVFQNLGSLLRKIN------------------------- 356 IPvPGinaaalsdffeqqrgifglfNmFsGGalsrfSifaLGImPYITAS P + ++ + d++e + ++++ + BAC01259.1 357 -PDWKLDVTV-PDYVEGTAN-------LASSMYN---------------- 381 IImQLLvgDviPsLikLdkenGeeGRrKiqqyTRylTivlalvQAlgval Q L + G G + y+ v a+ ++ a+ BAC01259.1 382 ---QALGV----------PT-GSHG--AFRDYQ-----VDAVSLEFAPAF 410 glnnlvgsgpIssavaigpgpsggvvpvlfylliilqLtaGtmllmWLGE l+n ++++s f l G BAC01259.1 411 HLKN------------ENAKSS-------FLLR---------------GG 426 qItkGrGIGNGISLiIfAGIvaglPsaifntielnngtesgpaGalprfi + t+ g+++++ n +e+ + + BAC01259.1 427 RLTE------------------GVVRSVNNLLEK-----------FHQ-- 445 qgivqlspnkdllggsvvaFllnilslllivlatlaiivgVVYveqarRr + +l +l+ BAC01259.1 446 SF--------FL----------YFLT------------------------ 453 IPIqYArRqqgvvGRrlyggGqstyLPiKlNyAGVIPvIFAiLlSalllf + K++ +GV + FA Ll l + BAC01259.1 454 -----------------APS--------KFISVGVYMIPFALLLAPLPIV 478 PstlaqffgsesPLargipiLGtyngpavglg.............gllyy + la+ + + g+ + ++++++ ++ + ++g ++ BAC01259.1 479 AAALAGGSKT--------------KGKLEDECktkgnaddlqmegGSWKW 514 iapylslsawildpldpwqqpgqpvYlalyvvlIifFtyFYteiqGfnPr + +++ v+lIi F ++ ++ + P BAC01259.1 515 LKSAR-------------------------VLLIIQF---WAVLVSLLPY 536 eiAenLkKsGgfIPGiRpdGkqTekyLervipRlTliGalfLgviAvlPe i+ +IPG p i ++ ++ l ++++ +l BAC01259.1 537 YIS--------QIPGAMP------------IQYAVIWAVLSITILIILYA 566 llgalggvpnnvsfyfGGTslLIvVGValdtmeqIeaellsrqYeglrrl ++g+ +++ ++ + a +++ + BAC01259.1 567 MFGSPSRAG--------------------VEWKLLKATMIT---SIT--I 591 kkgglk<-* + g ++ BAC01259.1 592 GMGLMS 597 COG1280: domain 1 of 1, from 441 to 606: score -110.4, E = 59 *->mmdmsfllafligslalslalsPGPdnllvlsnslkyGfraGllaaL + + ++++ ++++ a s + s ++ ++ BAC01259.1 441 EKFHQSFFLYFLT--APSKFI----------SVGVY-------MIPF 468 GlalGdavhvlLaalGla....aLlktspllFtlLkllGAaYLlYLGvqm l l+ ++ v++a++G +++++ L ++ BAC01259.1 469 ALLLAPLPIVAAALAGGSktkgKLEDEC---------------------- 496 lrskgkkleasesaasplsrs.wklflrGlltnllNPKaiLFflsllpqF + kg + ++++ + s+ +k+ + l+++ ++ +sllp + BAC01259.1 497 -KTKGNADDLQM---EGGSWKwLKSARVLLIIQ-----FWAVLVSLLPYY 537 idpqaslaslaqllvlgaiivlvdllwfsllAllgsrlarllrsnprfqr i+ ++ + q++v++a++ + +l+ + ++ + sr+++ ++ ++ BAC01259.1 538 ISQIPGAM-PIQYAVIWAVLSITILIILYAMFGSPSRAGVEWKL-LKATM 585 vlnrlaGllLigfGvklllsrl<-* + + G L+++ + +++++l BAC01259.1 586 ITSITIGMGLMSI-INFATAQL 606 COG0370: domain 1 of 1, from 254 to 612: score -532.2, E = 97 *->pvedleRrSvksmkkivekrVALvGNPNVGKTtLFNaLTGanqkVGN p+ +e +++ k BAC01259.1 254 PDGTAEKAELMAFKRAG------------------------------ 270 WPGVTVEKKEGklrykGreieivDLPGiYSLtpvSdqnSlDEkIARdfLl + + l +k + + R++ BAC01259.1 271 -------TMAAALIFK---------------VGET----------RKY-- 286 FYPneapDlivnVVDAtNLERNLyLTLQLlElg.kpmIlaLNliD..eAk ++ D + A+N g+ p + LN + + BAC01259.1 287 ----GDRDSVTMYAEASN--------------GqMPNLDLLNVVHylAVH 318 keGIrIDekaLeerLGVPVVpTvAkrGeGleeLkekIvelvekkptprri ++G r++++ + +L + + A +l+ L++kI +p + + BAC01259.1 319 RQGFRVNVETFNSLLSSSWLRVIAEVFQNLGSLLRKI------NPDWKLD 362 iprygEeieseikeleklleeqeeakkypaRwlAIkLLegDkevtelvik ++++ e+++ +++ BAC01259.1 363 VT------------VPDYVEGTANLAS----------------------- 377 savvgelkevlkelqeleekygedadllIadeRYelieeIleevvrtqge + BAC01259.1 378 ---------------------S---------------------------- 378 asrtlteklDrvlLHpvlGlPiflavMyLlFqltFsvGaplqdlldggsg + + lG+P t+s Ga+ +d++s BAC01259.1 379 ------------MYNQALGVP------------TGSHGAFRDYQVDAVSL 404 al...fsalgewvgsnlgYapewlrsl.laDGIiGGIVGaVLsFvPlIai ++ + f +e +s++ lr+ +l +G++ V+ +L + + ++ BAC01259.1 405 EFapaFHLKNENAKSSF-----LLRGGrLTEGVVRS-VNNLLEKFHQSFF 448 LFlflSfLEdSGYlaRaAFlmDriMrkfGLpGKaFIPLilGFGCNVPAIM L++ BAC01259.1 449 LYF----------------------------------------------- 451 ATRtLedeRERL.lTilviPFmSCSARLpVYvlFagaFFPsnsgalVlfg L ++ +++ +iPF +l+a+ P BAC01259.1 452 ----LTAPSKFIsVGVYMIPFA---------LLLAPL--P---------- 476 lYlLGivvALltAllLrktv.....yrGetspFimELPpYrlPslktvli +++AL + + +++++ + +G ++ + mE BAC01259.1 477 ----IVAAALAGGSKTKGKLedeckTKGNADDLQME-------------G 509 htWeRlksFlkkAGtiIlagSiLIWfLssfPpsdaaEgGsvgegpknikd +W lks ++ iI + ++L++ L ++ BAC01259.1 510 GSWKWLKS--ARVLLIIQFWAVLVSLLPYY-------------------- 537 SflgkiGkvlePiFaPlGfgqddWqatvsLitGfvAKEvVVsTLgvlYgl i+++ P + P+ + +v+ BAC01259.1 538 -----ISQI--PGAMPIQY-------------------------AVIW-- 553 eqdqeEafsgalselenalsaipetwagLlltasksaiasaltpseadge ++l++ BAC01259.1 554 ---------------------------------------AVLSI------ 558 meegeksqlgallskfftpasAlaflvFvLLYtPCiATlaaIarEsGs.w ++ ++LY + + + + E + BAC01259.1 559 -------------------------TILIILYAM-FGSPSRAGVEWKLlK 582 kwaafsvlyslvflAyvlalIayqvasiAeHPeySliwIvlLLf<-* + + s++ +++ l ++ + + q++ +l+L+ BAC01259.1 583 ATMITSITIGMG-LMSIINFATAQLG-------------ALILI 612 COG0628: domain 1 of 1, from 257 to 623: score -189.6, E = 91 *->mlmspairrllkpvvlrllvlllillllll..........flylfqp + + +++ ++ +++l++ ++++++ ++++ + ++ BAC01259.1 257 TAEKAELMAFKR------AGTMAAALIFKVgetrkygdrdSVTMYAE 297 lllplllAlvlayllnPvvrwLerklgipRplavllvllllllllallll + l l + +++ + ++++ + ++ +ll l BAC01259.1 298 ASNGQMPNLDLLNVVHYLAVHRQG-FRVN--------VETFNSLLSSSWL 338 llvptlvgidqlgqlir.........nnlPqlnnllqqllawlpnig... ++ + ++ lg+l+r+ +++ + + ++P++ ++ +l + + n+ + BAC01259.1 339 RVIAEV--FQNLGSLLRkinpdwkldVTVPDYVEGTANLASSMYNQAlgv 386 ..............llqlslyasldeliqqlisnrlaailgsilssllnl ++++++ ++ + +++ l+++++++ +++ s+ ++ +g ++ ++ ++ BAC01259.1 387 ptgshgafrdyqvdAVSLEFAPAFHLKNENAKSS-FLLRGGRLTEGVVRS 435 lgrqvknllglivslllvllllf..ffLldgeklkegilsllPsrlyrpr + nll + + +++++l+ ++ f g + ++l l P + BAC01259.1 436 VN----NLLEKFHQSFFLYFLTApsKFISVGVYMIPFALLLAP-LPIVAA 480 vkrilselnaslgnyirGqvlvaliiGvlsgigllllgvpyalllallag + + s+ + l++ +++ a + +g +l + l++ ++a BAC01259.1 481 ALAGGSKTKGKLEDECKTK-GNADDLQMEGGSWKWLKSARVLLIIQFWAV 529 llnlIP.yiGaviiwiPaliyalltggglatwggllvlivflviqqledn l +l+P yi + +++P+ ++ ++ +l+++ + +++ BAC01259.1 530 LVSLLPyYISQIPGAMPIQYAVIWAV----LSITILIILYAMFGS----- 570 vLrPklmgkrlg....LhPllillsllgGgslfGfvGlilgpPllavlka r+g + +L ++ + s+ +G++l+ ++ ++ a BAC01259.1 571 -------PSRAGvewkLLKATMITSITIGMGLMSII---------NFATA 604 lldaylrgdiaelifkklaeqlgeees<-* l a++ ++ + ++ + BAC01259.1 605 QLGALIL--------IPMCLFSRPLRA 623 COG2386: domain 1 of 1, from 400 to 626: score -136.4, E = 38 *->mDssFtmevAlkellrlmmafleiikrdLRL...efRrkaeisn... + +e A+ l+ +a +++++r RL+++ R+ +++ + + BAC01259.1 400 --DAVSLEFAPAFHLKNENAKSSFLLRGGRLtegVVRSVNNLLEkfh 444 ..pLlFfllV........itLFpLaiGPePqlLadiAPGilWvaaLLasL ++ l fl+++++ + ++ + p+a+ ++AP + vaa+La+ BAC01259.1 445 qsFFLYFLTApskfisvgVYMIPFAL--------LLAP-LPIVAAALAGG 485 LslerlFrdDfed.GslEqllLsPlplawlvlaKVvahwllmllplilis + d+ +G l wl a+V +l +l s BAC01259.1 486 SKTKGKLEDECKTkGNADDLQMEGGSWKWLKSARV---LLIIQFWAVLVS 532 PLlAilLdlevns.lpalaLtLlLGTpvLSllgA.vgsALTVgLrrgGvL L + + +++ +++ + ++L L l A +gs+ r+Gv BAC01259.1 533 LLPYYISQIPGAMpIQYAVIWAVLSITILIILYAmFGSP-----SRAGVE 577 LplLvlPliIPVLIfavgaidaAlqGlpsdgyLlilgAllalavtLspfA lL i ++++ ++ +++ + lgAl + + L BAC01259.1 578 WKLLKATMI-------TSITIGMGLMSIINFATAQLGALILIPMCL---- 616 iaAalriSvs<-* ++ lr+ ++ BAC01259.1 617 FSRPLRAQLE 626 COG1177: domain 1 of 1, from 454 to 627: score -199.9, E = 42 *->mssarsrwrlsflraflwlvlaFLYLPllilvlySFNtssklgtvwq +s ++s +++++ + +a L +Pl i+++ +++ sk+ + BAC01259.1 454 APS----KFIS-VGVYM-IPFALLLAPLPIVAA-ALAGGSKTKGKLE 493 GFT..npN.LkWYaeLfqde.dllsAaaNSLlIAvlsalvatvlGtLAAf + ++ N ++ + + + ++l +a L+I + ++lv+ BAC01259.1 494 DECktKGNaDD--LQMEGGSwKWLKSARVLLIIQFWAVLVSL-------- 533 ALwRyrfrgknlvsgllllPlvvPdIvtGvsLLllFatwtaiglqLGWpq L +y ++++P ++P + +a +i L BAC01259.1 534 -LPYY----------ISQIPGAMP---------IQYA---VIWAVLS--- 557 drgfftiilaHitFclPfVyvvisaRLqgldlsLieAAaDLGAspwqtFf +i+ ++ F+ P +R +g+ + L+ A + BAC01259.1 558 ---ITILIILYAMFGSP-------SR-AGVEWKLLKATM----------- 585 kitLPllaPgIlsGaLLAFtLSlDDFViTsFvsGVpTiGsetLPlqifsm iTs++ G BAC01259.1 586 ---------------------------ITSITIG---------------- 592 irrGvvsPeiNAlatllllvlslllviasyllgvkrlekrrer<-* +G+ iN + + l + +++ + + s+ l+ + e BAC01259.1 593 --MGL-MSIINFATAQLGALILIPMCLFSR-----PLRAQLEM 627 COG1115: domain 1 of 1, from 445 to 631: score -431.2, E = 57 *->meallsfvetlndflW.gpplivLLlGtGlyFTirlrFvQFrrLgem ++++l+f+++ + f++ g +i+ l + L+ + BAC01259.1 445 QSFFLYFLTAPSKFISvGVYMIPFALLL-------------APLPIV 478 fkllfggrksksgdseeaskggvSsFQALmtsLAarVGtGNIAGVAtAIa l gg k k g+ e+ BAC01259.1 479 AAALAGGSKTK-GKLED--------------------------------- 494 lGGPGAvFWMWvtALfGMAtkFaEstLAqkYRvk.DkddGefrGGGPaYY E+ ++k+++dd ++ GG BAC01259.1 495 -----------------------EC------KTKgNADDLQMEGG----- 510 iekGLgkGGrlkaglmRwLgvlFAifliiAfggigNmVQsNsIadalena ++wL + +++i Q+ ++ +l + BAC01259.1 511 -----S---------WKWLKSARVLLII----------QFWAVLVSLLPY 536 FGgppagvplsSakwvtGivLavLtglvifGGikRIakvasliVPfMAll + BAC01259.1 537 Y------------------------------------------------- 537 YillalvIllmNidqlPaaislIfssAFgpqaAaGGfaGatvaqAIrqGv i+q+P+a + I++ BAC01259.1 538 ------------ISQIPGA------------------------MPIQY-- 549 kRGlFSNEAGmGSAPiAAAaAktdPPHPVrQGlVqmlGVFIDTlviCTaT + ++ l ++T BAC01259.1 550 ---------------------AVI---------WAVL----------SIT 559 AfiILltGaynggeaaeGlytvsglsGgaaLTqaAfsshlGswGayfVai +iIL ++++++ +G BAC01259.1 560 ILIILYAMFGSPS-----------RAG----------------------- 575 alflFAFsTIlGwyYYGEknieFLfgnkavkpkalllyRlvvlaaVviGa + +k l++++ + ++ iG BAC01259.1 576 ------------------------VEWK--------LLKATMITSITIG- 592 vasldlVWnlADlfmglMAipNLIALlLLskvviallkDYfaqrkaGskd mglM i+N A + L+ ++ ++ + + ++a BAC01259.1 593 --------------MGLMSIINF-ATAQLGALILIPMCLFSRPLRAQ--- 624 PvFdadqlpglk<-* + ++l+ BAC01259.1 625 -----LEMNFLP 631 COG2233: domain 1 of 1, from 265 to 633: score -353.2, E = 40 *->m.llatalrsvllsmsdsampstmtksdlvygvddrppllkllllGL + + a ++ +l ++ ++ + ++d v + ++ + BAC01259.1 265 AfKRAGTMAAALIFKVGETRK--YGDRDSVTMYAEASN--------- 300 QHllAMFgatVlVPLivGlalgLaaedlayLIsmsLfasGiaTLlQtlit G+ ++L+ L+ + l + BAC01259.1 301 -----------------GQMPNLD-----------LLN-----VVHYLAV 317 gRpfGiglPi.....vLGsSFAFvgPmIaaGglgkegGadieaamgGifg + + G ++ +++ ++ L+sS v + + ++ G BAC01259.1 318 H-RQGFRVNVetfnsLLSSSWLRVIA--EV---FQNLG------------ 349 aglvygviglLisrfgtgdrLkrlfPPvVtGpVImvIGLsLapVAikmag L++++ + +L++ P V G+ +La+ ++a+ BAC01259.1 350 ---------SLLRKINPDWKLDVTVPDYVEGT------ANLASSMYNQAL 384 GgeaamasgnpdfgslenLlLalvVLliilllnrfgkGflrlipILiGlv G ++s+ g+ + + +v L ++ + + ++ l BAC01259.1 385 GV--PTGSH----GAFRDYQVDAVSLEFAPAFHLKNENAKSSFL----LR 424 vGYvlAlfmGlvdfdsalveaapwfalPtpfyfGaPYtpaFnwgaIltml +G + + v+ ++ ++f ++++ p + ++m+ BAC01259.1 425 GGRLTEGVVRSVNN-LLEKFHQ------SFFLYFLT-APSKFISVGVYMI 466 pvalvti.vEhvGditAtgkvtgkpllgPeYkpgLhrGllaDGlatllAG p al+++++ v+ +A g t+ +l+++ +g l +G + BAC01259.1 467 PFALLLApLPIVAAALAGGSKTKGKLEDECKTKGNADDLQMEGGSWKW-- 514 lfGgfPnTTFaqNiGvvalTgVaSryVivwaAvililLGlfPKfaallqs +S +V ++ +++l +l P ++ BAC01259.1 515 ---------------------LKSARVLLIIQFWAVLVSLLPY---YISQ 540 IPsPVLGGamivlFGmI.AasGiriLi..........rakvdlsknRNLl IP Gam + + +I A++ i iLi ++++ra v+ +++ BAC01259.1 541 IP-----GAMPIQYAVIwAVLSITILIilyamfgspsRAGVEWKLLKATM 585 IvAvsLglGlGgaavPpeflaglPavlrplllsGialgaitAIvLNllLp I+++++g+Gl + + +a ++ + A +L +p BAC01259.1 586 ITSITIGMGLMSI-I-N--FA---------------TAQLGALIL---IP 613 grrrnpiveerreeksvtaeaaeealkkea<-* + +r+ + ++ e++ +++ BAC01259.1 614 ------MCLFSRP----LRAQLEMNFLPRT 633 COG3366: domain 1 of 1, from 368 to 646: score -176.7, E = 70 *->vvdyidkvikammltlaylikilpivliGifiasiiietNilkKlkk v+ + +m+ + + + t+ + BAC01259.1 368 YVEGTANLASSMYNQA------------------LGVPTGSHGAFRD 396 ilkpilrklnlpEecvisiatcFvsPTvGysMLkef.yKegkvnerEviv + + + E++ + + + ++++L ++++ eg+v + + BAC01259.1 397 YQVDAVSL----EFAPAFHLKN--ENAKSSFLLRGGrLTEGVVRSVNNLL 440 aslinsFpsvlshSvftfylPvlvpiLGyflGviYVl..irvlVgliktl + sF+ + +P + G +Y ++ +++l l + + BAC01259.1 441 EKFHQSFFLYFLT------APSKFISVG-----VYMIpfALLLAPLPIVA 479 IGvLylkivlknrdieidsDAGVTmPnsdlkiksnrevvikAfkkTikil + + +++++ + e + + ++ + + ++ k +k + BAC01259.1 480 AALAGGSKTKGKLEDECKT-------KGNADDLQME----GGSWKWLKSA 518 krvipsivivvllvvFLielGlfDyveef..akpltnvLnLsgeavtvai rv+ +i + lv+ L y+ +++a p+ ++ + +t++i BAC01259.1 519 -RVLLIIQFWAVLVSLLPY-----YISQIpgAMPIQYAVIWAVLSITILI 562 TelanisaA..ivtaagfldeGILsekevligLllGniiSfstrylKhSi a + + ++++ +l++ +++ ig+ l +ii f+t l +i BAC01259.1 563 ILYAMFGSPsrAGVEWKLLKATMITSIT--IGMGLMSIINFATAQLGALI 610 PlyiSLFGakfGlKlvmvNiavtvlldilfIalLLll<-* LF + + l m N + + +l + I+l+ l BAC01259.1 611 LIPMCLFSRPLRAQLEM-NFLPRTVLLASNILLTVLG 646 COG3094: domain 1 of 1, from 524 to 651: score -67.5, E = 42 *->mNFDRTFLTflsy.......liLkhlHlifialSvllLvIRfvLllk + F + +l y ++ ++ + i + lS+ +L I +++ BAC01259.1 524 IQFWAVLVSLLPYyisqipgAMPIQYAVIWAVLSITILIILYAMFGS 570 .nkekrlakflKIlPHlnDTLL..LlSGivLmyithFlPFsaiApWLteK + + + k+lK+ T+ +++ G +Lm i F t+ BAC01259.1 571 pSRAGVEWKLLKA------TMItsITIGMGLMSIINFA---------TAQ 605 fllllaYIvLGfialkkrrsHskqkrsiaFlLAlvvlaiivklAvtKvpl + l++ + ++ + Fl v la ++l v+ p+ BAC01259.1 606 LGALIL---IPMCLFSRPLR---AQLEMNFLPRTVLLASNILLTVLGFPP 649 Lf<-* BAC01259.1 650 AA 651 COG1593: domain 1 of 1, from 119 to 651: score -377.6, E = 29 *->seavlsqkakevleelekeararrllsgkvll.vvsvv.......al ++++ + k + + +a+ +++ ++++++v +++ +++++ BAC01259.1 119 PDSKCFHPLKFFTSMTNNMAAKPNGTYTNFGInTVGIIraprgdgKE 165 glslfhlyttfagvlstlilrsvhvAvflalafllypalrksklkgvply + +l+ +y++ ++++ +A+++ + ll +a +++++ + BAC01259.1 166 AIVLVTPYNSQKVTPN----ELLSLALGFSVFSLLSRAA-WLSKD-I--- 206 DwLLal...lalfsafYivln.yleLvlr..sggyttlDlvvmvvavl.. ++L+++++ +++sa+ +ln+y ++++ ++++ + +++++ + ++ ++ BAC01259.1 207 -VWLSAdsqFGEYSAVSSWLNqYHNPMFLshPVNL-DTKIYGANQILYkp 254 ......lvleatrralGvPlaislglfilygllgadppgvqahhgysfrv +++ ++ l+a ra ++ a+++ + +++ d+ BAC01259.1 255 dgtaekAELMAFKRAGTMAAALIFKVGETRKYGDRDS---V--------- 292 gadrlvtllilaqegvfGtpdsvsllaIplFiLfGaflnkgGvgkrlidl +++ e +G++ + ll ++ BAC01259.1 293 ---------TMYAEASNGQMPNLDLL----------------------NV 311 AkalvGhrrGGlAkaaVvaSmLfgsiSGS...svAnvv.atGsftIPlMk + l+ hr G+ V + ++ +S S +++A v++ +Gs++ ++ BAC01259.1 312 VHYLAVHRQGFR----VNVETFNSLLSSSwlrVIAEVFqNLGSLLRKINP 357 raGYppefAaAveaaAStgGqLiPPsmgaiaFimaenvsIgYvdlfiAgi + + + ve +A + + + a +v g + + + BAC01259.1 358 DWKLDVTVPDYVEGTANL---------ASSMYNQALGVPTG-SHGAFRDY 397 iPALLyflalmvmvyleAkknglpg..........rglkasqlpllkgtl + +l ++ ++ k ++ + + ++++ ++g+++s ll BAC01259.1 398 Q---VDAVSLEFAPAFHLKNENAKSsfllrggrltEGVVRSVNNLLE--- 441 alkealwlLllpvilIgglfsGgftPteAAavaaVAivlyalivalvsrr + + + L ++ + + +G++ ++ A ++a l + +al ++ BAC01259.1 442 KFH-QSFFLYFLTAPSKFISVGVYMIPFALLLAP----LPIVAAAL--AG 484 l..........i.l.lfiyrhtltglgklavalvlvkalgfivvilLlkG +++++++ +++ + ++ +++ g+ +++ BAC01259.1 485 GsktkgkledeCkTkGNADDLQMEGGSWKW-------------------- 514 lrellkdlleaLeegarttapVaiavAaAgiisgvititgipltl.adll + +ar+ + + ++ ++i+ +ip ++ ++ BAC01259.1 515 ------------LKSARVLLIIQFWAVLVSLLPYYIS--QIPGAMpIQYA 550 lslsggnillllLllimlllLiLGmgmpttaayiIlspilaPvavqll.. + ++ + ++l +l +++ sp a v +ll+ BAC01259.1 551 VIWAVLS-ITILIILYAMFG----------------SPSRAGVEWKLLka 583 ..lGidPihaalahlfvfyngiigdiTPPVGlalFvaagvAgaDPmktgi + i + +l+ ++n +++ + +a + +++ BAC01259.1 584 tmITSITIGM---GLMSIINFATAQL----------GALIL------IPM 614 eafkkaiapfivpflfvlspvllfpeisllglvwlpllligtai<-* f++ +++ + + ++ p +++++ + +l +l + a+ BAC01259.1 615 CLFSRPLRAQLEMNFL---PRTVLLASNI----LLTVLGFPPAA 651 COG2056: domain 1 of 1, from 322 to 653: score -418.3, E = 57 *->MlenllmNaVViaVivMlVLSLlRVNV..vLALiigA...lvaGlvg +RVNV++ +L+++ +++a ++ BAC01259.1 322 ----------------------FRVNVetFNSLLSSSwlrVIAEVFQ 346 gLgLt............................................. Lg ++ +++ + + + ++ +++ + ++ ++ + ++++++ ++ BAC01259.1 347 NLGSLlrkinpdwkldvtvpdyvegtanlassmynqalgvptgshgafrd 396 ........eTinafisGLg.GnAtvALSYAlLGAFAvAIsKSGltdvlvk + + + e +af L+++nA+ S l G+ lt v BAC01259.1 397 yqvdavslEFAPAFH--LKnENAKS--SFLLRGG--------RLTEGVVR 434 kvirllgkGmhdeeargktlkKyllllvIlliAcfSQNviPVHIAFIPiL +v ll k +++l + + f i V + IP BAC01259.1 435 SVNNLLEK-----------FHQSFFLYFLTAPSKF----ISVGVYMIPFA 469 IPPLLslFNkLkiDRRlVACvLTFGLtAPYilLPVGFGlIFqnsILldNl +l++ L i VA++L G ++ + L d+ BAC01259.1 470 -----LLLAPLPI----VAAALAGGSKT--------------KGKLEDEC 496 hqaanglsvsinvaqvplAMliPalGMvvGLLlAvfvtYRKPReYqekeq ++n ++++ + + l a ++ + Av+v + +q BAC01259.1 497 KTKGNADDLQM--EGGSWKWLKSARVLLIIQFWAVLV----SLLPYYISQ 540 ieeaetedleanyenrefksiavaltLvaIVvaLvvQLySltdSmilGAL i +a ++ y a+ + V++ ++ + il A+ BAC01259.1 541 IPGAMPIQ----Y----------AV--IWAVLSITILI-------ILYAM 567 lGlivfflsG.vvkfk..etddv..fteGvKMMAfIGFVMLvAaGFAeVl G + s+ +v++k ++ ++++t G+ +M I F BAC01259.1 568 FG----SPSRaGVEWKllKATMItsITIGMGLMSIINF------------ 601 kaTGaVedLVnsvsssiGqsKgLaALLMLvVGLlITMGIGSSFsTIPIIA aT L + + s+ L A L BAC01259.1 602 -ATAQLGALILIPMCLF--SRPLRAQL----------------------- 625 tiYVPLClaLGFSPlATiaiiGtAgALGDAGSPASDSTLGPTsGLNADGQ + F P T+++ AS BAC01259.1 626 --------EMNFLPR-TVLL-------------AS--------------- 638 HdHIwDTcVPTFiHYNiPLlVFGwIAAMvL<-* Ni L V+G A +L BAC01259.1 639 ---------------NILLTVLGFPPAAFL 653 COG0609: domain 1 of 1, from 303 to 661: score -226.7, E = 62 *->llllillllllllls...........ilslslGaisistfLfpsdvl + l ll+++ l++++++ + + +++ sl ++ +++v+ BAC01259.1 303 MPNLDLLNVVHYLAVhrqgfrvnvetFNSLLSSSWLRV----IAEVF 345 qalfg............................................. q+l + ++ +++ + + + ++ +++ + ++ ++ + ++++++ + BAC01259.1 346 QNLGSllrkinpdwkldvtvpdyvegtanlassmynqalgvptgshgafr 395 ........................eqiildlRLPRilaallvGAsLAvaG + + + + + + + ++++ ++ + ++ RL BAC01259.1 396 dyqvdavslefapafhlknenaksSFLLRGGRLTE--------------- 430 ailQgltR..NPLAdPsiLGissGAalgavlaillfpgasssslkrSvil g+ R+ N L + + +++++ ++ + s ++ BAC01259.1 431 ----GVVRsvNNLLE----------KFHQSFFLYFLTAPSKFISVG---- 462 npyllplaAfiGaliaallvyllarrsgggn................... y++p+ A++ a + +++ l + ++ g +++ +++++ ++ + +++ BAC01259.1 463 -VYMIPF-ALLLAPLPIVAAALAGGSKTKGKledecktkgnaddlqmegg 510 ....slspirLiLaGvalsalfsAltslllylad.lqqvlfWllGSlsga + + s L+++ ++ l+sll y ++++ ++ + BAC01259.1 511 swkwLKSARVLLII-----QFWAVLVSLLPYYISqIPGAMP--------I 547 nWsdvllllpivllglilllllarkLnlLsLGddlAksLGvnverlrlll ++ ++++l i++l++++++ +G Gv ++l+ + BAC01259.1 548 QYAVIWAVLSITILIILYAM----------FGSP--SRAGVEWKLLKATM 585 lllvvlLtgaaVavaGpIgFvGLiaPHiaRllvGpdhrylLPlSaLlGal + +++ g+ + ++a lGal BAC01259.1 586 ITSITIGMGLMSIIN--------------------------FATAQLGAL 609 LLllADilaRtilaPaelPvG.........ivTalilliGavaPyFlYLL +L +++R + a e+ +++ ++i+ +++ G P +L+ BAC01259.1 610 ILIPMCLFSRPLRAQLEMNFLprtvllasnILLTVL---G--FPPAAFLI 654 rrrrkisletrgrg<-* +++ +g BAC01259.1 655 MKG-------LSKG 661 COG0842: domain 1 of 1, from 303 to 662: score -97.3, E = 60 *->kfrali.ellr.llrdpllllliilpplllllifgygf.gtl.dlpv + ++ + + + l + + + + + ll+++++ ++ + BAC01259.1 303 MPNLDLlNVVHyLAVHRQGFRVNVETFNSLLSSSWLRViAEVfQNLG 349 avvdedqsalsrqlielkatslflrkllvltlrelkrflrsrgeiigalv + +++++ + v + e + l s + +++al BAC01259.1 350 SLLRKINPDWK-------------LDVTVPDYVEGTANLAS-SMYNQALG 385 iPedflwlllfgfalvi.vigggqdgeivaysaevggasdylayvlpgla +P + +++ ++++++ +++ ++++ + + ++ BAC01259.1 386 VP-------TGSHGAFRdYQVDA------VSLEFAPAFHLKNENAKSSFL 422 lmavlsaslfgvisalfdrrssyfvypaaesvialrlGtlerllvsPlsr l + +gv++ + + + + +++++ l + BAC01259.1 423 LRGGRLT--EGVVRSVNNL----------------LEKFHQSFFLYFLTA 454 lsillgrilasllrsllqaailrg.......................... s +++ ++ ++ll a ++++ ++++++++ +++ +++++ ++ BAC01259.1 455 PSKFISVGVYMIPFALLLAPLPIVaaalaggsktkgkledecktkgnadd 504 ................lvillllalflllfgfivfgiplsgsllllllav + ++++ + ++ + l+i+ a ++ l +++++ip + +++++ BAC01259.1 505 lqmeggswkwlksarvLLIIQFWAVLVSLLPYYISQIPGAMPIQYAVIWA 554 llsvlltlglgallislvlallvstpetagaiasllilpliflSGvfyPl +ls+ ++++l+ +++ + ++ + + +++ +i ++ ++ + BAC01259.1 555 VLSITILIILY-AMFG----SPSR-AGVEWKLLKATMITSITIGMGLMSI 598 ellPdwlqwiayanPltyavealRylllgaglsdvwfsllvLallgllll ++ + + ++P+ + + lR+ l + ++ + l+ +ll++l + BAC01259.1 599 INFATAQLGALILIPMCLFSRPLRAQLEM-NFLPRTVLLASNILLTVLGF 647 llgllllrrrekkar<-* +++l+ ++++k+ BAC01259.1 648 PPAAFLIMKGLSKGS 662 COG1270: domain 1 of 1, from 451 to 662: score -194.5, E = 27 *->melafalfpaievialllAviLDlllGEPPariHPVvwfGkliafle +a++ f+ ++v+ + A++L ++P + +++ k ++le BAC01259.1 451 FLTAPSKFISVGVYMIPFALLLA----PLPIVAAALAGGSKTKGKLE 493 riwnrkrsk.aarflaG...vlalaatllvvaaflallLalllaslplpL ++k++++ + G+ ++l a +ll++ f a+l++ll+++ + + BAC01259.1 494 DECKTKGNAdDLQMEGGswkWLKSARVLLIIQ-FWAVLVSLLPYYISQIP 542 nlivqalLLkvksslairsLaeaaekvaralaagDveeARrlLsWmlVsR + ++++ +++l+i+ L ++ +++ g+ + R+ + W l BAC01259.1 543 GAMPIQYAVI-WAVLSITIL------IILYAMFGSPS--RAGVEWKL--- 580 DTsqLseeellsAAIESlAENlvDgVvAPLFYfilGilvvflGlPGAllY L + ++s +I ++l + BAC01259.1 581 ----LKATMITSITI---GMGL-----------MS--------------- 597 RavNTLDaMvGYrneryedfGwfaARLDDiLNyiPARLtGavlvLiaapl i+N+ A L +Li p BAC01259.1 598 -----------------------------IINFATAQLG----ALILIPM 614 rggstrqavrrdrrrapkwpSPNsGwpmAAmAgaLgVrLeKpGvYqlngs ++r++r ql+ BAC01259.1 615 C------LFSRPLRA------------------------------QLEMN 628 eRpkLGdgpqpgtvaDieralalvrrvlltvlvflllaaglvlvigpga< p r + l++ lltvl f ++a++++ ++ g BAC01259.1 629 --------FLP-------RTVLLASNILLTVLGFPPAAFLIMKGLSKGS 662 -* BAC01259.1 - - COG0395: domain 1 of 1, from 449 to 662: score -206.0, E = 64 *->rkillylfLilfaliilfPflwlvltSfkpdgntdsselfsgpptlf +l+ ++ + ++++Pf +l++ p +++ +g+ BAC01259.1 449 LYFLTAPSKFISVGVYMIPFALLLAP--LP----IVAAALAGGSKTK 489 Pstftlenyfr.nYrkvfklttggnfpflraflNSlivalvttvlsvlls ++ e + +n ++ +++g+ + +++ S v l++ + +vl+s BAC01259.1 490 -GKLEDECKTKgNADDLQ-MEGGS----WKWLK-SARVLLIIQFWAVLVS 532 slAAYAlaRlrFkGrkllfllilatlMiPfqvlliPl.YllirkLGLlnp l+ i ++ + P++Y +i+ + BAC01259.1 533 ----------------------LLPYYISQIPGAMPIqYAVIWAVL---- 556 lGvlldTywGLILpyaagglpfnifllrqfFdt.IPkELeEAAriDGAsp +++ LI+ ya +g p + ++ + L+ A +i + BAC01259.1 557 ------SITILIILYAMFGSPS---------RAgVEWKLLKATMI---TS 588 fqiFfrIvLPLskPglAtvaiftF.igsWNdFlwpliflsdpndslnypl ++i g+ ++i++F +++ ++ + l + BAC01259.1 589 ITI-----------GMGLMSIINFaTAQLGALILIPMCLFS--------- 618 yTLpvgLanlingeygtdlvtapewglimAaavlaaLPililFlffQkyf ++L ++L + +++ A +l +++ F+ + + BAC01259.1 619 RPLRAQLEMNF-LP----------RTVLLASNILL---TVLGFPPAAFLI 654 vkGltaGgvKG<-* +kGl+ G+ BAC01259.1 655 MKGLSKGS--- 662 COG0239: domain 1 of 1, from 554 to 663: score -76.9, E = 85 *->mmlksllavalGGAlGAvlRylvslllntlfgrsesfPlGT.L..lV ++l ++++++l + +G+ R v+ l + +s+ +G++L +++ BAC01259.1 554 AVLSITILIILYAMFGSPSRAGVEWKLLKATMI-TSITIGMgLmsII 599 NlvGSFllGflltakylaekvAvlspdwrlllGTGFlGalTTFSTFSvEt N ++ lG+ l+ l + ++s +r+ ++E BAC01259.1 600 N-FATAQLGA-LI---LIPMC-LFSRPLRA----------------QLEM 627 vsLlqegrlgkalayvvllslllgllavlLGfllarrllg<-* L + ++ +++ +ll++l+ a++L + ++ BAC01259.1 628 NFLPR---TVLLASN-ILLTVLGFPPAAFLIMKGLSKGSW 663 COG1808: domain 1 of 1, from 280 to 664: score -124.9, E = 16 *->msKsKssGHRFsLLmLLRLRRGFtnnLDLtDEmkVrvlnrlgvserl + + G R s+ m G nLDL r g+ + BAC01259.1 280 VGETRKYGDRDSVTMYAEASNGQMPNLDLLNVVHYLAVHRQGFRVNV 326 RsisiyipkkflagieeilkkngsedvleeIsviepllgpIldasigtvv + + + ++l +i+e +g l+ I +l +++d gt++ BAC01259.1 327 ETFNSLLSSSWLRVIAEVFQNLG--SLLRKINPDWKLDVTVPDYVEGTAN 374 kDaeeaelvvrklkkllLgvkeavvvaaapafvvpfeeeepkGLDvLkee ++ + + + + + v a ++f ++f + + BAC01259.1 375 LASSMYNQALGVPTGSHGAFRDYQVDAVSLEFAPAFHLKNE------NAK 418 ellerlelyylaldvAklskevgvvg........vlagvvalsGvissnv + + + + +v++ + + +++ +++ l+ a s is v BAC01259.1 419 SSFLLRGGRLTEGVVRSVN--NLLEKfhqsfflyFLT---APSKFISVGV 463 gvlIavmliaPLlgvgiGiaidlslvvgdvkLlakgakvLllaniltivl ++l+aPL v ++a s + g + k ++ ++ BAC01259.1 464 YMIPFALLLAPLPIVAAALA-GGSKTKGKLEDECKTKGNADDLQMEGGSW 512 aalvglfllskldvlqylaeigLdkevsalsaivAvllGiaga.lstfsa l ++ l+ q+ a+ vs l ++ + G ++++++ a BAC01259.1 513 KWLKSARV---LLIIQFWAVL-----VSLLPYYISQIPGAMPIqYAVIWA 554 ilssPsRFFLiLvtgaivsveaallppaVlrgillatepaylssiiklll +ls+ iL+++ + +++ V +l at+ ++ +l BAC01259.1 555 VLSI-----TILIIL--YAMFGSPSRAGVEWKLLKATMITSITIGMGLMS 597 qinvgLvdvsalaii...laygikaerqypklaarlytlklldleleLli in+ + ++al i+ l + ++++ r ++l++ +l++ L++ BAC01259.1 598 IINFATAQLGALILIpmcLFSRPLRAQLEMNFLPRTVLLASNILLTVLGF 647 avkllailagapsdstl<-* + i+ g+ +s BAC01259.1 648 PPAAFLIMKGLSKGSWT 664 COG1178: domain 1 of 1, from 168 to 665: score -338.6, E = 91 *->vvasly.v.m.ysaillvlavgalatlatvklvvmggdplriglwl. v ++ y+ + ++ ll la+g+ +++ + +++ ++ +++l+ + BAC01259.1 168 VLVTPYnSqKvTPNELLSLALGF--SVFSLLSRAAWLSKDIVWLSAd 212 ..liglllaalllllpplvlavlalsafsg...glaeflavlsdayllrl +++ +++++++l +ls+ ++ + ++ ++l + + BAC01259.1 213 sqFGEYSAVSSWLNQY---HNPMFLSHPVNldtKIYGANQIL--YKPDGT 257 lgnTLllallvTvlslilGlplAyllsr...ydfPG.rrwlrwll..... + l+a+ G+ +A l+++ ++++ +G+r+ + + ++ +++ BAC01259.1 258 AEKAELMAFK------RAGTMAAALIFKvgeTRKYGdRDSVTMYAeasng 301 aLPiLLviPalvvAfgfislfGksGwLarllgelfGlssreywlpdiygP +P L l ++ +++ ++G+ r e f ++ + BAC01259.1 302 QMPNL----DLLNVVHYLAVH-RQGF--RVNVETFN--------SLLSS- 335 lgGiilalvlfnyPlvyllaaaalesidpsleEaArsLGasrwqvFrrVt ++ ++a v+ n+ + l++i+p+ + +Vt BAC01259.1 336 SWLRVIAEVFQNL-------GSLLRKINPDWK--------------LDVT 364 LPllrPaiaagalLvFlyclsdFgavliLGGspqytTlttaIyq.eilgs +P + a+ l++ + +LG ++T + + ++++ + BAC01259.1 365 VPDYVEGTAN---------LASSMYNQALG----VPTGSHGAFRdYQVDA 401 .qldlatAalLallLLllsllllwvvvkllerfsrg.rqkvyssgqswar +l++a A+ L + s +ll + r++ g + v +++ BAC01259.1 402 vSLEFAPAFHL-KNENAKSSFLLR-----GGRLTEGvVRSVNNLLEKFHQ 445 piprilllglaalvallfclllfsvlllgvilplsflllWtvlltsdews + + l + +++ + ++f++ll+ + ++++ l + + +++e BAC01259.1 446 SFFLYFLTAPSKFISVGVYMIPFALLLAPLPIVAAALAGGSKTKGKLEDE 495 damlgplfstsfwhalins....LtlallAalialllaLllaylvrrsrs + +n+++ ++ +++ l ++ ++L +++++++ BAC01259.1 496 ------------CKTKGNAddlqMEGGSWKWLKSARVLLIIQFWAVLV-- 531 rlsrfidrLsmLplAvPGvVlalGllllfnnldnhwvdlaameGvkpllv s Lp+ +++ a+ ++++ +++ BAC01259.1 532 ---------SLLPYYISQIPGAMPIQYAVIWAV----------------- 555 lygtllllVlAyalralPfalrsleaalrqidprLeeaArsLGasrwqif + +l++ ya++ +P ++ +w+++ BAC01259.1 556 --LSITILIILYAMFGSPSRAGV----------------------EWKLL 581 rrvtLPLllpgllaAaalvFalsmkElsATllLgppdftTLttaiynyls + + g ++ Fa+ ++l+A l+L p + + ++++ BAC01259.1 582 KATMITSITIGMGLMSIINFAT--AQLGA-LILIPMCLFSRPLRAQL--- 625 ggdgryaaAavllvasaAlvLvlislllfvllikrygersqgt<-* + ++ + +vll+ + l+ + + ++f l k++ + s++ BAC01259.1 626 --EMNFLPRTVLLASNILLTVLGFPPAAF-LIMKGLSKGSWTV 665 COG1175: domain 1 of 1, from 453 to 665: score -198.4, E = 48 *->wlayllllPalll...fllFiyyPlietlaiylSFtdwdgfppaa.a ++ +++ +++ + ll+ +P++ a+ + g + + + BAC01259.1 453 TAPSKFISVGVYMipfALLLAPLPIVA--AALAGGSKTKGKLEDEcK 497 q.itpgesefVGlkNfvrlftlfsDptFwqalknTllytllsvplqlvlg +++ l+ + ++++ ++ l+ +++v ++l+ BAC01259.1 498 TkG-----NADDLQMEGGSWK------WLKSARVLLIIQFWAVLVSLLP- 535 LllAlLLnqkrlkGrglfRtllflPyaiSpVvaaliWkflFnprdsfGli + q + G +++++++ + ++ + +i + +F + + BAC01259.1 536 ----YYISQ--IPGAMPIQYAVIWAVL--SITILIILYAMFGS-P----- 571 NqlLnllGidpppviaipWlndpfwAllaiilvnvWkgtgfnmlifLAgL + + W + ++i + +g + +i BAC01259.1 572 ---------SRAG---VEWKLLKATMITSITIG-----MGLMSII----- 599 QsIPqeLYEAAriDGAsrwqrFrhITLPlLrPtilfvlilstigafqvFd + + +ga+ + BAC01259.1 600 ------------------------------------NFATAQLGALIL-- 611 eiyllTgGGPagGPpvgnaTdtlalyiYreaFeggpQdfGyAsAiavilf +++ l + P + a + + + + + ++l+ BAC01259.1 612 IPMCLFSR-----P--LRAQLEMNFLPRTVLLASN-----------ILLT 643 livliltliqfklfkrkveegs<-* ++ + + + + + +k+ + + BAC01259.1 644 VLGFPPAAFLIMKGLSKGSWTV 665 COG1113: domain 1 of 1, from 379 to 677: score -423.7, E = 43 *->M.dpslvmaddnqdaeqeLkRgLknRHIqlIAlGGAIGTGLFlGSgs M + +a + + + + ++ ++ BAC01259.1 379 MyN----QALGVPT---GSHGAFRDYQVD----------------AV 402 aIqmAGPsvlL.......aYaIaGlivflIMRaLGEMavanPvaGSFsdY + +A P+ L++++ ++++++ G + + v +S BAC01259.1 403 SLEFA-PAFHLknenaksSFLLRGG------------RLTEGVVRSVNNL 439 ArkylGpwAGFltGWlYWflWvlvgmaElTAvgiYmqyWfpaFPdvPqWi +k+ + fl+ l++++ + vg+Ym +p BAC01259.1 440 LEKFHQSF----------FLYFLTAPSKFISVGVYM---IP--------- 467 wALialvlllavNLisVKlFGElEFWFAlIKVaAIvafIvlGlvllfggf +AL ++ l + +++l+gg BAC01259.1 468 FALLLAPLPIV-------------------------------AAALAGGS 486 ggggtGfsNLwahtGGFFPnGllGlllalqvvvFAFgGiElvGitAgEak g L + t g a+ BAC01259.1 487 KTKG----KLEDE-----------------------------CKTKGNAD 503 dPe......ksipkAinsViwRIliFYvGslfvilslyPWnqvgsggsgS d + ++++ k + +A ++ I+ F +v++sl+P+ + g + BAC01259.1 504 DLQmeggswKWLKSAR---VLLIIQFW----AVLVSLLPYYISQIPG-A- 544 PFVtvFskiGipaAasImNfVVLTAAlSslNSGlYstsRMLysLAeqGdA i ++ ++ + +L+ +Ly + BAC01259.1 545 ------MPIQYAVIWAVLSITILI---------------ILYAM------ 567 PkffaklsrrGVPvnAillsavvlllgVlLNYvaPsvekvFelltsssgl f+ sr GV ++ + ++ + + +++ + + + ++ t+ g+ BAC01259.1 568 ---FGSPSRAGVEWKLLKATMI-TSITIGMGLMSI-I----NFATAQLGA 608 galfvWlmIllsqLkfRkarkpaqgaalkFkmpLyPfttyltlaFllfvL ++l++ m l+s +R++++ m + P t BAC01259.1 609 LILIP--MCLFSR-PLRAQLE----------MNFLPRT------------ 633 vlMafdpdtRislfvtpiwlvlLvigYlvfgkrrkaeahgarpv.kaee< vl+a + +++++ ++L++ l +g + + +++ + e BAC01259.1 634 VLLASN---ILLTVLGFPPAAFLIMKGLSKGSWTVDIVGDF--WlWMEF 677 -* BAC01259.1 - - COG0670: domain 1 of 1, from 497 to 685: score -132.8, E = 84 *->MglydrmqsrnfektqaeargetliaqvvlkmtygLLltalvasaag + +++++ ++ + ++ +a+v l+ +l +l+++++ BAC01259.1 497 KTKGNADDLQMEGGSWKWLK----SARVLLIIQFWAVLVSLLPYYIS 539 af.valaallpspglfsttflvlilallglvfliflmikkrnsaYWYtgl ++++a+ ++++++++ vl + +l++ + +f +++ + + l BAC01259.1 540 QIpGAMPIQYAVIWA------VLSITILIILYAMFGSPSRAGVE---WKL 580 alffvytgLs.GytLspilnvylatsGaGgdvIasAFgiTaivFggmSay ++ +t + G+ L +i+n+++a+ G a++ m ++ BAC01259.1 581 LKATMITSITiGMGLMSIINFATAQLG-------------ALILIPMCLF 617 GlttrFN.kkDlsflgkfLfmaLigLvvasLvNlvlGFslFlgssalnla + ++fl +++++a N++l +lg + + ++ BAC01259.1 618 SRPL--RaQLEMNFLPRTVLLA---------SNILL---TVLGFPPAAFL 653 ISalGvilFsglIlyDtqnIirggrkligddteisedlsIraALsLYLdf I + +g + +d+ BAC01259.1 654 I---------------MKGLSKGS-----------WT----------VDI 667 INLFlsLLrIlgilnddD<-* ++ F+ + +l ++ BAC01259.1 668 VGDFWLWMEFLWEWSSAT 685 COG2245: domain 1 of 1, from 498 to 687: score -101.9, E = 52 *->vikMeLsnAKllgGiGaiLqLvGaavgllGiLiwiLSivGlVLvLia k + + gG L+ +a +ll i+ + vL++ BAC01259.1 498 -TKGNADDLQMEGGSWKWLK---SARVLL--------IIQFWAVLVS 532 l..ymISkqvgddrIFnyyLigfismvagl.ilfvivifAtvGvsivall l +y IS++ g I +y++i a+l+i ++i+ A ++++ + BAC01259.1 533 LlpYYISQIPGAMPI-QYAVIW-----AVLsITILIILYA----MFGSPS 572 kksmmslahglaalgsiLaGfViLyIayiigtyFqkKsyEliAq...... + + + + ++ + si +G+ ++ I + + q ++ li +++ BAC01259.1 573 RAGVEWKLLKATMITSITIGMGLMSIINFATA--QLGALILIPMclfsrp 620 .yTgVdmFrtAGLlYfiGaiLLIVlGvGf....lilliA.....aILeiV + m + iLL Vl Gf++ +++ + +++++ iV BAC01259.1 621 lRAQLEMNFLPRTVLLASNILLTVL--GFppaaFLIMKGlskgsWTVDIV 668 aFFSLPdEikaseqaaVsp<-* + F L E + +a BAC01259.1 669 GDFWLWMEFLWEWSSATYL 687 COG0534: domain 1 of 1, from 326 to 693: score -253.0, E = 24 *->gsirkaLlkralrLAiPiilanllqtlyglvDtfmvGhlgGadaLA. + Ll ++ ++a ++q+l +l + +++ L BAC01259.1 326 VETFNSLLS----SSWLRVIAEVFQNLGSLLR-----KINPDWKLDv 363 ........AVglaspilflliaigmglgtGtsvlvaQaiGAgdregarra + ++ ++ +las++++ + g+ tG +GA + ++ BAC01259.1 364 tvpdyvegTANLASSMYNQ----ALGVPTGS-------HGAFRDYQVD-- 400 arqglvllalllslllgllglfllellLrllgaseevlalAaeYLrilil v+ + f++ + L+ + + +l BAC01259.1 401 ----AVS------------LEFAPAFHLKNEN------------AKSSFL 422 glpfallsfvlrgalrgaGdtrtpmvvsiignllNivLdyiLIfGkgvlF l v r++ + + + ++++ ++++++ ++ ++ BAC01259.1 423 LRGGRLTEGVVRSVNNLLEKFHQSFFLYFLTAPSKFISVG--VYM----- 465 GfpglGivGAAiATviarwigflvllfyllrgkkgllislfdrv.lkllk ++ A + ++++l++ + + + ++ +++ +l BAC01259.1 466 --IPFALLLAPLPI----VAAALAGGSKTKGKLEDECKTK---GnADDLQ 506 pdrkvlkkllrlGflPialeeilfsvgsfllntlfvarfGtdavAAyqia ++ ++k +l+ + + + ++ v++ ll +++ + y++ BAC01259.1 507 MEGGSWK-WLKSA-RVLLIIQ-FWAVLVSLLPY-YISQIPGAMPIQYAVI 552 lriaslifmppfGisiAvttlvGqnlGAgnyerarraarlglklslligl + + s+ +++++ + ++ +++G +++ ++++ ++++++++++++ BAC01259.1 553 WAVLSITILIILYAMFGSPSRAGV-----EWKLLKATMITSITIGMGLMS 597 ilalllllfpepiasllftsdpevialaaqlllilaisqpfdlFvgiqvv i+ +++ +++ i ++ +++ +a + + f+ BAC01259.1 598 IINFATAQLGALILIPMCLFSRPLRAQLE---------MNFL-------- 630 lsGvlrGaGdtkvpliislisywgirlPlayllafftgfgknfwllltgl ++ l s i + ++++P+a +l + BAC01259.1 631 ----------PRTVLLASNILLTVLGFPPAAFLIMKGL------------ 658 glaGvWlgliignlvaaiigvvlallwrllrrwrkgteaaaekaa<-* + G W i+g ++ l+ +l++w t++ + BAC01259.1 659 -SKGSWTVDIVG----DFW-----LWMEFLWEWSSATYLYVFLVH 693 COG2244: domain 1 of 1, from 219 to 694: score -211.8, E = 57 *->slkrrllkg....sivlsiatliskllgfvvlvllaRllGpegfGly s ++ l ++++++ +l +k+ g ++ + p g BAC01259.1 219 SAVSSWLNQyhnpMFLSHPVNLDTKIYGANQIL-----YKPDGTAEK 260 afalaflgllviladlGlplalsryvaeyrekgdkerarrllstvlvlti a +af +++a+l + + +r + ++ ++++ + + BAC01259.1 261 AELMAFKRAGTMAAALIFKVGETRKYGD-------RDSVTMYAEASNGQM 303 llsvllflilllaapfiaefflnlkdpdlaalirllalalllisiisflr ll ++ +la++ f +++ ++ + +ll+ s+ +++ BAC01259.1 304 PNLDLLNVVHYLAVHRQG--F--RVNVETFN-------SLLSSSWLRVIA 342 gifqgfqrmkylalsqviesilrfllilslvflillvlsvlllgasGaAa +fq ++ +l + i + + + + + ++g + BAC01259.1 343 EVFQNLG-----SLLRKINPDWKLDVT----------VPDYVEGTAN--- 374 laavigalvvgavlslligliklgrkflkpsr...kalfkrllrfgipll la+++ ++++g+ ++ + + + + + ++l + + ++ l BAC01259.1 375 LASSMYNQALGVPTGSHGAFRDYQVDAVSLEFapaFHLKNENAKSSFLLR 424 lsslagllygyiDrillgafLgAGvastlgdeavGiYngaaqplvtvlli +++l ++++++ +l ++ +++ + + + + +++++ + BAC01259.1 425 GGRLTEGVVRSVNNLLEKFH--------QSFFLYFLTA-PSKFISVGVYM 465 lasslstvlfPyisrayeegklkllsl.....................fl ++ +l + +P+ + a + g +++ +l+++ +++++ ++ + ++++ + l BAC01259.1 466 IPFALLLAPLPIVAAALAGGSKTKGKLedecktkgnaddlqmeggswkWL 515 ksllrlllilaiPaalglllladplilllfgekylgseaapllillilvf ks+ +ll+i ++l+ l +i+ + g+ ++ +a+++ +l++ + BAC01259.1 516 KSARVLLIIQ---FWAVLVSLLPYYISQIPGAMPIQ--YAVIWAVLSITI 560 llsilgvpgaslLqa.....lgktklalyislfgallnlvLnfllLiprf l+ ++++g s a+ + +l k+ +++ i++ +l +++ + ++ BAC01259.1 561 LIILYAMFG-SPSRAgvewkLLKATMITSITIGMGLMSIINF---ATAQL 606 GieGAAlatvisyaivtilsiiysrklgglsaklkkilklliaallmalg G++ + +++s + + l ++ + l + + a++ ++ BAC01259.1 607 GALILIPMCLFSRPLRAQLEMNFLPRTVLLASNILLTVLGFPPAAFLIMK 656 ivvlavllllllillaaaglvgalillgalvavglavrdylllrls<-* ++++ +++ +++ + +l+ ++ a ++y++l++ BAC01259.1 657 GLSKGSWTVDIVGDFWLW--------MEFLWEWSSATYLYVFLVHL 694 COG1955: domain 1 of 1, from 294 to 713: score -389.1, E = 60 *->mAAEAEGEHvEsEGslidvkfgEkveivvkvglnprkyllrilapal m AEA s G + + + v +l ++r BAC01259.1 294 MYAEA------SNGQMPNLD-----LLNVVHYLAVHRQGFR------ 323 vgsvvLflvtivfiralmlptglvlalyLllpllilafavaypylradsk v +++ +l+ l ++ +f + lr + BAC01259.1 324 --------VN------VETFNSLLSSSW--LRVIAEVFQNLGSLLRKINP 357 rlsinsrLlyfITymavLSTsnlnrteilrilsekpeeyGplskEfrKiy + +++ + a L++s n +l+ G+ + y BAC01259.1 358 DWKLDVTVPDYVEGTANLASSMYN-----QALGVPTGSHGAFRD-----Y 397 dLvdkWgrSLaeAcrfiAkrtpSeifaDfLdRLAyaldSGedleEFLerE + vd+ +a A+ + + +S+ fL R G l E r BAC01259.1 398 Q-VDAVSLEFAPAFHLKNENAKSS----FLLR-------GGRLTEGVVRS 435 qravmddYetfYeraLySldvykdiYvSlllSivFlla...fvivlPILl + ++ + S+ +y Fl a+++f+ v++ ++ BAC01259.1 436 VNNLLEKFHQ-------SFFLY------------FLTApskFISVGVYMI 466 ganDitrtltlsafavtaielllvlvifkrlPkDplwh....egeikTkr + l+l+ + + a++l ++ + l + + ++ +++ + + BAC01259.1 467 PFA-----LLLAPLPIVAAALAGGSKTKGKLEDECKTKgnadDLQMEGGS 511 diklkrlaiiaialaaivllFliyavtllnltgplftlPvpfyaAvgltP ++lk + ++i+ +v l+ ++ +++ ++g++ + + a +++t BAC01259.1 512 WKWLKSARVLLIIQFWAV-LVSLLPYYISQIPGAMPIQYAVIWAVLSITI 560 AGLVLlyvGyvARkEEgkVkkkDEaFpaFIRSLGsslSaaGnslvkaLey L++ly+ +F++ s +++ l+ka BAC01259.1 561 --LIILYA----------------MFGS------PSRAGVEWKLLKA-TM 585 LsahnFGiLtedIkrLYkRLalriDsnkAWdlFsAetGSyLIqkfSeIFv + G+ ++ S+I BAC01259.1 586 ITSITIGM----------------------------------GLMSIINF 601 esidlGGdPdlvGevISenfeeiVrLRkkRyqsvStFvGilyGlhgAlaG + +lG+ + + S R +R q +F + + BAC01259.1 602 ATAQLGALILIPMCLFS---------RPLRAQLEMNF--------LPRT- 633 fsLfIslgvaklidgIfskFsvplgGeevgsIfniiPGGmqGLNiGiFps +L++s +++ + ++F + + + +++ s BAC01259.1 634 -VLLASNILLTV-----LGFPPAA--FLIMKGLSKG-------------S 662 sdvdLleillviillvlsiisslaikVvdGGhkvnsLyyfVillwisaiv vd + + ++ + s+ + +++L + + w+ +i BAC01259.1 663 WTVDIVGDFWLWMEFLWEWSSATYL--------YVFLVH--LPCWLLCIH 702 myvtleivksllsvivvsvvlv<-* ++ + +++ BAC01259.1 703 VLLHP--------CY---QPES 713 COG1007: domain 1 of 1, from 296 to 716: score -329.6, E = 32 *->mdalilvleilnlalllPelillltalvvllvelflsrksRrlylav a+++ + l+l + + l + ++ v ve f+s s + +l v BAC01259.1 296 AEASNGQMPNLDLLNVVHYLAVHRQGFRV-NVETFNSLLS-SSWLRV 340 lsllalvv.allsllatalweqgagpkaflgafavDklsliyklviLlsa ++ ++ +++ll ++++ w + ++ ++ +++ l++ BAC01259.1 341 IAEVFQNLgSLLRKINP-DWKLDV------------TVPDYVEGTANLAS 377 lltllfy........................................... ++ + ++++++ ++ + + + + + + ++++ +++ ++++ BAC01259.1 378 SMYNQALgvptgshgafrdyqvdavslefapafhlknenakssfllrggr 427 .......aysyaeeansskgEFYaLlL...fatlGmivlvssnnLlliFi +++ ++ ++e+ + + +Y L+++++f+ +G+ ++++ BAC01259.1 428 ltegvvrSVNNLLEKFHQSFFLYFLTApskFISVGVYMIPFAL------- 470 gLEllSLplYiLvalsrdsrrSlEAAlKYfllGalaSafllyGiAllYga +L+ l+ ++ ala + ++ BAC01259.1 471 ----------LLAPLP-------------IVAAALAGG----------SK 487 t.GtLdLs....gIalalknagednplLlllGlvfllvGlaFKlsaVPFH t+G L+ + +++g+a++l ++g + ++L + v+l+ BAC01259.1 488 TkGKLEDEcktkGNADDLQMEGGSWKWLK-SARVLLI------------- 523 fWtPDVYeGAPtpvvAFlStapKiAaFvvllRlfvtafgtdliafdtpdw i + +vl+ l+++ ++ i++ p+ BAC01259.1 524 -----------------------IQFWAVLVSLLPYYISQ--IPGAMPIQ 548 ylvfavLAvlSMliGNlaALtQtnvKRmLAYSSiaHaGYlLiglaavtkg y v++ AvlS++i + + ++ R a + L+ + t BAC01259.1 549 YAVIW--AVLSITILIILYAMFGSPSR-------AGVEWKLLKATMITSI 589 gnadkslglsAglfYllvYafmslGaFgv..lallegldrgvgaesYadi + gl+ ++ +a + lGa+ + ++ l++++ r + + BAC01259.1 590 -------TIGMGLMSIINFATAQLGALILipMCLFSRPLRAQLEMN---- 628 sdykGLlkrhPllAallsivlfSLAGIPPlaGFwGKfyvfaaavqsGhlw l r llA+ +++ ++ + PP a ++++ ++++G ++ BAC01259.1 629 -----FLPRTVLLASNILLTVLGF---PPAA------FLIMKGLSKGSWT 664 LavlavlnGSaisayYYlRvvvamffeepeqePinqnpvgngekvvvlas +++ + + + ++e+ +++ +v+l+ BAC01259.1 665 VDIVGDFW-----------LWMEFLWEWSSA-------TYL---YVFLV- 692 alavlvlGilpnpvidlvkksaalnLl<-* + ++++ ++++ + + + + BAC01259.1 693 --HLPCWLLCIHVLLHPCYQ-PESKMK 716 // hmmpfam - search a single seq against HMM database HMMER 2.1.1 (Dec 1998) Copyright (C) 1992-1998 Washington University School of Medicine HMMER is freely distributed under the GNU General Public License (GPL). - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - HMM file: /data/patterns/cogs/cogs.hmm-f Sequence file: BAC01259.1.fa - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Query: BAC01259.1 putative GPAA1 - like protein [Oryza sativa (japonica cultivar-group)]. Scores for sequence family classification (score includes all domains): Model Description Score E-value N -------- ----------- ----- ------- --- COG1928 3.7 1.3 1 COG2851 1.1 23 1 COG1393 0.2 91 1 COG0080 -0.2 67 1 COG0128 -0.6 53 1 COG1279 -0.8 78 1 COG0159 -1.1 99 1 COG1784 -1.8 89 1 COG0814 -2.2 1e+02 1 COG3292 -2.4 52 1 Parsed for domains: Model Domain seq-f seq-t hmm-f hmm-t score E-value -------- ------- ----- ----- ----- ----- ----- ------- COG1784 1/1 26 49 .. 337 360 .. -1.8 89 COG0159 1/1 70 89 .. 252 271 .] -1.1 99 COG1279 1/1 184 199 .. 215 230 .] -0.8 78 COG0128 1/1 359 378 .. 1 20 [. -0.6 53 COG2851 1/1 526 548 .. 451 473 .] 1.1 23 COG0814 1/1 544 567 .. 196 219 .. -2.2 1e+02 COG3292 1/1 593 612 .. 1 20 [. -2.4 52 COG1928 1/1 592 617 .. 282 307 .. 3.7 1.3 COG1393 1/1 643 657 .. 112 126 .] 0.2 91 COG0080 1/1 644 662 .. 73 93 .. -0.2 67 Alignments of top-scoring domains: COG1784: domain 1 of 1, from 26 to 49: score -1.8, E = 89 *->pvlfliavSstAaiivvlllpvla<-* + ++++av tA+ii++l lp la BAC01259.1 26 HHILFSAVCCTAGIIALLFLPSLA 49 COG0159: domain 1 of 1, from 70 to 89: score -1.1, E = 99 *->ekaleelralvkeLkaglre<-* +++++e+ +++k + a++ e BAC01259.1 70 TEDVQEANRFAKGIEAAIGE 89 COG1279: domain 1 of 1, from 184 to 199: score -0.8, E = 78 *->lAvqLlvdslallskn<-* l+++L+++ ++lls + BAC01259.1 184 LSLALGFSVFSLLSRA 199 COG0128: domain 1 of 1, from 359 to 378: score -0.6, E = 53 *->mdvtvlkgsiveGtvkvPgS<-* ++++v ++ veGt+++++S BAC01259.1 359 WKLDVTVPDYVEGTANLASS 378 COG2851: domain 1 of 1, from 526 to 548: score 1.1, E = 23 *->kWAvgiSlViliiAllaGIipll<-* +WAv++Sl i+ + G+ p+ BAC01259.1 526 FWAVLVSLLPYYISQIPGAMPIQ 548 COG0814: domain 1 of 1, from 544 to 567: score -2.2, E = 1e+02 *->alPiaagggGfwplllmlvlawpl<-* a+Pi+ + ++ + ++++l++++++ BAC01259.1 544 AMPIQYAVIWAVLSITILIILYAM 567 COG3292: domain 1 of 1, from 593 to 612: score -2.4, E = 52 *->MslirlLvfRgvlltalLvL<-* M l f +++l+al ++ BAC01259.1 593 MGLMSIINFATAQLGALILI 612 COG1928: domain 1 of 1, from 592 to 617: score 3.7, E = 1.3 *->flslkqyikhvlaRllgLliiPfliy<-* +++l+++i+++ a+l L++iP +++ BAC01259.1 592 GMGLMSIINFATAQLGALILIPMCLF 617 COG1393: domain 1 of 1, from 643 to 657: score 0.2, E = 91 *->lriGfneeeyleila<-* +++Gf+++++l + + BAC01259.1 643 TVLGFPPAAFLIMKG 657 COG0080: domain 1 of 1, from 644 to 662: score -0.2, E = 67 *->ivKtPPasaLlKKaaGiekGS<-* +++ PPa++L++K G+ kGS BAC01259.1 644 VLGFPPAAFLIMK--GLSKGS 662 //