Promoter predictions for seq : from bold to bold: 1 GAATTGCTGAGAAGTTGCATCAGCAAGAGGGGCAAGGTCAGACAGGATTT 51 GAGGCCGCCAGGGGCACCTAAGCTGTGGGTGCAGAGAATGGGATGCAGAG 101 GTAGAGACAAGAAGGGTAGAGGTGGCAGGTATCCCTGATAAGGGGGAGAC 151 CATGGGTTCATCCTGGGATTCCACCCTCAGAGTCAGATGCCCTTGGGATC 201 ACCGACTAATGCCTCTCCACCTCACTCTGGACTGCCCAAGGAATCTGTCC 251 TGTGCTACCCACAGTGCAGATTGCAACAGGGCTCCTCCTCCCCACCCAGG 301 GCCCTGATTAAGGGGATGGATTGCACACTGTAGTGAGACATCCATCCTGA 351 CCCCACCTCATCAGCCAGGGAGCTCCCTGAAGACAGGCCATCGAGAGAGG 401 CACACAACAGGCTGTGGTCTAAAATAAACTTTTAATTGCACATTTGTGTC 451 TTGGGTTATCTGTGGGGTGAGAAACCTCCTCACTTCCAATCCCAGCTATC 501 TGGGTACAATCTGGTGTTGTGGTCTGGCTGTCGGCGAGGGGTAGAGGTGG 551 GTGCAGGACTGGCCCCCGAGTCTGCACCGACCTCTTCAGGGCAGGGAGCT 601 CAGACCTGGGAAGGGAGGGGCAGCACAGGCTGGTGGTCAGCCTGCAGGTG 651 CAAGAGGAGGGTTGGCCTCTCCCCCTCCCGGCCTGAGTCGGGGTCCACTC 701 ACTCACCATGGTAGTACTTGCACTGCTTCAGGGCCTCGCTGAAGCCCTCA 751 CACAGGGACAGGTCACTCTGAGTGGTGGAACAGTCCAGGAACTGCCTGAT 801 CTCATAGGCGCAGGGCCCCATCTGCAGGGGCTGGGGGGCAGCGGGGGTGG 851 GGGCCTGGGGGTACAGTGCAAGAGGCTGCAGGATCAGCTTGGAGTTGGCA 901 CCTGGAGGTGCGTTTCAAACCTGGGGCCACCTGCCCCTCCCCACATCCCC 951 AGCCTGGGTGTCCCAAGGGTCCTAGGCAGGCTGAGTGATCCAGCAGCAGA 1001 GGCGGCTACAGACAGCATGTTTCTGCTGCCTTTTTTTTTTTTTTTTTTGA 1051 GACAGAGTCTCACTCTGTCACCTAGAGTGGAGTGCAGCGGCGCAGTCTCG 1101 GCTCACTGCAACCTCCGCCTCCCGGGTTCAAGTGGTTGTCCTGCCTCGGC 1151 CTCCCAAGTAGCTGGGGCTACAGGCGCCCGCCACCATGCCCAACAAATTT 1201 TTTGTATTTTTAGTAGAGAGGGGGTTTCACTATGTTGGCCAGGCTGGTAT 1251 CGAACTCCTGAACTCATGATCCACCCGCCTTGGCCTCCCAAAGTGCTGGG 1301 ATTACAGGCATGAGCCACTGCGCCTGGCCTCTGCTGCCTTCTAATGGGCA 1351 CTTTGGGCGGGTGCAGGGGAGGGAGGAGAAGGCTCTAGGAAAGCCCAGTA 1401 CTCTGCCTGTACTCTCAGAGTATGAGCCCCACCCCCACAAGTTCTAGCAC 1451 CTTCCTGGCTCAGCTACACTAATCCTCATGACCCCTGACCTAGCCACAAA 1501 GCCACATTTGGCTTACGGCGGTGGGGTTCCTAGCAGGGGAAAGGTCATCA 1551 GGGAAGTGGCTCCCCTTCGCCCTTGCGCCCAGCGGCCACAGTGCCCAGAC 1601 CTGGGTGTGGGGAGAACAGGGGCTCAGGAAATCCAACATCCCTCCCCTCT 1651 GCTAGGGTGAGGTGACCTTGGGCGAGTCTCTGAGCCCTCAGGGCCTGGGT 1701 TTCCCATCTTTAAAGGGCTGGTTGGTGTCTCTCCCTGGGAGCTTAGGGAC 1751 CCTGGCCTCAAGGGAGAGGCCGGGAAGCCTGCCTCTAAGTGACAGTGAAC 1801 TCAAGACCAGTGGACTAGGACCCTTCCCGGGCAGAGTCCTGGGCTGAAGC 1851 AACGGTGAATACACGCGGACACTCCTCACTGGACACTTGGGCAGCTCCCT 1901 GTGTGGCCTCGAGATAATCCTGCCTCAGTTTCTCTTGGACCCGCTGCTCA 1951 CCTGCTGGACAGCAGGCTGGGAGGGCTCCGAGCTCCCCCCGCTGAAGGCT 2001 CCGGTCAGGGCGCTGCCCATGACGTGTCCCACAGCCGAGCCCACGGCTAC 2051 CCCTGCGGCCGTGGTCGCCATCTGAGCCATGAGCCCCGGCTGGCCCGAAG 2101 GGGCGGGGGCTGGGGCGGCTGCCGAGGGCGGTGGGTGCGCGGGCGGGTGG 2151 GCAGAGGGCGCGGCGGGGCGGCTGCGGGGGTGGGAGGAAGCAGGGTTAAT 2201 CCTGGCCAGACCCCAGGCTGGAGGGCTGCAGCTCCTGGAAACGACCCCCG 2251 GAGAGATGGACGACCCACGTCTCCACACGTGGGTGCTGCACCCCCACCCC 2301 TCCCCCCGCCAAGATGGCGCAGCAGCAGCCAAGGTCACTCTGCGGACGCC 2351 CTTAGGGGAGTGCCCACACTTCCCTAACCCCCTCCCCACAGGGCCCTTGT 2401 CCCCCTCACACCTGGCTGGCCGGGAGGCCGCGCTGCGGCTTCCCCGAGGC 2451 ATGGTGGCGGCGGTGGGACCCGGGCGACCTTAGAGACGGCGGCAGCGGTG 2501 CTGTCGCGGGGACAAATGCCGCAGCGCTTGTCACAGCCGGCGCAAAAAAG 2551 GCGGGGCCCCGGGCGGGGCCTCAGGAACACGCCCCCAGCGGGAGGCGGCA 2601 CTGCCCCCAACCCTATCCCCCTTCTCCCGTCCTCCGGACATCTCCCTTCC 2651 AGCAGCTCCGGGTCTCGGCTGGAAATGGGACCCTGCTCCCTCCCTGCGCC 2701 TGGCAGGCTGTGCGGGTTCTGGGGACAGGGGCCAGTGTCGCAAAGCGCCG 2751 AGGGCTTAAGGAGGGCGACAGTGCCTTCTGTTAGGACCACCGCAGAGGGC 2801 AGGGAGCGGAGTTGGGGGTTGTTGCGAGCCCTGGAGGGGAGAGGAGACGG 2851 GGAGGCGACGGGATGGGGCCAGCTGGGAAGGGGACGCGAGGCTCCAGGCT 2901 GGACTCCGCTCTCTGCCCCCTCCCGGACTCGGCTGTCTGTCCCCTCCCTC 2951 CAGACAGGGTCTGCTGACCACCGCGTGGCCTGGGAGTCTCCGGTGGCCTA 3001 GGGAAGTGAAGCGCGGCCCTGGGGAAGGCCTGGAGCAACCCATCCCCAGA 3051 ACTCCCACGAGGGGGCGTCCCAACCCGTCTTCGACTGTTGGCCAAAATGC 3101 GCTGCCAATGCTGGCAGCCTTACGCAGTGCCCGCGGGGGATATGAGGCCC 3151 CCCGCGCGGCCCTGAACCCCACCGGATTCCCCGGGCCGGCCCGACCGCCC 3201 CCACCTAGTCCCTGGCCCCGCGAGTGCAACCCCCGACACTAACGGCCTTT 3251 ACGCGACATCCGAGCAGCGTGTCTATCCCAAAGGCCTAGGAGCATTTGCC 3301 CGGCTCGGTCAAATCTAGCGCAAGTTTGAAGCCTGCGGCCTCGCAATTTT 3351 AGCAGCTTCGTTCCAGGCCAGGTGAGCTGGAGCGCGGATTTAGAATGCTT 3401 TCCTGCTCTTCGAGAACTTGAGGTTTTAATTTTAATTTTTACTTTTTTTT 3451 TTTTTGACACGGAGCCTTGCTCTGTCGCCCAGGTTGGAGTGCAGTGGCGA 3501 TCTCGGCTCACTGCAACCTCCACCTCCCGGATTCAAGCAATTCTCCTGCC 3551 TCAGCCTCCTGAGTAGCTGGGACTACAGGCTCGTGCCACCACACCTGGCT 3601 AATTTTTGTATTTTTAGTAGAGACGGGGTTTCACCATGTTGGCCAAACTG 3651 GTCTCCAACTCCTGAGCTCATGATCCTCCTACCTCAGCCTCCCAAAGTGC 3701 TGGGATTACAAGCATGAGCCATCGCGCCCGGCTGAGCTTGAGGTTTTTAT 3751 GGTCTTTTCTGGGTGACTTTGCAGTGGTCATATTATTACTCCTAGGTGGG 3801 GGAAGGTAGGGACTGTCCCTCATCACTGCTGTGCCTGCAGAGGACTCCTG 3851 ACACCGGGATTGTCGGCAGGGTGCAACCAAGAGCAAAAGCACCCAGAAGT 3901 GGGGGTGTGGGTCACCAAGTAAGAGAGGAGGCCCCTCCATAAGGAAGGAG 3951 CAGGTCAAGGGGCAAGGGGAGTGGTGGCTGGGGCGTTTGGCCCTGCCTGG 4001 CATGCTCAATTCTGTGTGTGCACAGCTGTCCCCAGCCATGCATGCTCTCC 4051 ATGGAAGTCAGGTGCAGCCCTGTGTTCTCTCACACTGCAGTCCTTTCAGC 4101 CAGCCTGTGAACAGGGCCCTTCCCCGCCCTGTCTTCAGTTTCTCCTCTCT 4151 GAAGGGGGGTGACCACCCACCTTGAGGGTGGCCTCCCAGTGAGGTGAGCA 4201 CACCAAGGGTGGGCCATTCTTTCCTGGCAGACTCACCAGGCCTGGTCCAG 4251 CCGCTACTCCCTTGTTCATGCAGTCCATGCCACCTGGTTCCTGTTTGCCA 4301 CACCAAACAAACTGAACACTTCTCATGGTGGTCCCCGGGCCTAAACTGCA 4351 GAGAGTTGTTACTTCAGCTCCCATGGCTGATGGACTGCAGGTTCCACAAA 4401 GCATGGCCTTGCACCTCGGAGGCACCCAAGACCACTTAAGTGGGTTACAC 4451 AAATGAAAATGAGGCTGAAAGGATGACACTGAAGCAGGCATTGAAATGCC 4501 AACTTTTTTTTTTTTTTTTTTTGAAACGGAGTCTTGCTCTGTCGCTCAGG 4551 CTGGAGTGCAGTGGTATGATCTCCACTCACTGCAACTTCTGCCTCCTGGG 4601 TTCAAGCGATTCTCTTGCCTCAGCCTCCCGAGTATCTGGGACCACAGGCA 4651 CATGCCACCATGCCCCGCTAATTGTTTGTTTTTTGTTGTTTTTTTGTTTT 4701 TGTTTTTGTTTTTTTAAACAGAGTCTCACTCTGTTGCCAGGCTGGAGTGC 4751 GGTGGTGTGATCTCAGCTTACTGCAACCTCCACCTCCCGGGTTCAAGTGA 4801 TTCTCCTGCTTCAGCTTCCCAAGTAGCTGGGACTACAGGCACGTGCCACC 4851 ATGCCCAGCTAATTTTTTTTTGTATTTTTAGTAGAGACAGGGTTTCACCA 4901 TGTTGGCCAGGATGGTCTCTTGTGATCCGCCTGCCTCAGCCTCCCAAAGT 4951 GCTGGGATTACAGGCATGAGCCACCACGCCTGGCCTGAAATGCCTACTTT 5001 TTAAAAATGAGTCACTTTAGCGGGGAATAGTGGCTCATGTCTATATCCCA 5051 GAACTTTGTGAGTCCCAGGCAGGCAGATTGCTTGAGTCCAGGAGTTCGAG 5101 ACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACAAAAAATACAAAAA 5151 TTAGCAGGGCATAGTGATGCACACCGGTAGTCCCAGCTACTTGGGAGGCT 5201 GAGGCAGGAGGATCGTTTGAGCCTGGAAGGCGGAGGTTGCAGTGAGCCAA 5251 GATCACACCATTGAGCAAGACCCTGTCTCAAAATAATAAAGGTAAATAAA 5301 AGTGAGTGACTTTAAAGAAAAGATTGCCTTTCAGGCGCGGTGGCTCACGC 5351 CTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTC 5401 AGGAGTTCAAGACCAGCCTGGCCAACACGGTGAAACCCCATCTCTACTAA 5451 AAATATAAAAAATTAGCTGGGCATGGTGGCGGGCGCCTGTAGTCCCAGCT 5501 ACTCGGGAGACTGAGGCAGGAGAATGGCATGAACCTGGGAGGCGGAGCTT 5551 GCAGTGAGCCGAGATCGCGTCACTGCACTCCAGCCTGGGCGACAGAGAGA 5601 GACTCCGTCTCAAAAAAAAAAAAAAGAAAAGAAAAGATTACGAACAGTGT 5651 AGGTTGTAAAAATTACTGCTGTCTTACTTCTCCTCGCCCTCTAGGAGTCT 5701 CTGTGGAGTCTTCTTGAATAAGCTGTGAAACATTTCCCCACCCGCTTCCC 5751 TTTCTTGGCCCAGGCTTCCTGACCACAGCCTCACCTTTGAGCAGCTCAGA 5801 GCCCTGCCTGCCAGGATGCGAGCCACTGCCTGGATCGTGGCTCTGCAGGG 5851 CCACCCATGATGGAACAGGTCGCCTGGTGAGGTGGTGAGGCCCTTCATCA 5901 TTTCAGGGGGTGTACAGCAGTACTGGGTAACCCTGAAAATCAGGGATGAG 5951 ACTGAAGGATCCCTTCCAGAATGTGGACACAACCCCTCCTTTGAGTCTCA 6001 CCTCCTTTCCTTCTTTTTTTTGAGATGGAGTCTCGCTCTGTCACCCAGGC 6051 TGGAGTGCAATGGAACAATCTCGGCTCACTGCAACCTCCGCCTCCCGGGT 6101 TCAAGTGATTCTCCTGCTTCAGCCTCCTGAGTAGCTGGGCTTACAGGTGT 6151 GTGCCACCACACCTGGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTC 6201 ACCATGTTGGTCAGGCTGCTCTCCAACTCCTGACCTCGTGATCCACCTGC 6251 CTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGCGCCCGGC 6301 CTCTCGCCTCCTTTCTTTCAAGTCTGAGGTGGCTAGCCTGGAGGGGTCCC 6351 CAGGGGCTTGTTGGGCACTGACGAGGGGACACCCTGGGGGCCCAGTACTC 6401 CACTCAGACACCTCCCACAGCCTTCTGACAGCATCCTTCCCACAGCTGAC 6451 CAGCTAGAGGACCTCCTGACCCCTCCACTTGTCTGCTGGTGTGGAAAAGC 6501 CTGGACCCTCCCTCTTGGAGCCTCAGTTTCCCTATCTGTAAACTTCGGTC 6551 TATCCCAAGCTGAAGAACTGGCCAGTCCCTGCCATATGCCTCACTTTCCC 6601 CTGGGACACATTTTAATATCCCTTTCCTGGCCAGGTGCAATGGCTTCCCC 6651 ATGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGCAGATCACTTGAGGTC 6701 AGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAA 6751 AAATACAAAAATTAGCCAGGCATGGTGGCGCACGTCTGTGGTCCCAGCTA 6801 CTTGGGAGGCTGAGGTAGCAGAATCGTTTGAACCTGGGAGGCGGAGGTTG 6851 CAGTGAGTGGAGATCACACCACTGCACTCCTGCCTGGGAGACAGAGTGAG 6901 ACTGTGTCTCAAAAAATAATAATAATAAAAAATAAAATAATATCCCTTTC 6951 CTCACAGGGGCTATTGTGTCATCTTCTAGAAGGATCCGTTGAGGCTCTGA 7001 GGGGTGGGGGAACTTGCTTGTGGGTAGGACCACCTGTCAGAGGTCAGAGG 7051 TCAGGCCACCAAGGAGACCCAGTGGGATGCGCCTTCCAAAGGTGGGGGTA 7101 CGGATGGGACCCATGAAACCTGACTCCTCTCAGACTCTAGCCAAGTCTAA 7151 GACTTTGGACGGCCACCACCCAGAGGAGAAACTGAGACCCAGAGCGGCAC 7201 GGGTTGGCCAGGGTCACCCAGCACCAGATAGGGACTTTGCCAGCCCCGGG 7251 GCAGGACCCTGTCTCCGGCCCTCGACCCCGCTGGGCCGTACCCTCCCCGT 7301 TCACCTCCCCCACCCGGGCCGCGGCTGCTAGGAGAGTTCAGAACAAAAGG 7351 CGGCGGGGGGCGGGGCCGAGGCGGGCCGGGGGTGGGGCGGAAGCTATAAG 7401 GGGCGGCGGCCCGGAGCGGCCCAGCAAGCCCAGCAGCCNCCGGGGCGGAT 7451 GGCTCCGGCCGCCTGGCTCCGCAGCGCGGCCGCGCGCGCCCTCCTGCCCC 7501 CGANTGCTGCTGCTGCTGCTCCAGCCGCCGCCGCTGCTGGCCCGGGCTCT 7551 GCCGCCGGTGAGTGCCCGCCACTCGCCGGCCGCTCCTCGCTGAGGGGGCG 7601 CCGGGCACGCGGGCTGGGCCCAGCGGCGGATCCGGACCGAAGGGGGCGCC 7651 CCGGGTGGCCTCCAGCGCCCGGTACCCGAAACGCTTTCTGGTTCCCTCTA 7701 GGCGTGATAGACAGCGAGCTTGCAGTCCCTGGGGGTGTGAAGGGGAGCCG 7751 GCGCCGGCATCGTTCGGGCTGGTGGGACGGGACTCCACGCTGGACTCACG 7801 CTTGCTCCCAGCGTGGGGACCTGCCTCTCGCGCTCCAGCCGCGGGTGCTG 7851 GAGTGTGCGTTGAAGGAAGCAGCAGAGGGAGTGGTAACAGGGCCCCCTAT 7901 TCATCGCAGGGACAAAGCCGAGCAGATCCCAGGCAGGTGTCAGCCTGCAG 7951 GTGTGTGGCCGCAGTTAGTACACCTCCAGGTGTGCGGTGGGATGAAGGAT 8001 AAAGGGAGAAGGGAGGGCAGCGCTGTGCTGCGGGAAATGGGGTCTAAGCC 8051 TGGAGATGTCCCCCGCGGGCACCTGCCGGCAGCAGTTTTGGGAGGCTTGG 8101 AGCCAGGAGAACATGAGTATGAAATAGTATGAGTGCAGTGTGTGTGTGAC 8151 TGAGAGGTGGCTGTCAGAAAGAAGCGGGGAGAGAAAGGAAAGGGAGAAAA 8201 TGTGTGCTCAGGAGAGGAGAAGAAAGCCCAGGTAAGAGAGGACAGCAGAG 8251 TGAGGAGAGGAGGGACTGTCATGGAGTGTGTGTGACAGCTTGCATGTGTC 8301 CCAAAAGCTGCCCCTTCCCTGGGGCGCTTCCAGGCACCAGTCACAGGGCC 8351 TGGGGCCAGAGCCTGGGCCACTCCACTCCACAGCGGAGGCATCCAGACAG 8401 CTCGGGCAGGGAAGGGAGCAGGAGGGTGTGACAGGCAGGAGTCTCAGGAC 8451 TGGCCTGGTGAGGAGGCAGGCCTGGCCAGGCTGGGACCCTCTGTCCAGCC 8501 ACTCTGGCTCTGCTCAGGGCAGCCTTGTCCTGTGCTGGCTATGGGAGCAG 8551 AAAGGGATGGGATGGGTGGGGTAGAGTGAATGGGGGGCATCTCAGTCTGG 8601 AGTCTTGCCTCCTTCCCAGCCCCTCTTGGTTCCTAGGACTTGGGACAGAG 8651 TCAGGAATCACTGTGGGTAGACATTGAACCACAGGTCTGGAAATTGGAGA 8701 GATCTGGGTTTGAACCATGTGACCCTGGGTGAATCCTGCCATGCCTCAGA 8751 ATCTCACCCACTCCATCTCTAATGGGAGTGCAGGTGGGAGGGGGCAATGG 8801 TGCCTACTGCTGTGTCCACTGTTGAGCAATGAGGTGATGCCAGGCTCACT 8851 AGAAACACTGTCACCTGTAGCTGCCACTTTGACGCTTGTGCATGGTCAGT 8901 GCTGGAGCTGGGGGCCAGCTTGGGGTGGGGTCTGAGCTGGACTGTGGTCT 8951 GTCACTCTGCAAACACGCAGGGAGCATGTGGGGTCACCTCTGATGTGTTT 9001 ATCCCCCGGCTGCCTTCAGCAAGGCTGGGAGAGCTCTGTAAATATTTATC 9051 CAGCCCAGTTCCCAGCTTTCAGGGTTGATGGAAGCCCTGCAGTCCGATGG 9101 ATGGGACGAAGCTACCGTCCCTCCTGGAGCCAGCAGCAGAGGGGTGAGAA 9151 TAGGAAGATTAGGTGGAAGCTTGAGGGAGAAGATATCCCTGCTTGCCACC 9201 TGGCTGTGTGAGTTGGGCAGGTTGCTTGACCTCTCTGAGCCTGCATGGTG 9251 ATGCTCCCGTCTTCCTCCATAGCCTAACAAGTGCCCTCCTCGAAGTCTTG 9301 CCTCCCACCGACATCAGAAGGCATAGCTATGATTAATTATACCCACTAGA 9351 CCAACTGCAAACTGAGGCCCAGAAAGGGGCACAATGAGCCCAGCCTTTCG 9401 CAGTGTTTCCTGGCACCCTGGGGTCCTCCCTTATCTCCCCTCTATCTGGT 9451 GCCTCCCATTTTCTGGATTGCAAGTTGACCCCAGGGCGGGGTTGACTCCG 9501 AGTCTCTAAGCTCTGCCAGGACACTGGGTCAGCTGAGAAATTCCTTGAAT 9551 GTGTCTGCAGCTGAGGTTTGGTGTCTGGCAGAAGAGGGTGGGGCTGAGTG 9601 AGCTAAGAACCCACCACAGGGCAGGCCAAGGGAGGCTACTGGCCAAGACA 9651 GAGGGAATGCACTGGAAGCAGAAATGCTTCTTGGGAAAGTGGGTTTTGGA 9701 GAAAAGCCAAGAGCTGGGGAGACAGGGGGAAGCCCTGGAGGTGGGAGGCA 9751 TGAGCCCCAGGTCTAGGTCCAGCTCTTGCATGACTTGCTGTGTGACCCCG 9801 GGCAAAGTCCTTGAGCTCTCTGGGCTCTGTCTCATCCTCTGGGAAATGGG 9851 GGAGCTGCTTCTTCCTCAAGCTCCCCAGGGGTGTAGATGAGGCCAGTGGT 9901 GGCCTGTGTCTACCACCTCACCTCACTGAATCCTGAGAGTCCTGGGAAGG 9951 CCAAGCCCACCTGTCTCACAAGTAGCAAGCTGCTTCATGCCGTGCCTTGT 10001 AGTTAGGGCATCTGGCCCCAGGCCAGAAAGGACTTGGAAAGATGAAGTGT 10051 GGGAGGACAGTGAGGCAGAATGTGTGTCCTATCCCAGCCATGGGGAAACT 10101 GAGGCCAAGAGCTCATGGGCTCTAAAAAAATAATGGGCTCTAAAAACAAC 10151 CTCACTAGCTAGAGGCCTCATTTCTAGCATCTCCTGCTCACTCAGTTTCT 10201 AAGAATTTAATGAACAAGACAGGGACAGGGAGAAGAGACCTTGATATGTG 10251 GGAGACCCTGCACAAGGCGCTTCTCTTTTCTAGGCCTGAAAGCTCCCTTC 10301 TAGCTGCAACTTCTGTTTTACCATTTAAAGCCTGAAGATTTTGTTGGGTA 10351 GCTAGAGGGGCGGTGGGTGATGGGAACTGTAGGGACTGGCAAGGGCAGAT 10401 TCCCAAGCGTGGGAGGGGAGGGCCTACAGGGACAAGGAACAGGGCCAGGT 10451 AAGAGGTGCTTTTCTGTGCCAGAGGCTACAGCAGGCATTACAGAAGGATG 10501 TCATGTAGCCCTTTGCTGTGGCATCATTCCCATTTGACAGATAGGAAAGT 10551 TGAGGCTAAGAGAGGAAGGTGACTTGTCTAGAGTCTCCATGTCATAGAAC 10601 TAGGCCTTCTGAACCTCAGTGGCTGGCTCTTTTTGTGACACTAAGACTGT 10651 CATGATCCAGGGGTTTGGGCATGGACTGGGTGCCTCATGAGGGAAGGACA 10701 GGAAGGGACAGTGGAAGGGGTGGGGGCATGCCCGTGGCTCTCCCTCCATG 10751 ACTGTCTGCACCAGGACTCACTAGAGGGCAGAGGAGAGAAGAGATTTCTG 10801 ACCATGGGCATAGCAGGGCTGGCAGGCTGGGCTCTGGGTTGTTTGGAGCT 10851 CCCAGGGGAGAGCACAGCTCTGCCCTTTCCAGGGAGGGTCTTCATACCCC 10901 TGCCAGGCTGGGGAAAGGAATCTGTTTCTCCTGTAAGTCAAGGGGGCAGT 10951 GTGGCGAGGAGGAGGAGGCAGGCCCCTCTGGCAGCCCACATCCTCCATGG 11001 GAGAACTTGGATATGTAGCATCTCCACCTGCCTGGTGGATATTGGAAGCC 11051 CAACGAACCTGCCTGCAAAATGCCCGGGAAATAGCAGGCGCTGAATAATT 11101 TGCACCTCACCAAGGTGAGGCCAGCCTGGGTCCCTCCTTCAGGGGAAGGG 11151 TAACTCAACCCCTGCAGAGCAGAGCAGAGGTAGCAGGGAGCTGGGTGGGC 11201 TGTGAGCACAGACAGTCCGCTGCCTGCCAGCTGTTGTCTGATCAAGCTGC 11251 TTAACTCCTCTGGGGCCCATTTCCTCATCCTGGAAATGAGGGTGATGATG 11301 GTGGAGTTGGTGAGAGGTTCCATCAGGGGCTAAGAGCAAGTCTGTGGAGC 11351 TGTTGCCAAGGTCCTAGCCTGCAACCAGTGCTAAGTACTTTTTTTTTTTT 11401 TTTTGAGATGGAGTCTCGCTCTGTCACCCAGGCTGGAGTGCTGTGGCACG 11451 ATCTCGGCTCACTGCAAGCTCTGCCTCCCAGGTTCACGCCATTCTCCTGC 11501 CTCAGCCTCCCGAATAGCTGGGACTACAGGAGCCCGCCACCACGCCTGGC 11551 TAATTTTTTGTATTTTTAGTAGAGACGGGGTTTCACCGTGTTAGCCAGGA 11601 TGGTCTCGATCTCCTGACCTCATGATCCGCCTGCCTCGGCCTCCCAAAGT 11651 GCTGGGATTACAGGCGTGAGCCACCAGGCCCGGCCAACCAGTGCTAAGTA 11701 CTTATTAACAATAAGCCCAGGCCGGGCGCGGTGGCTCACGCCTGTAATCC 11751 CAGTACTTTGGGAGGCCGAGGCAGGTGGATCATGAGGTCAGGAGTTCAAG 11801 ACCAGCCTGGCCAAGATGGATAAACCTCATCTCTACTAAAAATACAAAAA 11851 TTAGCCAGGCGTGAGGCTGGGTGCGGTGGCGCATGCTTGTAATCCCAGCA 11901 CTTTGGGAGGCCGAGGCAGGCGGATCACAAGGTCAGGAGATCGAGACCAT 11951 CCTGGTAACATGGTGAAATCCCGTTTCTACTAAAAATACAAAAAAATTAG 12001 CCGGGCATGGCAGCAAGTGCCTGCAGTCCCAGCTACTCAGGAGGCTGAGT 12051 CAGGAGCATGGCATGAACCCGGAAGGCGGTGGTTGTAGTGAGCCCAGATC 12101 ACACCACTGCACTCCAGCCTGGGCGATAGAGCGAGACTCTCTCTCAAAAA 12151 AAAAAAAAAAAAAATTAACCAGGCGTGGTGGCAGGCACCTGTAACCCCAG 12201 CTACTCGGGAGGCTGAGGCAGAGAATTGCTTGAACCCGGGAGGTGGAGGT 12251 TGTAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGCAACAGAGTG 12301 AGACTCCATCTCAAAAAAACAAAACAAAAAACCAATAAGCCCAGAATCGG 12351 CCAGAACCCACAACCCAGTGAGACTGCGCTGTGCCAGGTAACCATGCAAT 12401 AAGCAAGCTCGGAAATGGGGGGAAGCTGCTTCAGGGCCTCACGCCCTAAG 12451 TAGTTCCATAGCCTCCGTGTTCAGAAGGCCCCCCTTCATGGTGGGGTGTT 12501 CTTGATGCCTCAGCTCTGGGATCAGGAGCAGGGAGCGTTGGGACGCTGAT 12551 CAGATCCCTGGGGTGTATGGAGCCTGGGAGAGCTGCCAAAGGCTGAGGGT 12601 GAGGTGGGGCCTGAGTGGCTGAGCTCCTACCCCAAATATGGCTGTGAGGA 12651 GGCTGCAGTTGCCCAGACCAGACAGGTGCTGAGTCTCTCAGCAAGCAGCA 12701 CAGCTCCATCCCTCTCCTTCAGTGCAGGAAGGACACTTGGCTTCTGTGCG 12751 GTGTCCAGAGCCAGGCCTTAGCCTTAGGCCTGAGCCACCAGAGTCCTGGC 12801 CTGGCCTTGCCATGCCCTTGCTGGGTGACTTTGCAGGGTCTCCAACCTCT 12851 CTGAGCTTCTGCTTTTCTCATGCACAAGAACTGTAACTTCTGCCCTGGAG 12901 ACTTATAGACAGGTAGCAGGATGTAGCTTAGGTCTGAATCGCCATCTGTG 12951 GTCCTGGGGCTTTGGCGAGTGGGGCTGGGCACCTGGTAATTAACTGTCCC 13001 CACCCTCCCCGCTTGAAGAAGGCAGGCAGATCACAGATCAGCTCCCACTG 13051 TACTCCTAGCCCTGGTGGGGTGTGACCAAAACCACCTCTGCTAGAAGCCA 13101 GGCCTCAGTGGCCAGGTGCCTTCCCGGGTGCTGGGCCTGTGCTAGGTGCT 13151 GTACATACCTCACCTTGTCCAGTCCACAAGTCAGTGTCACCCCCAGCAGT 13201 CAGGTTACACTGACTGAGGCCACACTGCAGGGCTACAGCCGCTCCACACT 13251 GGGGAGGAGGTGGGGGAAATCCTGGGCAGGAGAGAGGGCAGATCCTGCTC 13301 ACATGACAGGGACAGGAGCATGACCCAGGTGTGTCTGAACTTAGCGGACA 13351 CAGGAAGAAAGGGAGTTGGAAGCAAGTTTCTGTGGGGAGCAGGAGGAGGT 13401 TGCCTGGTGTTCCTTCGGAGGAAGCTTTTTGGGGTCCATTCCTGGAGTGT 13451 ATGGCTCATAGCCAGTCCCAGTGTGCCCCCACCCCCAGACCTCATTGGCC 13501 TAAGTAGCTGGAGTAGGTGACAGGCAGCCCAGGGCCCTCCACGATGTGGG 13551 GGACAGCTTGATGCCTTGGAACAAGGTGCCAAGAAACCAGAGAGCCAGCC 13601 AGATGCCAAAGGGCCCTGCCATGTGCCGGTGCCCTTTCCCTCTCCATTTG 13651 CCCAGCCACACAGTGGGCTGGGGTTGCACGTGTGTTTGCTGACAGGCCAC 13701 ATCTCTAACTGTGGGCCATGTGGACCTTAGGCCTGACCAGACCCTCATGT 13751 CATCCTCCTGCCTANGGACGCCCACCACCTCCATGCCGAGAGGAGGGGGC 13801 CACAGCCCTGGCATGCAGCCCTGCCCAGTAGCCCGGCACCTGCCCCTGCC 13851 ACGCAGGAAGCCCCCCGGCCTGCCAGCAGCCTCAGGCCTCCCCGCTGTGG 13901 CGTGCCCGACCCATCTGATGGGCTGAGTGCCCGCAACCGACAGAAGAGGT 13951 TCGTGCTTTCTGGCGGGCGCTGGGAGAAGACGGACCTCACCTACAGGNTA 14001 GGGGCCTGGGAGCAGGACACTAGGATGCCACCTGTGTGTC
Proscan: Version 1.7
Processed Sequence: 14040 Base Pairs
Promoter region predicted on forward strand in 2336 to 2586 Promoter Score: 60.58 (Promoter Cutoff = 53.000000) TATA found at 2542, Est.TSS = 2572 Significant Signals: Name TFD # Strand Location Weight AP-2 S01936 + 2379 1.108000 JCV_repeated_sequenc S01193 - 2386 1.658000 UCE.2 S00437 + 2418 1.278000 GCF S01964 - 2433 2.284000 AP-2 S01936 - 2467 1.091000 myosin-specific S00608 - 2478 1.115000 GH-TRE S00193 + 2547 3.911000 Sp1 S00781 + 2550 3.191000 GCF S01964 + 2551 2.361000 T-Ag S00974 + 2553 1.086000 Sp1 S00801 - 2555 3.119000 APRT-mouse_US S00216 - 2557 7.604000 AP-2 S01936 + 2557 1.108000 AP-2 S00180 + 2557 1.863000 Sp1 S00978 + 2561 3.013000 JCV_repeated_sequenc S01193 + 2561 1.427000 Sp1 S00802 - 2566 3.061000 AP-2 S00346 - 2566 1.672000 AP-2 S01936 - 2568 1.091000 EARLY-SEQ1 S01081 - 2568 5.795000 (Sp1) S01187 - 2568 6.819000 Sp1 S00953 - 2569 5.737000 AP-2 S00346 + 2582 1.355000 Promoter region predicted on forward strand in 7130 to 7380 Promoter Score: 55.96 (Promoter Cutoff = 53.000000) TATA found at 7341, Est.TSS = 7371 Significant Signals: Name TFD # Strand Location Weight AP-1 S00090 - 7131 1.052000 UCE.2 S00437 - 7164 1.216000 AP-2 S01936 + 7244 1.108000 T-Ag S00974 + 7248 1.086000 AP-2 S01936 - 7251 1.091000 UCE.2 S00437 - 7270 1.216000 UCE.2 S00437 + 7284 1.278000 AP-2 S00346 + 7306 1.355000 PuF S02016 - 7315 1.391000 JCV_repeated_sequenc S01193 - 7315 1.658000 GCF S01964 - 7322 2.284000 GH-TRE S00193 + 7346 3.911000 Sp1 S00781 + 7352 3.191000 NGFI-C S01813 + 7353 6.885000 (Sp1) S00857 + 7356 4.589000 Sp1 S00064 + 7357 10.681000 Sp1 S01542 + 7357 6.661000 Sp1 S00801 - 7357 3.119000 T-Ag S00974 + 7357 1.086000 Sp1 S00979 + 7357 6.023000 Sp1 S00645 + 7357 12.906000 JCV_repeated_sequenc S01193 + 7358 1.427000 AP-2 S01936 - 7358 1.091000 Sp1 S00978 + 7358 3.013000 GCF S01964 + 7360 2.361000 AP-2 S00346 - 7360 1.672000 Sp1 S00802 - 7363 3.061000 UCE.2 S00437 + 7364 1.278000 EARLY-SEQ1 S01081 - 7365 5.795000 (Sp1) S01187 - 7365 6.819000 APRT-mouse_US S00216 - 7366 7.604000 PuF S02016 + 7380 1.082000 Promoter region predicted on forward strand in 12400 to 12650 Promoter Score: 56.87 (Promoter Cutoff = 53.000000) Significant Signals: Name TFD # Strand Location Weight JCV_repeated_sequenc S01193 + 12417 1.427000 AP-2 S01936 - 12542 1.091000 T-Ag S00974 + 12606 1.086000 APRT-CHO_US S00215 - 12610 1.628000 SRF S00496 + 12632 17.211000 SRF S00492 + 12632 8.606000 (SRF) S00230 + 12632 2.868000 SRF S00495 - 12641 17.211000 SRF S00492 - 12641 5.737000 (SRF) S00230 - 12641 3.155000 Promoter region predicted on reverse strand in 7616 to 7366 Promoter Score: 54.01 (Promoter Cutoff = 53.000000) Significant Signals: Name Strand Location Weight UCE.2 - 7581 1.278000 UCE.2 + 7578 1.216000 Sp1 - 7570 3.191000 Sp1 + 7565 3.119000 T-Ag - 7500 1.086000 GCF - 7485 2.361000 GCF + 7484 2.284000 UCE.2 - 7481 1.278000 UCE.2 + 7478 1.216000 Sp1 - 7448 3.292000 Sp1 + 7443 3.361000 GCF + 7406 2.284000 EARLY-SEQ1 - 7392 6.322000 APRT-CHO_US - 7388 1.860000 HSV_IE_repeat + 7387 1.363000 AP-2 - 7387 1.108000 JCV_repeated_sequenc + 7385 1.658000 ETF - 7382 2.458000 PuF + 7380 1.391000 Sp1 - 7375 2.755000 UCE.2 + 7374 1.216000 Sp1 + 7370 2.772000 APRT-mouse_US - 7366 6.003000 Promoter region predicted on reverse strand in 5593 to 5343 Promoter Score: 68.60 (Promoter Cutoff = 53.000000) Significant Signals: Name Strand Location Weight AP-2 - 5592 1.355000 CREB + 5568 1.147000 Sp1 - 5548 50.000000 Sp1 + 5539 3.608000 Sp1 - 5483 2.755000 Sp1 + 5478 2.772000 NF-GMa - 5439 1.912000 Lymphokine_CS - 5439 2.220000 Sp1 - 5384 2.755000 Sp1 + 5379 2.772000 UCE.2 + 5373 1.216000 Besonders interressant im folgenden die hits um2 und 7 kb : ---Berkley+2 andere Promotor predictersPromoter predictions for 1 eukaryotic sequence with score cutoff 0.80 (transcription start shown in larger font):
Promoter predictions for seq0 :
Start End Score Promoter Sequence
908 958 0.89 TGGAGGTGCGTTTCAAACCTGGGGCCACCTGCCCCTCCCCACATCCCCAG
1702 1752 0.99 GGGTTTCCCATCTTTAAAGGGCTGGTTGGTGTCTCTCCCTGGGAGCTTAG
2507 2557 0.89 TGTCGCGGGGACAAATGCCGCAGCGCTTGTCACAGCCGGCGCAAAAAAGG
2528 2578 0.89 AGCGCTTGTCACAGCCGGCGCAAAAAAGGCGGGGCCCCGGGCGGGGCCTC
2536 2586 1.00 TCACAGCCGGCGCAAAAAAGGCGGGGCCCCGGGCGGGGCCTCAGGAACAC
3085 3135 0.88 TTCGACTGTTGGCCAAAATGCGCTGCCAATGCTGGCAGCCTTACGCAGTG
3126 3176 0.82 TACGCAGTGCCCGCGGGGGATATGAGGCCCCCCGCGCGGCCCTGAACCCC
3134 3184 0.96 GCCCGCGGGGGATATGAGGCCCCCCGCGCGGCCCTGAACCCCACCGGATT
4995 5045 0.96 ATGCCTACTTTTTAAAAATGAGTCACTTTAGCGGGGAATAGTGGCTCATG
5288 5338 0.86 ATAATAAAGGTAAATAAAAGTGAGTGACTTTAAAGAAAAGATTGCCTTTC
5447 5497 0.97 CTCTACTAAAAATATAAAAAATTAGCTGGGCATGGTGGCGGGCGCCTGTA
7336 7386 0.99 GGAGAGTTCAGAACAAAAGGCGGCGGGGGGCGGGGCCGAGGCGGGCCGGG
7386 7436 1.00 GGTGGGGCGGAAGCTATAAGGGGCGGCGGCCCGGAGCGGCCCAGCAAGCC
7877 7927 0.84 GCAGAGGGAGTGGTAACAGGGCCCCCTATTCATCGCAGGGACAAAGCCGA
9034 9084 0.88 GAGAGCTCTGTAAATATTTATCCAGCCCAGTTCCCAGCTTTCAGGGTTGA
9877 9927 0.91 TCCCCAGGGGTGTAGATGAGGCCAGTGGTGGCCTGTGTCTACCACCTCAC
10117 10167 0.89 CTCATGGGCTCTAAAAAAATAATGGGCTCTAAAAACAACCTCACTAGCTA
10133 10183 0.96 AAATAATGGGCTCTAAAAACAACCTCACTAGCTAGAGGCCTCATTTCTAG
11058 11108 0.95 ACGAACCTGCCTGCAAAATGCCCGGGAAATAGCAGGCGCTGAATAATTTG
11984 12034 0.93 ACTAAAAATACAAAAAAATTAGCCGGGCATGGCAGCAAGTGCCTGCAGTC
BLASTN 2.1.1 [Aug-8-2000] Tem6genomic erste 13 000 mit blastn against unigenHuman
to check the probability that the 7 kb promotor is the one
nur ersten zwei hits sind interresant the rest is alu wie aus einem test
gegen alu ermittelt wurde Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= (14,040 letters) Database: Hs_contigs.seq 110,965 sequences; 93,868,674 total letters