Google Ads

Google Ads

Bible Wheel Book

Google Ads

+ Reply to Thread
Page 27 of 37 FirstFirst ... 17232425262728293031 ... LastLast
Results 261 to 270 of 368
  1. #261

    Long Repeating Sequences

    The Crispr database does not seem to include long sequences that repeat. I took the first Archea in their list, and found that it contains a sequence of 306 bases that repeat once.


    GGGGTTATAGTAGGCCCTTCTCACATGCTCCTAGACATTTTTACAGAAAG AGGAATCTACCACAAGGTTAACGGAAGGTGGAGGAGAATAGCTTTAGCAC ACTTCTCTTATGATAACCCCATTGTCAACGGATTAGCAATAATAGCAGGA GTTATAATGCTGTTTGCAGCAATACATAATCACAACTATGATTACTACTA CCAATATTATCATTATTACAACTATTATTCCTAGTGAGATAATATACGAA AAGAGAAATATTTTTAAATACATTTTCTATATCTTTTGTCGTGATTTGTG AGAAGT - occurs at position 458021

    GGGGTTATAGTAGGCCCTTCTCACATGCTCCTAGACATTTTTACAGAAAG AGGAATCTACCACAAGGTTAACGGAAGGTGGAGGAGAATAGCTTTAGCAC ACTTCTCTTATGATAACCCCATTGTCAACGGATTAGCAATAATAGCAGGA GTTATAATGCTGTTTGCAGCAATACATAATCACAACTATGATTACTACTA CCAATATTATCATTATTACAACTATTATTCCTAGTGAGATAATATACGAA AAGAGAAATATTTTTAAATACATTTTCTATATCTTTTGTCGTGATTTGTG AGAAGT - occurs at position 967546

    THis is the longest repeat within the archaea - Acidianus hospitalis

  2. #262
    Join Date
    Jun 2007
    Location
    Yakima, Wa
    Posts
    12,743
    Quote Originally Posted by Craig.Paardekooper View Post
    The Crispr database does not seem to include long sequences that repeat. I took the first Archea in their list, and found that it contains a sequence of 306 bases that repeat once.


    GGGGTTATAGTAGGCCCTTCTCACATGCTCCTAGACATTTTTACAGAAAG AGGAATCTACCACAAGGTTAACGGAAGGTGGAGGAGAATAGCTTTAGCAC ACTTCTCTTATGATAACCCCATTGTCAACGGATTAGCAATAATAGCAGGA GTTATAATGCTGTTTGCAGCAATACATAATCACAACTATGATTACTACTA CCAATATTATCATTATTACAACTATTATTCCTAGTGAGATAATATACGAA AAGAGAAATATTTTTAAATACATTTTCTATATCTTTTGTCGTGATTTGTG AGAAGT - occurs at position 458021

    GGGGTTATAGTAGGCCCTTCTCACATGCTCCTAGACATTTTTACAGAAAG AGGAATCTACCACAAGGTTAACGGAAGGTGGAGGAGAATAGCTTTAGCAC ACTTCTCTTATGATAACCCCATTGTCAACGGATTAGCAATAATAGCAGGA GTTATAATGCTGTTTGCAGCAATACATAATCACAACTATGATTACTACTA CCAATATTATCATTATTACAACTATTATTCCTAGTGAGATAATATACGAA AAGAGAAATATTTTTAAATACATTTTCTATATCTTTTGTCGTGATTTGTG AGAAGT - occurs at position 967546

    THis is the longest repeat within the archaea - Acidianus hospitalis
    I think duplicates like that might be a major force in evolution. If a whole gene accidentally gets duplicated, then the second gene is a free space to add/change code for a similar but different function than the first without disrupting the original functionality.

    ETA: After writing that line I Googled "gene duplication" and found this wiki article:
    Gene duplication is believed to play a major role in evolution; this stance has been held by members of the scientific community for over 100 years.[3] Susumu Ohno was one of the most famous developers of this theory in his classic book Evolution by gene duplication (1970).[4] Ohno argued that gene duplication is the most important evolutionary force since the emergence of the universal common ancestor.[5] Major genome duplication events are not uncommon. It is believed that the entire yeast genome underwent duplication about 100 million years ago.[6] Plants are the most prolific genome duplicators. For example, wheat is hexaploid (a kind of polyploid), meaning that it has six copies of its genome.

    The duplication of a gene results in an additional copy that is free from selective pressure. One kind of view is that this allows the new copy of the gene to mutate without deleterious consequence to the organism. This freedom from consequences allows for the mutation of novel genes that could potentially increase the fitness of the organism or code for a new function. An example of this is the apparent mutation of a duplicated digestive gene in a family of ice fish into an antifreeze gene.

    Another view is that both copies are equally free to accumulate degenerative mutations, so long as any defects are complemented by the other copy. This leads to a neutral "subfunctionalization" or DDC (duplication-degeneration-complementation) model,[7][8] in which the functionality of the original gene is distributed among the two copies.

    The two genes that exist after a gene duplication event are called paralogs and usually code for proteins with a similar function and/or structure. By contrast, orthologous genes are ones which code for proteins with similar functions but exist in different species, and are created from a speciation event. (See Homology of sequences in genetics).
    Makes a lot of sense to me.
    • Skepticism is the antiseptic of the mind.
    • Remember why we debate. We have nothing to lose but the errors we hold. Who but a stubborn fool would hold to errors once they have been exposed?

    Check out my blog site

  3. #263

    results for Aciduliprofundum boonei T469

    Here are the results for Aciduliprofundum boonei T469

    Total = 23
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 977786 - difference = 977786
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 977852 - difference = 66
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 977919 - difference = 67
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 977984 - difference = 65
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 978050 - difference = 66
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 978116 - difference = 66
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 978182 - difference = 66
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 978250 - difference = 68
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983166 - difference = 4916
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983232 - difference = 66
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983297 - difference = 65
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983369 - difference = 72
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983435 - difference = 66
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983502 - difference = 67
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983568 - difference = 66
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983637 - difference = 69
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983706 - difference = 69
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983772 - difference = 66
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983841 - difference = 69
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983912 - difference = 71
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 983978 - difference = 66
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 984042 - difference = 64
    GTTTCCACACCACATGGTACTTCTTCTAC - 0 - 984103 - difference = 61


    And here is the sequence

    GTTTCCACACCACATGGTACTTCTTCTACTGATTGAAAATGATATATGTTTGAGGTGAGAAAAATG
    GTTTCCACACCACATGGTACTTCTTCTACGGAGTTTGGGATGAAATAAATAAAAAAGTGGAGAATAG
    GTTTCCACACCACATGGTACTTCTTCTACGTAATCTACAAAAAGATAAACAATAAATTCGTATTG
    GTTTCCACACCACATGGTACTTCTTCTACTGATTGAAAATGATATATGTTTGAGGTGAGAAAAATG
    GTTTCCACACCACATGGTACTTCTTCTACAGATATGGCAAAGGAAATAAGCAAAGTTATTGAAATT
    GTTTCCACACCACATGGTACTTCTTCTACCTTAACTGATGAAAACATAGATAAAATCGTTGGAAAC
    GTTTCCACACCACATGGTACTTCTTCTACTATCTACAATGAAGATACTGATGAGCTTGTTGCAATTAT
    GTTTCCACACCACATGGTACTTCTTCTACCAAGATTAACAGATATAGTAATACACTGGGTTTATTTAAACATTTTTAGA GCCATTTTGCGTCAAGTCTCCAATAAACCCATGAGTATATAAACTCCTGA CGCAATATAAAGATTTTTTATATTTTTATTCTATTTTTCCCTTTGCAAGT AAAACCTGCTTATTCCTTGTTCTACTGAACACAAAAATCATAAAAATACA AAATATAATAACTTTTATCTTTCAATCTTTATTTTCTTTTAAATACTCCT GACGCAAAATGAAATTTTAGAAGATCATGTCCGTTGTTCCTTTCTCTATC CCTATGTTTATCACTTTCAAGTACTTCTCGCTGGGAAGTACATAGATCGT TACAGAATCATAATCCTCCTCTATAATACCCTTTATTCCGTCCTTTATGG CTTGGAGCTGGCTTTGGGTTATATCACCTCGGAACACGGAATTTTGTATC CAGTGCAGGTAAGAACGCAGAAATTTGTTGACCTTGTTTACCCTTTCTTC TGCAACATCATAAACTATTATCACATGCATTCTACCACCACACCCTAAAT GCCCTGTACCTCTTATCATTCATCACATGCTTCACTAGCTTGTAAGCTTC CAACCTCAAAAGCCCTCTCTGGCTCACGCTCCTTTTTAACCTCCTGTGAT AAACAGTGGAGCTCATTCGGTCTTCCCATTTTCTTATGTATTTTCTCTTT CCTTGCTCATTTAGAAATACACCGTTGAACTCTTTTCTAAAATCATCCTC AGTTATTATACCGTGATTTACTATGTCATGAATGTGTCTATAAACCATCG TGGGCTTAAATACATCTGCAATATCCAAAGATAATGAGAATCTCCTCTCT GATGGTTCGTGGAGGTAACTTATGCTCGGGTTCAAATGCGTGTGATAAAT CTCGGTTAAAACTGCAGAGTATAGAAGAGTGTTCCCAAAGGAGATCAAAG CGTTGAGCTCGTTTATTGGCGGCCTGTAATCTCTACGCTCAAGTTGGAAT TTCTTTAGCGTATAATCTAAGCTCTTGTAGTAGTATCCCCAAATCTGTGC CTCTCTGTTCATAAGCTCCTCTATGCTATCTCCCTCAACTTCCACTCTTT CAATCTTTTCTATTATCTCCTCTAATTCTTTGTTGCTCTTTTTGAGATTT CTAAGGATGTTGTGCTTTATCCCCTCTATCATCTCCTTTGCTATGTACTT TCTCTTCTCCCAAAATATGTAGTGCTTTGCCTGATTTACCACAACCTCAC CAGAGATACTTCTCTCCCTAGGATATAATGTGCCTTCATAATTGCCGTAC ATATCAAAGAAATGCACCACCACGCCTGAGTTGAGCACATGCTTTATGGC ACCAGAGCTGAAACTAACTGGAGCTATACATGTAATGTCCCTTAGGTTGT GAATTGGTATTGATCTCTTCTCTCCCTTTCTAACCAAGTATATGGTGTTT GCCTCCCTCTTTATTATAGCCTCCTTTGTGATGTATAAACTATCCATAAA ATCACGCCCAGCAAAGCTCTAAATACGAGCATTTTTCACATATCTTCCGC CAAACTGGTTTTGGAGGCCTTTCTTTGCCTATCTCAACCTCCATCTCTTT AAGCAAATCTTCAATCTTCTTCTCATTTTCTTCATTCAACTCAACCATTT CCGTGTACTTCTCCTCAGGAACTGCAATCTTGCCTTTAACTTCCACTCCC CTATCCTTCAGCCATTTTAAATAAAACAATAACTGCATTCTGGCTGCGTT GAGCATCTTGGAACTTCTCTTAACCTCGTATATCCATACTTCCCCGTTCT CCTTCCTCGTGAAATCTATGGATACACCCTCACCAAACAGATTCTTTATC TCTCCCTTGTAACTCTTTTCGTGAAGCATCTTTCCTAAGATAATGTACTC ATTTTCCAATCTCACTCCTCTGTAAGAGTACCAAGCGGCCCTCTTGCACA CAACATACTCATGAACCAAGCTTCCTGTAAATTCCATATTAATCACTCAT TGTCTATGGGATTCAAAAACCCGAATCCAAGGGAGTTCTTTTCCCCTATT CCACAGTCCGTTATGAACCTGTAAAACTTGCCATATCCCTTGGGAATCCT CTTTTGAAGCAAGGACCACACTGAGCCTATTACTATGAATGTCTTCTCAT CCTTTTTTACCACCACGGAAACCTCCTTTTGAAATTGAAGCAGGTCAAAA AGGTCCTCGTCCAGCTCCAGCTCTTCATCATAGTACACATTATACTTCTT CACTGCATTATCTTTCAATCTATCCATGAAAAAGCGCATGCTTCCCCCGT CCCTGAATGAAAAGTACTTGCTCTTCTTATTATCCTTTTGAATCACCACT GGGCTTCCACTCCTAAACTTTCCTGTGGTTTTTACCTTGAACTTTTTGAG CTCTGCAACTTTCATTGGGCCATCGGATAGGTAAATATGCTCTAAATCCC TCAATCTTGAGTAAAGCTCACCTATGAACTTGGGATTTGGCGAGGATACT ATCAAGTTTTTCTCTTTATTTGGATAGAAATCTCCGGGAGGGAATAAATC GGAGTATGTGAAGAACTTAAACCCATTTTTAGAGTGCAAATCTGAGAACT CTGTTCCATCAAGGAGCGAGTATATGGCTCCTTGAATCGTATGCTTGTTT ACCTTTGAAAATGGTAAAAATTTCTCTGGCACTAGCTTAATGAGCAATCT CATGCGTGTAATAGAGAAAATATAGTATAAACATTTTGTGTTAAACTTCG CACCTGAGCATCTTTCCGATCTCTAGTCCCATGTCTGTTGGATGAATCCA AGAAGTTTGCTTGTTGTAAACTATTAAATTTGCCTGCTGGTATATCTGTA TGAGGTAGGGTGCCATAGGCTCTTCTACGAGTGATACTCCTCCCGGCTTT AATATTCTCTTCAACCTAGCAATATCCTCTGGAGAGTACGGTATTACAGG AATATTTATGAATTCCAACTTTGCTCCATCTGGAAATAGAAGCCTATCAA ATTTCTCCCGTAGCTCTTTGTACATCTCTATTCTCTTCTCTAAATCCTTC TCGCTGAACCTCATTATGTCCTCCATTATCTGTTGCTGATAATCGTTGAA TGCTCCAATCTCCTTGTTCACTACATGGTAAATGCCAGTCACTCCGAACA TTGAACCAAGAAGAGCTGCCATAACCGTTACCTCTTTTCTACCACCAGCA ACATTCAAAAATATCCTATCTGTGCCTATTTTGAATTTCTCTATGCATAT TGCCTTTGTTATCTCCTTGGCTGCAAGCAGGGCATCTTCCTCCGAAGCTA TGTCATTCTTTGGGAGCACATGCACATGTATTCTCAACTTGGGATAATGA ACCTTGAGAGCTGCCTCAACCAATCTAGTGCCTGCTAGCACGAACTCATC CTTTGTTGGAAGGAGTATTACATCGCTCAGATACTCGCTCTCTAAGAACC TCACCATCTCGGTTATAACCGAGGGTGATTTTCCAACAGGTGCAATCATA GCCACTTTCATTTCCAATCCTCCAGGTACTTTATCAGGGTAGAGCAATCT TTCAATATACTTTTTGTGGAGTTCTTTAACTCTTCCAAATTCAAGCTATC CTTGCTCATACCGCTGTGGGCTATCAAGTTTCTCTTTGATGTTATCAGAT TCCACAGGGTTATCAAATTCACCCCGCTCTTTCTCTCTATCTCATCCACA TTACTTCTAAGGTAATAATAAACTCCGTTCTTCTCTATTTTCTCCCCCCT CTTCTCCATCGCCATACCTGTTATTCCCTTCTCAACTTCAATTCTATTCT CCCTCTCTAGCCATAGGGAATTGTCAAAGTTGGATAGGTAAAAAACGAAG TTTACTATCCATTCTCTCAAATTCTCTAAGCTCTCTCCCAATAGCCCTAT CCCTGTTTGGTACTCTGCAATCAGCAACTGCTTTCTCAGAGTTTCCTTGC TGAGCTTATCGTAGCAATCCTCGCATCCAAATTTTTCAAATATGGTGAAG TCCCTGTAGAGCAAAGTCAATGGCTTAAAGTACTTTTCTAGCTCTTCTTT GACATCTGGAGAGAGTTTAGAGAGATTATATGCGTATTTCATCACATCCC TTATTTGATTTAGGTATATGCCCCTGCTTATCCTCTCTAACATATTTGCG TACTTTTTGGTATATCTCATACTTGGCTTTGATTCCCTAGCGTACATCTG ATTTTGCCACTCTTTCATTATTGAGGATATACCCTCGTTTCTACCGTAAT CCTTGAGATCTCTGGCAGAGTATATCCACTCTGAAAAAACCAGAGCATCG GAGAGGTCAAAAATTGGGACATAATCACCATCTCTGGACTCGTAGGCACC GTAGTAAATTCCGCGAATCTTTATGCCCCTAGCAACCTGAAGGTAGGTCA TTATAACGAAGGATAAGAAAGGTATATGGCGAAAAGCGTGCGTTATATCC AGTATCACTTCATCTCCCTGCTCTAAATTATCGTTGAGTTTGTCTAAAAT CTCCCAAACATCCCTCTCCTCTCTTGGAATGGAAATCTTTATTGCTTGAA TTTTCTCATCTTGCGCTTCCCTTTTCAAACTTTCCAAGTTCTTCATCCCT TCCTCCGTGGTTATTACGAGCCATTTATCTGGATTGAAAAACTCACCGAG GGCGCGAGAAACGAAGGAAGTTTCAAAATAGTTATCCTTCGAGCCGAGGT AATACTTCACCCTTGAATATTTCCCCGTGCCCATAAAGGATACCAAAACT AGCACAAAACCACCCCACTTCTGCAAATTTCTTCCATGCCGTTCTATCCT CTTTTTCTATATTAATATTTTCACAAACCATTCGGTTCTTCTTCTACAGG CTGACATCCAGCACTTTTGGGATTATGCGCAGGGGGT
    GTTTCCACACCACATGGTACTTCTTCTACGGGAGCAAGAAGATTATGACGGGAGGATGATAAGGAT
    GTTTCCACACCACATGGTACTTCTTCTACTCTAAAGCAGAAAATTACAGCAATTCTTGATTCATT
    GTTTCCACACCACATGGTACTTCTTCTACGAAATAAAGAAATACTACGGACTCAAAGGCTACAATAGCCCAG
    GTTTCCACACCACATGGTACTTCTTCTACACTACGATTTACTCAGGGAATTTATGGGAAATATACG
    GTTTCCACACCACATGGTACTTCTTCTACATAGAATAACAAAATACATATCAAAGCACAAGGAATTA
    GTTTCCACACCACATGGTACTTCTTCTACATAAACGAAAATAGTGAATCTTCATGGGAAGATAGAT
    GTTTCCACACCACATGGTACTTCTTCTACTAGATAGACAAATGTTTAATATGTTGTTATCAGATATAAT
    GTTTCCACACCACATGGTACTTCTTCTACTAGATAGACAAATGTTTAATATGTTGTTATCAGATATAAT
    GTTTCCACACCACATGGTACTTCTTCTACGGATATATAACAAAAAAGGATGGCTATAACATATCAG
    GTTTCCACACCACATGGTACTTCTTCTACGAATACAGTCTAATAAACAATGTATTAAAAGAGTTTTATC
    GTTTCCACACCACATGGTACTTCTTCTACTGGAATGAAAGAGGAAGCGGTTAAAACCGCAATTGAGATTTT
    GTTTCCACACCACATGGTACTTCTTCTACAGAATATAATCAGGCGTTAAAAGAATACGAGGATATG
    GTTTCCACACCACATGGTACTTCTTCTACCGTGAGGGACAAGAGAGCGACCGCAAGAGCAATAG
    GTTTCCACACCACATGGTACTTCTTCTACTAGCGTGGGCGTATTGGATGAGATATTGAATA
    GTTTCCACACCACATGGTACTTCTTCTAC

    Here there are 23 repetitions of a 29 letter sequence, and 22 spacers separating the repetitions
    Each cycle approximates to 66
    So the average length of the spacers would be 66 - 29 = 37 letters

    This might be a good place to search for an alphabetical sequence.
    Last edited by Craig.Paardekooper; 08-09-2012 at 01:32 PM.

  4. #264

    Using EMBOSS tool for measuring word counts

    Here is an online tool called EMBOSS that is used to determine word counts.

    http://emboss.bioinformatics.nl/cgi-...boss/wordcount

    Simply -

    1. select the text file comtaining the DNA sequence,
    2. set the word length and
    3. set the minimum word frequency and
    4. click "Run wordcount"

  5. #265

    Another online tool for detecting repeats

    Here is a tool that shows the repeats in a graphic format

    http://arbl.cvmbs.colostate.edu/molk...dot/index.html

    1. Copy and paste the DNA sequence into textbox called DNA Number 1,
    2. then paste it also into textbox named DNA Number 2
    3. then click the "Make Plot" Button

    A Dot PLot is created showing all the repeats in the sequence


    Here is a tool that will search all existing genomes to see if your repeat sequence is found in any of them -

    http://www.girinst.org/censor/index.php
    Last edited by Craig.Paardekooper; 08-01-2012 at 02:43 AM.

  6. #266

    Rakocevics new website

    Here is a link to Rakocevic's website where he outlines many mathematical patterns that he has discovered in DNA -

    http://www.rakocevcode.rs/page.php?3

  7. #267

    Is there a pattern in complimentary repeats also

    Here is a pattern that I found in Elusimicrobium Minutum Archaea when searching for ordinary repeats.

    The longest repeating sequence is 36 bases long (6 x 6)
    Each of the repeats is separated from the previous one by 66 bases
    There are 13 repeats altogether

    There are 12 spacers between these 13 repeats.

    Total length of the 12 spacers = 13 x 30 - 1
    Total length of the 13 repeats = 13 x 6 x 6
    Total length of whole sequence = 13 x 66 - 1

    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC


    Total = 13
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266139 - difference = 266139
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266204 - difference = 65
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266270 - difference = 66
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266336 - difference = 66
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266402 - difference = 66
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266468 - difference = 66
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266534 - difference = 66
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266600 - difference = 66
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266666 - difference = 66
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266732 - difference = 66
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266798 - difference = 66
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266930 - difference = 132
    ATTCTATAAAATCAATTCTCGGAGGGCAACCCTAAC - 0 - 266996 - difference = 66

    Searching for a Complimentary Sequence

    However, it would be interesting to see if the complimentary sequence also occurs in a repeat cycle.

    We obtain the complimentary sequence by converting every A to a T, every C to a G, every G to a C and every T to an A

    Here is a web tool that converts a sequence into it's compliment

    http://www.bioinformatics.org/sms/rev_comp.html

    I searched for the reverse sequence, the complimentary sequence and the reverse-complimentary sequence, but nothing was found. So this particular repeat sequence occurs in the normal forward reading direction only. There wasn't even a single instance of the reverse sequence or compliment or reverse-compliment occurring, let alone a repeat of them.


    Additional Sequences Discovered

    However I did discover several additional independent sequences in Elusimicrobium Minutum in the normal forward reading direction.

    Over the next few days I will outline these additional cycles. Perhaps they form a bigger pattern - "wheels within wheels" - we will soon find out.


    Total = 14
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266150 - difference = 266150
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266215 - difference = 65
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266281 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266347 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266413 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266479 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266545 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266611 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266677 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266743 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266809 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266875 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 266941 - difference = 66
    TCAATTCTCGGAGGGCAACCCTAAC - 0 - 267007 - difference = 66


    Total = 9
    CAATACAGTGAAGACGG - 0 - 586562 - difference = 586562
    CAATACAGTGAAGACGG - 0 - 586646 - difference = 84
    CAATACAGTGAAGACGG - 0 - 587036 - difference = 390
    CAATACAGTGAAGACGG - 0 - 587162 - difference = 126
    CAATACAGTGAAGACGG - 0 - 587288 - difference = 126
    CAATACAGTGAAGACGG - 0 - 588961 - difference = 1673
    CAATACAGTGAAGACGG - 0 - 754692 - difference = 165731
    CAATACAGTGAAGACGG - 0 - 754776 - difference = 84
    CAATACAGTGAAGACGG - 0 - 755070 - difference = 294


    I also found several repeat sequences that were thousands of bases long. The memory on my computer wasn't able to cope, and finding these sequences manually took a long time. So I will have to create some new software that can do the job quickly.
    Last edited by Craig.Paardekooper; 08-03-2012 at 08:38 AM.

  8. #268

    Email from Perez : Information waveforms in DNA

    Dear friends,

    In complement with our 2012 full joint paper with dr Andras Pellionisz Silicon Valley ( https://plus.google.com/103572438711...ts/DPw4yr5peTv

    ) which discuss particularly about the possible links between a fractal nature of human DNA sequence and cancer translocations within chromosomes ,

    I send you attached the first proof/evidence of the FRACTAL NATURE of whole HUMAN GENOME DNA TCAG sequences...

    This constitutes only an unformal draft preliminary to a full paper to be done...

    Evidence of 4 (four) embedded layers of periodic waves structuring dna sequence is demonstrated by the attached graphs.

    These unformal charts show - in both cases of human chromosomes X and Y - the evidence of a complex high level INFORMATION TOPOLOGY overlapping and structuring whole chromosomes...

    These results demonstrate evidence of INFORMATIONS WAVES within DNA sequence

    Meanwhile

    the PHYSICAL (electro-magnetic) nature associated with these waves remain an open question to be discussed (Montagniere et al:

    http://arxiv.org/abs/1012.5166

    http://arxiv.org/pdf/1012.5166v1.pdf

    http://www.21stcenturysciencetech.co...Montagnier.pdf


    copy for information Pr Luc Montagnier (Mrs Christine Restif FMPRS UNESCO Paris and Suzanne McDonnell Shangai)

    and

    J. R. Fourtou, President Foundation Bordeaux University

  9. #269
    Here is an analysis of the repeats in Acidianus Hospitalis

    There are 62 repeats, split into 3 sections by large areas of code where there are no repeats. The number of repeats in each section are 26, 26 and 10 repeats respectively.

    The repeat itself is 22 bases long, and the spacer between each repeat averages 42 bases long

    Total = 62
    TGCATCCCAAAAGGGATTGAAA - 0 - 354894 - difference = 354894
    TGCATCCCAAAAGGGATTGAAA - 0 - 354958 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 355021 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 355086 - difference = 65
    TGCATCCCAAAAGGGATTGAAA - 0 - 355148 - difference = 62
    TGCATCCCAAAAGGGATTGAAA - 0 - 355210 - difference = 62
    TGCATCCCAAAAGGGATTGAAA - 0 - 355274 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 355338 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 355403 - difference = 65
    TGCATCCCAAAAGGGATTGAAA - 0 - 355467 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 355534 - difference = 67
    TGCATCCCAAAAGGGATTGAAA - 0 - 355598 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 355662 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 355724 - difference = 62
    TGCATCCCAAAAGGGATTGAAA - 0 - 355786 - difference = 62
    TGCATCCCAAAAGGGATTGAAA - 0 - 355849 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 355912 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 355975 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 356039 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 356103 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 356167 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 356230 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 356295 - difference = 65
    TGCATCCCAAAAGGGATTGAAA - 0 - 356359 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 356423 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 356486 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 356613 - difference = 127
    TGCATCCCAAAAGGGATTGAAA - 0 - 356676 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 356741 - difference = 65
    TGCATCCCAAAAGGGATTGAAA - 0 - 356805 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 356871 - difference = 66
    TGCATCCCAAAAGGGATTGAAA - 0 - 356935 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 357002 - difference = 67
    TGCATCCCAAAAGGGATTGAAA - 0 - 357065 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 357128 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 357193 - difference = 65
    TGCATCCCAAAAGGGATTGAAA - 0 - 357257 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 357322 - difference = 65
    TGCATCCCAAAAGGGATTGAAA - 0 - 357385 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 357448 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 357511 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 357573 - difference = 62
    TGCATCCCAAAAGGGATTGAAA - 0 - 357636 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 357698 - difference = 62
    TGCATCCCAAAAGGGATTGAAA - 0 - 357762 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 357827 - difference = 65
    TGCATCCCAAAAGGGATTGAAA - 0 - 357892 - difference = 65
    TGCATCCCAAAAGGGATTGAAA - 0 - 357956 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 358019 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 358084 - difference = 65
    TGCATCCCAAAAGGGATTGAAA - 0 - 358147 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 358211 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 378319 - difference = 20108
    TGCATCCCAAAAGGGATTGAAA - 0 - 378395 - difference = 76
    TGCATCCCAAAAGGGATTGAAA - 0 - 378458 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 378521 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 378584 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 378647 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 378711 - difference = 64
    TGCATCCCAAAAGGGATTGAAA - 0 - 378774 - difference = 63
    TGCATCCCAAAAGGGATTGAAA - 0 - 378840 - difference = 66
    TGCATCCCAAAAGGGATTGAAA - 0 - 383406 - difference = 4566

    The first 26 repeats are all followed by T, and the spacer ends with GT

    The second group of 26 repeats are all followed by G, and the spacer ends with GT

    The final group of 10 repeats are all followed by G, and the spacer ends with GT



    TGCATCCCAAAAGGGATTGAAATTATAGCAATATTCCTCATTGTTTTAACACCTTCTTAGCAGT
    TGCATCCCAAAAGGGATTGAAATACTTCTATGAAACTATTGCTACCCCATAGGTCATACTTGT
    TGCATCCCAAAAGGGATTGAAATTTGCATAAACACCGCCTGCCAGTATGAGGTTATTAGCCAGGT
    TGCATCCCAAAAGGGATTGAAATCCTATAGTGAAATAGTAATTTGCTATTTTACTCAATTGT
    TGCATCCCAAAAGGGATTGAAATATATTGGAAGGGGCTTTGTCTTACCCAATCTTTAGAAGT
    TGCATCCCAAAAGGGATTGAAATTTGTTCCGCAAGTACGTTTGTCTTGCCTTCTTATATTACGT
    TGCATCCCAAAAGGGATTGAAATTAATTATTGAATTATCGCGGTCAAAATGAACGGTAATATGT
    TGCATCCCAAAAGGGATTGAAATAGAACAATTTCTCATCCTATCGGTTCCCGTTAAAATTTAAGT
    TGCATCCCAAAAGGGATTGAAATGGTTTTTTCACTTCTTCAGGAGTAGATAAAAAAATCTCCGT
    TGCATCCCAAAAGGGATTGAAATTTGAATTTCTCACCAGCGTCCTTAAGATTCATCGCTCTCATCGT
    TGCATCCCAAAAGGGATTGAAATAGACACTATATCCTTTTATTGAAAATATCGTCATTATGTGT
    TGCATCCCAAAAGGGATTGAAATCAAAGCACGCAAAGTCACTAATAGACATACCACTACCCAGT
    TGCATCCCAAAAGGGATTGAAATTCCCTAAGATTTTTAGCAATAAATTAAGGTATGTTGAGT
    TGCATCCCAAAAGGGATTGAAATTCGGTAGCGTCTATAGCAAAGCACATTCCATCAGAAGGT
    TGCATCCCAAAAGGGATTGAAATTAACGTATTGGTCAGGGCTAAGAACTTTCCTATATGCAGT
    TGCATCCCAAAAGGGATTGAAATAAGTTCGACCCGTACGCGGTATTCGGGAGGACATTTGCGT
    TGCATCCCAAAAGGGATTGAAATTTAACAGCATGCTTGTGATAGAATTGTTATGTTTTGCAGT
    TGCATCCCAAAAGGGATTGAAATAATAAAATCGGAATAGCAAAACCTATTGCACTCAATTCTGT
    TGCATCCCAAAAGGGATTGAAATGGTACTATCTCTACTTGGTCGCATACAGCGTTTATATCTGT
    TGCATCCCAAAAGGGATTGAAATTCTAATCTCTTCAAGAAGTTTCATCTTCACGCTTTCTGGGT
    TGCATCCCAAAAGGGATTGAAATATGAACTCCCAGACCTTGACCTTAACACGTGCGTAGTCGT
    TGCATCCCAAAAGGGATTGAAATAGACAAGAACAATGGCATTGCCAACTTCATATAGGTAAACGT
    TGCATCCCAAAAGGGATTGAAATATGACGGTCTCAGATAACGGCTGTATCGTCACAACAGCAGT
    TGCATCCCAAAAGGGATTGAAATATCTTAGCTTCAGCGTTATTTACAACGATTTTTATTTTCGT
    TGCATCCCAAAAGGGATTGAAATTAGAAAGCTACGGGATCCCCATGGAGCTAAGAGGAATCGT
    TGCATCCCAAAAGGGATTGAAAGATAACTCTACAATATTCCATTACACATTGACTTATCATGTTGCATCCCA AAAAGGGATTGAAGTAAAAAAATGTAATATCTATTCAGACCCATTGAAAT AGTGT
    TGCATCCCAAAAGGGATTGAAAGATATTGTTGAAAAATAACGGAAAACCTCTTCAGAATATGT
    TGCATCCCAAAAGGGATTGAAAGAAAAGGTGATGCGAATGGATCCCCTAAATAAATCTAAATCGT
    TGCATCCCAAAAGGGATTGAAAGATACGAACTGTTTACAAGGTTAATTAGAGAATACAACAAGT
    TGCATCCCAAAAGGGATTGAAAGAGATAGGAAACCTTGGTCTTGCTCCATAAAGTTCTGTGTCTGT
    TGCATCCCAAAAGGGATTGAAAGCTCTCTCAAGATCGGTATATATCTTTTTGACCAAGCAAAGT
    TGCATCCCAAAAGGGATTGAAAGTTGATGTAATGAAGTCAACGACTTTTTTGGCAACTGAAAATCGT
    TGCATCCCAAAAGGGATTGAAAGAATACCAATGAGTGAGTGAAATAGAATATATCTCCTCTGT
    TGCATCCCAAAAGGGATTGAAAGTGAAGGAATTACGTCGGTTTAAGCAAATATACGTGATGGT
    TGCATCCCAAAAGGGATTGAAAGTTATAACAACCAGTAAAGACATTCTCTATTATTTTTGAAAGT
    TGCATCCCAAAAGGGATTGAAAGTAAACCTCGACCTCCTGGGTTTAACAGTGCTGTTCTCAAGT
    TGCATCCCAAAAGGGATTGAAAGAGTTTCCTCATATACGTAATCATATTCCGCCTCTAAGTCCGT
    TGCATCCCAAAAGGGATTGAAAGTTAATTATAACATCTTCATTCTCTTCCTCATCATACCCGT
    TGCATCCCAAAAGGGATTGAAAGACTATTACTTTCGTTAAATTCTACTGCAGTAAGCTATCGT
    TGCATCCCAAAAGGGATTGAAAGGCTTTTTCGATTTGCTTTTTAATCAATTCAACAAAGGTGT
    TGCATCCCAAAAGGGATTGAAAGTAGAAGCGTGAAGAAAATCGTTGTAATTGATGAGGCTGT
    TGCATCCCAAAAGGGATTGAAAGGGATTTAATTTATTTGGATATATGACTACTAATATATCGT
    TGCATCCCAAAAGGGATTGAAAGTGTGAGAAATTGAGTATAAGTTGTACAACGCGATTAAGT
    TGCATCCCAAAAGGGATTGAAAGGTGTAGTTCTCTGGGTCAGAGCAATATTCGCCAAATTTTGT
    TGCATCCCAAAAGGGATTGAAAGTTTTCACTATAGAAAAGTACTGGATAGCCTTGAACTTCATGT
    TGCATCCCAAAAGGGATTGAAAGACTGTTGTTTAGCCTCTTCCTCCTTCTGAGCCCATTCCTCGT
    TGCATCCCAAAAGGGATTGAAAGCTATACTTCCGTTTTCATCTTTAATCCTACTTCTAACCCGT
    TGCATCCCAAAAGGGATTGAAAGATAACTTTACAAGTTCTTCTAAATTCTTTTGAGTCTGCGT
    TGCATCCCAAAAGGGATTGAAAGTTTATAGCTATTCTGTTAACCTTTATGTTGGTGTTGCACAGT
    TGCATCCCAAAAGGGATTGAAAGACCTATTTGCGTTAATAGAAAGGAGAAATTCCCGAATTGT
    TGCATCCCAAAAGGGATTGAAAGTTACAGTAACGTTTATTGTTAATTCGTATTTATTGTCATGT
    TGCATCCCAAAAGGGATTGAAA
    area of no repeats
    TGCATCCCAAAAGGGATTGAAAGCCGCTTAATATCACAATTGTTCCCAAATTTTCGACTTCAAATTCATAAC TTGT
    TGCATCCCAAAAGGGATTGAAAGCCGCTTAATATCTTAGTATTAACTGAACCTCTGTAACCGT
    TGCATCCCAAAAGGGATTGAAAGGAAATAGACACTGAATAAGTATTGTCTGTTGTCGGTTAGT
    TGCATCCCAAAAGGGATTGAAAGAATATATAGAAGAGTGGTACCAGTCCTAGTAATGTGGCGT
    TGCATCCCAAAAGGGATTGAAAGTGGGATTGGGTTTCACTCACACCCATTCACCCCGATTTGT
    TGCATCCCAAAAGGGATTGAAAGATTTTTATTGTGACATTAGATAGCGTAATACTGTAATGAGT
    TGCATCCCAAAAGGGATTGAAAGGCAGCTTTTGCAGCAGTAGCTTGGGATAATCGCAGTTTGT
    TGCATCCCAAAAGGGATTGAAAGAATTCAATTTTACCTAACTCTCCTATTTTGACCTCTCTTTTGT
    TGCATCCCAAAAGGGATTGAAA
    are of no repeats
    TGCATCCCAAAAGGGATTGAAA
    Last edited by Craig.Paardekooper; 08-10-2012 at 09:48 AM.

  10. #270

    Software program that can isolate all the repeats in a genome

    Finally, I have managed to create a software program that can isolate all the repeats in a genome as long as you specify the length of the repeat and the frequency range of the repeat

    I tested it out on Acidianus hospitalis for all repeats of length 22 and frequency between 10 and 60

    CATCCCAAAAGGGATTGAAATA - 10
    GAAATAAGGAAGAACTGAAAGA - 10
    TGTTGAAATAAGGAAGAACTGA - 10

    CATCCCAAAAGGGATTGAAATA POSIT: 354960 DIFF : 354960
    CATCCCAAAAGGGATTGAAATA POSIT: 355150 DIFF : 190
    CATCCCAAAAGGGATTGAAATA POSIT: 355340 DIFF : 190
    CATCCCAAAAGGGATTGAAATA POSIT: 355536 DIFF : 196
    CATCCCAAAAGGGATTGAAATA POSIT: 355851 DIFF : 315
    CATCCCAAAAGGGATTGAAATA POSIT: 355977 DIFF : 126
    CATCCCAAAAGGGATTGAAATA POSIT: 356169 DIFF : 192
    CATCCCAAAAGGGATTGAAATA POSIT: 356232 DIFF : 63
    CATCCCAAAAGGGATTGAAATA POSIT: 356297 DIFF : 65
    CATCCCAAAAGGGATTGAAATA POSIT: 356361 DIFF : 64

    GAAATAAGGAAGAACTGAAAGA POSIT: 1563015 DIFF : 1206654
    GAAATAAGGAAGAACTGAAAGA POSIT: 1563190 DIFF : 175
    GAAATAAGGAAGAACTGAAAGA POSIT: 1563543 DIFF : 353
    GAAATAAGGAAGAACTGAAAGA POSIT: 1563601 DIFF : 58
    GAAATAAGGAAGAACTGAAAGA POSIT: 1564012 DIFF : 411
    GAAATAAGGAAGAACTGAAAGA POSIT: 1564188 DIFF : 176
    GAAATAAGGAAGAACTGAAAGA POSIT: 1564305 DIFF : 117
    GAAATAAGGAAGAACTGAAAGA POSIT: 1564537 DIFF : 232
    GAAATAAGGAAGAACTGAAAGA POSIT: 1564653 DIFF : 116
    GAAATAAGGAAGAACTGAAAGA POSIT: 1565004 DIFF : 351

    TGTTGAAATAAGGAAGAACTGA POSIT: 1563068 DIFF : -1936
    TGTTGAAATAAGGAAGAACTGA POSIT: 1563422 DIFF : 354
    TGTTGAAATAAGGAAGAACTGA POSIT: 1563714 DIFF : 292
    TGTTGAAATAAGGAAGAACTGA POSIT: 1563830 DIFF : 116
    TGTTGAAATAAGGAAGAACTGA POSIT: 1563888 DIFF : 58
    TGTTGAAATAAGGAAGAACTGA POSIT: 1564008 DIFF : 120
    TGTTGAAATAAGGAAGAACTGA POSIT: 1564184 DIFF : 176
    TGTTGAAATAAGGAAGAACTGA POSIT: 1564243 DIFF : 59
    TGTTGAAATAAGGAAGAACTGA POSIT: 1564766 DIFF : 523
    TGTTGAAATAAGGAAGAACTGA POSIT: 1564824 DIFF : 58


    CATCCCAAAAGGGATTGAAAGT - 11
    CATCCCAAAAGGGATTGAAATT - 11
    CGTTGAAATAAGGAAGAACTGA - 11

    CATCCCAAAAGGGATTGAAAGT POSIT: 356937 DIFF : -1207887
    CATCCCAAAAGGGATTGAAAGT POSIT: 357067 DIFF : 130
    CATCCCAAAAGGGATTGAAAGT POSIT: 357130 DIFF : 63
    CATCCCAAAAGGGATTGAAAGT POSIT: 357195 DIFF : 65
    CATCCCAAAAGGGATTGAAAGT POSIT: 357324 DIFF : 129
    CATCCCAAAAGGGATTGAAAGT POSIT: 357513 DIFF : 189
    CATCCCAAAAGGGATTGAAAGT POSIT: 357638 DIFF : 125
    CATCCCAAAAGGGATTGAAAGT POSIT: 357764 DIFF : 126
    CATCCCAAAAGGGATTGAAAGT POSIT: 358021 DIFF : 257
    CATCCCAAAAGGGATTGAAAGT POSIT: 358149 DIFF : 128
    CATCCCAAAAGGGATTGAAAGT POSIT: 378586 DIFF : 20437

    CATCCCAAAAGGGATTGAAATT POSIT: 354896 DIFF : -23690
    CATCCCAAAAGGGATTGAAATT POSIT: 355023 DIFF : 127
    CATCCCAAAAGGGATTGAAATT POSIT: 355212 DIFF : 189
    CATCCCAAAAGGGATTGAAATT POSIT: 355276 DIFF : 64
    CATCCCAAAAGGGATTGAAATT POSIT: 355469 DIFF : 193
    CATCCCAAAAGGGATTGAAATT POSIT: 355664 DIFF : 195
    CATCCCAAAAGGGATTGAAATT POSIT: 355726 DIFF : 62
    CATCCCAAAAGGGATTGAAATT POSIT: 355788 DIFF : 62
    CATCCCAAAAGGGATTGAAATT POSIT: 355914 DIFF : 126
    CATCCCAAAAGGGATTGAAATT POSIT: 356105 DIFF : 191
    CATCCCAAAAGGGATTGAAATT POSIT: 356425 DIFF : 320

    CGTTGAAATAAGGAAGAACTGA POSIT: 1562953 DIFF : 1206528
    CGTTGAAATAAGGAAGAACTGA POSIT: 1563126 DIFF : 173
    CGTTGAAATAAGGAAGAACTGA POSIT: 1563246 DIFF : 120
    CGTTGAAATAAGGAAGAACTGA POSIT: 1563306 DIFF : 60
    CGTTGAAATAAGGAAGAACTGA POSIT: 1563364 DIFF : 58
    CGTTGAAATAAGGAAGAACTGA POSIT: 1563772 DIFF : 408
    CGTTGAAATAAGGAAGAACTGA POSIT: 1564124 DIFF : 352
    CGTTGAAATAAGGAAGAACTGA POSIT: 1564417 DIFF : 293
    CGTTGAAATAAGGAAGAACTGA POSIT: 1564475 DIFF : 58
    CGTTGAAATAAGGAAGAACTGA POSIT: 1564707 DIFF : 232
    CGTTGAAATAAGGAAGAACTGA POSIT: 1564882 DIFF : 175


    AGTTGAAATAAGGAAGAACTGA - 12
    CTTTCAATCCCTTTTGGGATGC - 12

    AGTTGAAATAAGGAAGAACTGA POSIT: 1563011 DIFF : -1871
    AGTTGAAATAAGGAAGAACTGA POSIT: 1563186 DIFF : 175
    AGTTGAAATAAGGAAGAACTGA POSIT: 1563539 DIFF : 353
    AGTTGAAATAAGGAAGAACTGA POSIT: 1563597 DIFF : 58
    AGTTGAAATAAGGAAGAACTGA POSIT: 1563654 DIFF : 57
    AGTTGAAATAAGGAAGAACTGA POSIT: 1564065 DIFF : 411
    AGTTGAAATAAGGAAGAACTGA POSIT: 1564359 DIFF : 294
    AGTTGAAATAAGGAAGAACTGA POSIT: 1564533 DIFF : 174
    AGTTGAAATAAGGAAGAACTGA POSIT: 1564591 DIFF : 58
    AGTTGAAATAAGGAAGAACTGA POSIT: 1564649 DIFF : 58
    AGTTGAAATAAGGAAGAACTGA POSIT: 1564942 DIFF : 293
    AGTTGAAATAAGGAAGAACTGA POSIT: 1565000 DIFF : 58

    CTTTCAATCCCTTTTGGGATGC POSIT: 371297 DIFF : -1193703
    CTTTCAATCCCTTTTGGGATGC POSIT: 371360 DIFF : 63
    CTTTCAATCCCTTTTGGGATGC POSIT: 371423 DIFF : 63
    CTTTCAATCCCTTTTGGGATGC POSIT: 371487 DIFF : 64
    CTTTCAATCCCTTTTGGGATGC POSIT: 371551 DIFF : 64
    CTTTCAATCCCTTTTGGGATGC POSIT: 371616 DIFF : 65
    CTTTCAATCCCTTTTGGGATGC POSIT: 371679 DIFF : 63
    CTTTCAATCCCTTTTGGGATGC POSIT: 371743 DIFF : 64
    CTTTCAATCCCTTTTGGGATGC POSIT: 371818 DIFF : 75
    CTTTCAATCCCTTTTGGGATGC POSIT: 371882 DIFF : 64
    CTTTCAATCCCTTTTGGGATGC POSIT: 371946 DIFF : 64
    CTTTCAATCCCTTTTGGGATGC POSIT: 372010 DIFF : 64


    GAAATAAGGAAGAACTGAAAGT - 13
    TCAATCCCTTTTGGGATGCAAC - 13
    TTCAATCCCTTTTGGGATGCAA - 13
    TTTCAATCCCTTTTGGGATGCA - 13

    GAAATAAGGAAGAACTGAAAGT POSIT: 1562891 DIFF : 1190881
    GAAATAAGGAAGAACTGAAAGT POSIT: 1562957 DIFF : 66
    GAAATAAGGAAGAACTGAAAGT POSIT: 1563072 DIFF : 115
    GAAATAAGGAAGAACTGAAAGT POSIT: 1563310 DIFF : 238
    GAAATAAGGAAGAACTGAAAGT POSIT: 1563485 DIFF : 175
    GAAATAAGGAAGAACTGAAAGT POSIT: 1563718 DIFF : 233
    GAAATAAGGAAGAACTGAAAGT POSIT: 1563776 DIFF : 58
    GAAATAAGGAAGAACTGAAAGT POSIT: 1563892 DIFF : 116
    GAAATAAGGAAGAACTGAAAGT POSIT: 1563951 DIFF : 59
    GAAATAAGGAAGAACTGAAAGT POSIT: 1564247 DIFF : 296
    GAAATAAGGAAGAACTGAAAGT POSIT: 1564595 DIFF : 348
    GAAATAAGGAAGAACTGAAAGT POSIT: 1564711 DIFF : 116
    GAAATAAGGAAGAACTGAAAGT POSIT: 1564770 DIFF : 59

    TCAATCCCTTTTGGGATGCAAC POSIT: 371236 DIFF : -1193534
    TCAATCCCTTTTGGGATGCAAC POSIT: 371300 DIFF : 64
    TCAATCCCTTTTGGGATGCAAC POSIT: 371363 DIFF : 63
    TCAATCCCTTTTGGGATGCAAC POSIT: 371426 DIFF : 63
    TCAATCCCTTTTGGGATGCAAC POSIT: 371490 DIFF : 64
    TCAATCCCTTTTGGGATGCAAC POSIT: 371554 DIFF : 64
    TCAATCCCTTTTGGGATGCAAC POSIT: 371619 DIFF : 65
    TCAATCCCTTTTGGGATGCAAC POSIT: 371682 DIFF : 63
    TCAATCCCTTTTGGGATGCAAC POSIT: 371746 DIFF : 64
    TCAATCCCTTTTGGGATGCAAC POSIT: 371821 DIFF : 75
    TCAATCCCTTTTGGGATGCAAC POSIT: 371885 DIFF : 64
    TCAATCCCTTTTGGGATGCAAC POSIT: 371949 DIFF : 64
    TCAATCCCTTTTGGGATGCAAC POSIT: 372013 DIFF : 64

    TTCAATCCCTTTTGGGATGCAA POSIT: 371235 DIFF : -778
    TTCAATCCCTTTTGGGATGCAA POSIT: 371299 DIFF : 64
    TTCAATCCCTTTTGGGATGCAA POSIT: 371362 DIFF : 63
    TTCAATCCCTTTTGGGATGCAA POSIT: 371425 DIFF : 63
    TTCAATCCCTTTTGGGATGCAA POSIT: 371489 DIFF : 64
    TTCAATCCCTTTTGGGATGCAA POSIT: 371553 DIFF : 64
    TTCAATCCCTTTTGGGATGCAA POSIT: 371618 DIFF : 65
    TTCAATCCCTTTTGGGATGCAA POSIT: 371681 DIFF : 63
    TTCAATCCCTTTTGGGATGCAA POSIT: 371745 DIFF : 64
    TTCAATCCCTTTTGGGATGCAA POSIT: 371820 DIFF : 75
    TTCAATCCCTTTTGGGATGCAA POSIT: 371884 DIFF : 64
    TTCAATCCCTTTTGGGATGCAA POSIT: 371948 DIFF : 64
    TTCAATCCCTTTTGGGATGCAA POSIT: 372012 DIFF : 64

    TTTCAATCCCTTTTGGGATGCA POSIT: 371234 DIFF : -778
    TTTCAATCCCTTTTGGGATGCA POSIT: 371298 DIFF : 64
    TTTCAATCCCTTTTGGGATGCA POSIT: 371361 DIFF : 63
    TTTCAATCCCTTTTGGGATGCA POSIT: 371424 DIFF : 63
    TTTCAATCCCTTTTGGGATGCA POSIT: 371488 DIFF : 64
    TTTCAATCCCTTTTGGGATGCA POSIT: 371552 DIFF : 64
    TTTCAATCCCTTTTGGGATGCA POSIT: 371617 DIFF : 65
    TTTCAATCCCTTTTGGGATGCA POSIT: 371680 DIFF : 63
    TTTCAATCCCTTTTGGGATGCA POSIT: 371744 DIFF : 64
    TTTCAATCCCTTTTGGGATGCA POSIT: 371819 DIFF : 75
    TTTCAATCCCTTTTGGGATGCA POSIT: 371883 DIFF : 64
    TTTCAATCCCTTTTGGGATGCA POSIT: 371947 DIFF : 64
    TTTCAATCCCTTTTGGGATGCA POSIT: 372011 DIFF : 64


    CATCCCAAAAGGGATTGAAAGA - 17

    CATCCCAAAAGGGATTGAAAGA POSIT: 356488 DIFF : -15523
    CATCCCAAAAGGGATTGAAAGA POSIT: 356615 DIFF : 127
    CATCCCAAAAGGGATTGAAAGA POSIT: 356678 DIFF : 63
    CATCCCAAAAGGGATTGAAAGA POSIT: 356743 DIFF : 65
    CATCCCAAAAGGGATTGAAAGA POSIT: 356807 DIFF : 64
    CATCCCAAAAGGGATTGAAAGA POSIT: 357004 DIFF : 197
    CATCCCAAAAGGGATTGAAAGA POSIT: 357259 DIFF : 255
    CATCCCAAAAGGGATTGAAAGA POSIT: 357387 DIFF : 128
    CATCCCAAAAGGGATTGAAAGA POSIT: 357829 DIFF : 442
    CATCCCAAAAGGGATTGAAAGA POSIT: 357958 DIFF : 129
    CATCCCAAAAGGGATTGAAAGA POSIT: 358086 DIFF : 128
    CATCCCAAAAGGGATTGAAAGA POSIT: 358213 DIFF : 127
    CATCCCAAAAGGGATTGAAAGA POSIT: 378523 DIFF : 20310
    CATCCCAAAAGGGATTGAAAGA POSIT: 378649 DIFF : 126
    CATCCCAAAAGGGATTGAAAGA POSIT: 378776 DIFF : 127
    CATCCCAAAAGGGATTGAAAGA POSIT: 378842 DIFF : 66
    CATCCCAAAAGGGATTGAAAGA POSIT: 383408 DIFF : 4566


    AGTTGCATCCCAAAAGGGATTG - 18

    AGTTGCATCCCAAAAGGGATTG POSIT: 354891 DIFF : -28517
    AGTTGCATCCCAAAAGGGATTG POSIT: 354955 DIFF : 64
    AGTTGCATCCCAAAAGGGATTG POSIT: 355207 DIFF : 252
    AGTTGCATCCCAAAAGGGATTG POSIT: 355400 DIFF : 193
    AGTTGCATCCCAAAAGGGATTG POSIT: 355659 DIFF : 259
    AGTTGCATCCCAAAAGGGATTG POSIT: 355721 DIFF : 62
    AGTTGCATCCCAAAAGGGATTG POSIT: 355846 DIFF : 125
    AGTTGCATCCCAAAAGGGATTG POSIT: 355972 DIFF : 126
    AGTTGCATCCCAAAAGGGATTG POSIT: 356356 DIFF : 384
    AGTTGCATCCCAAAAGGGATTG POSIT: 356802 DIFF : 446
    AGTTGCATCCCAAAAGGGATTG POSIT: 356932 DIFF : 130
    AGTTGCATCCCAAAAGGGATTG POSIT: 357190 DIFF : 258
    AGTTGCATCCCAAAAGGGATTG POSIT: 357254 DIFF : 64
    AGTTGCATCCCAAAAGGGATTG POSIT: 357695 DIFF : 441
    AGTTGCATCCCAAAAGGGATTG POSIT: 358081 DIFF : 386
    AGTTGCATCCCAAAAGGGATTG POSIT: 378316 DIFF : 20235
    AGTTGCATCCCAAAAGGGATTG POSIT: 378518 DIFF : 202
    AGTTGCATCCCAAAAGGGATTG POSIT: 378708 DIFF : 190


    CGTTGCATCCCAAAAGGGATTG - 19

    CGTTGCATCCCAAAAGGGATTG POSIT: 355271 DIFF : -23437
    CGTTGCATCCCAAAAGGGATTG POSIT: 355464 DIFF : 193
    CGTTGCATCCCAAAAGGGATTG POSIT: 355531 DIFF : 67
    CGTTGCATCCCAAAAGGGATTG POSIT: 355909 DIFF : 378
    CGTTGCATCCCAAAAGGGATTG POSIT: 356227 DIFF : 318
    CGTTGCATCCCAAAAGGGATTG POSIT: 356292 DIFF : 65
    CGTTGCATCCCAAAAGGGATTG POSIT: 356420 DIFF : 128
    CGTTGCATCCCAAAAGGGATTG POSIT: 356483 DIFF : 63
    CGTTGCATCCCAAAAGGGATTG POSIT: 356738 DIFF : 255
    CGTTGCATCCCAAAAGGGATTG POSIT: 356999 DIFF : 261
    CGTTGCATCCCAAAAGGGATTG POSIT: 357319 DIFF : 320
    CGTTGCATCCCAAAAGGGATTG POSIT: 357382 DIFF : 63
    CGTTGCATCCCAAAAGGGATTG POSIT: 357445 DIFF : 63
    CGTTGCATCCCAAAAGGGATTG POSIT: 357633 DIFF : 188
    CGTTGCATCCCAAAAGGGATTG POSIT: 357889 DIFF : 256
    CGTTGCATCCCAAAAGGGATTG POSIT: 357953 DIFF : 64
    CGTTGCATCCCAAAAGGGATTG POSIT: 358016 DIFF : 63
    CGTTGCATCCCAAAAGGGATTG POSIT: 378455 DIFF : 20439
    CGTTGCATCCCAAAAGGGATTG POSIT: 378581 DIFF : 126


    TGTTGCATCCCAAAAGGGATTG - 20

    TGTTGCATCCCAAAAGGGATTG POSIT: 355018 DIFF : -23563
    TGTTGCATCCCAAAAGGGATTG POSIT: 355145 DIFF : 127
    TGTTGCATCCCAAAAGGGATTG POSIT: 355335 DIFF : 190
    TGTTGCATCCCAAAAGGGATTG POSIT: 355595 DIFF : 260
    TGTTGCATCCCAAAAGGGATTG POSIT: 356036 DIFF : 441
    TGTTGCATCCCAAAAGGGATTG POSIT: 356100 DIFF : 64
    TGTTGCATCCCAAAAGGGATTG POSIT: 356610 DIFF : 510
    TGTTGCATCCCAAAAGGGATTG POSIT: 356673 DIFF : 63
    TGTTGCATCCCAAAAGGGATTG POSIT: 356868 DIFF : 195
    TGTTGCATCCCAAAAGGGATTG POSIT: 357062 DIFF : 194
    TGTTGCATCCCAAAAGGGATTG POSIT: 357508 DIFF : 446
    TGTTGCATCCCAAAAGGGATTG POSIT: 357570 DIFF : 62
    TGTTGCATCCCAAAAGGGATTG POSIT: 357759 DIFF : 189
    TGTTGCATCCCAAAAGGGATTG POSIT: 357824 DIFF : 65
    TGTTGCATCCCAAAAGGGATTG POSIT: 358144 DIFF : 320
    TGTTGCATCCCAAAAGGGATTG POSIT: 358208 DIFF : 64
    TGTTGCATCCCAAAAGGGATTG POSIT: 378392 DIFF : 20184
    TGTTGCATCCCAAAAGGGATTG POSIT: 378644 DIFF : 252
    TGTTGCATCCCAAAAGGGATTG POSIT: 378771 DIFF : 127
    TGTTGCATCCCAAAAGGGATTG POSIT: 378837 DIFF : 66


    GCATCCCAAAAGGGATTGAAAT - 25

    GCATCCCAAAAGGGATTGAAAT POSIT: 354895 DIFF : -23942
    GCATCCCAAAAGGGATTGAAAT POSIT: 354959 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 355022 DIFF : 63
    GCATCCCAAAAGGGATTGAAAT POSIT: 355087 DIFF : 65
    GCATCCCAAAAGGGATTGAAAT POSIT: 355149 DIFF : 62
    GCATCCCAAAAGGGATTGAAAT POSIT: 355211 DIFF : 62
    GCATCCCAAAAGGGATTGAAAT POSIT: 355275 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 355339 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 355404 DIFF : 65
    GCATCCCAAAAGGGATTGAAAT POSIT: 355468 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 355535 DIFF : 67
    GCATCCCAAAAGGGATTGAAAT POSIT: 355599 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 355663 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 355725 DIFF : 62
    GCATCCCAAAAGGGATTGAAAT POSIT: 355787 DIFF : 62
    GCATCCCAAAAGGGATTGAAAT POSIT: 355850 DIFF : 63
    GCATCCCAAAAGGGATTGAAAT POSIT: 355913 DIFF : 63
    GCATCCCAAAAGGGATTGAAAT POSIT: 355976 DIFF : 63
    GCATCCCAAAAGGGATTGAAAT POSIT: 356040 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 356104 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 356168 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 356231 DIFF : 63
    GCATCCCAAAAGGGATTGAAAT POSIT: 356296 DIFF : 65
    GCATCCCAAAAGGGATTGAAAT POSIT: 356360 DIFF : 64
    GCATCCCAAAAGGGATTGAAAT POSIT: 356424 DIFF : 64


    GCATCCCAAAAGGGATTGAAAG - 37
    GTTGAAATAAGGAAGAACTGAA - 37
    TGAAATAAGGAAGAACTGAAAG - 37
    TTGAAATAAGGAAGAACTGAAA - 37

    GCATCCCAAAAGGGATTGAAAG POSIT: 356487 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 356614 DIFF : 127
    GCATCCCAAAAGGGATTGAAAG POSIT: 356677 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 356742 DIFF : 65
    GCATCCCAAAAGGGATTGAAAG POSIT: 356806 DIFF : 64
    GCATCCCAAAAGGGATTGAAAG POSIT: 356872 DIFF : 66
    GCATCCCAAAAGGGATTGAAAG POSIT: 356936 DIFF : 64
    GCATCCCAAAAGGGATTGAAAG POSIT: 357003 DIFF : 67
    GCATCCCAAAAGGGATTGAAAG POSIT: 357066 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 357129 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 357194 DIFF : 65
    GCATCCCAAAAGGGATTGAAAG POSIT: 357258 DIFF : 64
    GCATCCCAAAAGGGATTGAAAG POSIT: 357323 DIFF : 65
    GCATCCCAAAAGGGATTGAAAG POSIT: 357386 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 357449 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 357512 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 357574 DIFF : 62
    GCATCCCAAAAGGGATTGAAAG POSIT: 357637 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 357699 DIFF : 62
    GCATCCCAAAAGGGATTGAAAG POSIT: 357763 DIFF : 64
    GCATCCCAAAAGGGATTGAAAG POSIT: 357828 DIFF : 65
    GCATCCCAAAAGGGATTGAAAG POSIT: 357893 DIFF : 65
    GCATCCCAAAAGGGATTGAAAG POSIT: 357957 DIFF : 64
    GCATCCCAAAAGGGATTGAAAG POSIT: 358020 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 358085 DIFF : 65
    GCATCCCAAAAGGGATTGAAAG POSIT: 358148 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 358212 DIFF : 64
    GCATCCCAAAAGGGATTGAAAG POSIT: 378320 DIFF : 20108
    GCATCCCAAAAGGGATTGAAAG POSIT: 378396 DIFF : 76
    GCATCCCAAAAGGGATTGAAAG POSIT: 378459 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 378522 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 378585 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 378648 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 378712 DIFF : 64
    GCATCCCAAAAGGGATTGAAAG POSIT: 378775 DIFF : 63
    GCATCCCAAAAGGGATTGAAAG POSIT: 378841 DIFF : 66
    GCATCCCAAAAGGGATTGAAAG POSIT: 383407 DIFF : 4566

    GTTGAAATAAGGAAGAACTGAA POSIT: 1562888 DIFF : 1179481
    GTTGAAATAAGGAAGAACTGAA POSIT: 1562954 DIFF : 66
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563012 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563069 DIFF : 57
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563127 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563187 DIFF : 60
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563247 DIFF : 60
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563307 DIFF : 60
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563365 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563423 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563482 DIFF : 59
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563540 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563598 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563655 DIFF : 57
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563715 DIFF : 60
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563773 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563831 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563889 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1563948 DIFF : 59
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564009 DIFF : 61
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564066 DIFF : 57
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564125 DIFF : 59
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564185 DIFF : 60
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564244 DIFF : 59
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564302 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564360 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564418 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564476 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564534 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564592 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564650 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564708 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564767 DIFF : 59
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564825 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564883 DIFF : 58
    GTTGAAATAAGGAAGAACTGAA POSIT: 1564943 DIFF : 60
    GTTGAAATAAGGAAGAACTGAA POSIT: 1565001 DIFF : 58

    TGAAATAAGGAAGAACTGAAAG POSIT: 1562890 DIFF : -2111
    TGAAATAAGGAAGAACTGAAAG POSIT: 1562956 DIFF : 66
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563014 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563071 DIFF : 57
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563129 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563189 DIFF : 60
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563249 DIFF : 60
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563309 DIFF : 60
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563367 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563425 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563484 DIFF : 59
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563542 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563600 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563657 DIFF : 57
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563717 DIFF : 60
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563775 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563833 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563891 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1563950 DIFF : 59
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564011 DIFF : 61
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564068 DIFF : 57
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564127 DIFF : 59
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564187 DIFF : 60
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564246 DIFF : 59
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564304 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564362 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564420 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564478 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564536 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564594 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564652 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564710 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564769 DIFF : 59
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564827 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564885 DIFF : 58
    TGAAATAAGGAAGAACTGAAAG POSIT: 1564945 DIFF : 60
    TGAAATAAGGAAGAACTGAAAG POSIT: 1565003 DIFF : 58

    TTGAAATAAGGAAGAACTGAAA POSIT: 1562889 DIFF : -2114
    TTGAAATAAGGAAGAACTGAAA POSIT: 1562955 DIFF : 66
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563013 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563070 DIFF : 57
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563128 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563188 DIFF : 60
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563248 DIFF : 60
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563308 DIFF : 60
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563366 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563424 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563483 DIFF : 59
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563541 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563599 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563656 DIFF : 57
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563716 DIFF : 60
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563774 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563832 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563890 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1563949 DIFF : 59
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564010 DIFF : 61
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564067 DIFF : 57
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564126 DIFF : 59
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564186 DIFF : 60
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564245 DIFF : 59
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564303 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564361 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564419 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564477 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564535 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564593 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564651 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564709 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564768 DIFF : 59
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564826 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564884 DIFF : 58
    TTGAAATAAGGAAGAACTGAAA POSIT: 1564944 DIFF : 60
    TTGAAATAAGGAAGAACTGAAA POSIT: 1565002 DIFF : 58


    This might give us a way of seeing if there is a bigger pattern linking individual repeat sequences.

    I think that a statistical analysis of the resulting figures is required. For example graphs of -

    1. cycle lengths
    2. repeat lengths
    3. spacer lengths
    4. Total length of all cycles for any one repeat
    5. Distance between adjacent repeat sequences
    Last edited by Craig.Paardekooper; 08-12-2012 at 10:19 AM.

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Tags for this Thread

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may post replies
  • You may not post attachments
  • You may edit your posts
  •