Thue-Morse sequence foldings

How might a Thue-Morse (TM) sequence occur through physical processes? Might something like a linear chain of algae, where two characteristics are TM along the chain, form naturally? Or a non-periodic crystal with molecular sheet structure in a TM sequence?

The Thue-Morse sequence is a two symbol non-periodic pattern: ABBABAABBAABABBA... The sequence can be formed by starting with one symbol (A), appending its complementary symbol string (A -> AB) and then repeating this operation indefinitely using the new string (A, AB, ABBA, ABBABAAB, ABBABAABBAABABBA, ...).

Is there a local cellular automata (CA) system that generates, as its current state, increasing length TM sub-strings? No, it has a nested structure requiring a rule based on the entire current sub-string: CA systems only use more or less local rules. [2008-10-15 Also yes! See Wolfram's examples of 1-D series of states, TM can appear as a "column", the sequence of states experienced by a particular cell. A cell's history can be TM.] There are many algorithms, including CA systems, that iteratively generate TM, but all require traversal of the previous sub-string or equivalently memory that encodes it.

The algorithm for a TM sequence descibed above resembles a physical folding: repeat (but not fold) the current TM substring across an endpoint and complement the new portion. AB->AB AB (repeat) and then -> AB BA (complement). But a fold operation involves a reflection, not a simple repeat, and a complement.

To illustrate, consider a directional sheet on edge, or linear band (Figure 1). The direction up (U) or down (D) are the symbols of the horizontal sequence. Starting with an unfolded U, after one fold the sequence is UD. After two folds the sequence is UDUD. Further folds result in an alternating, periodic sequence. The direction of the folding, to the left or right, at any step is irrelevant to the final sequence. There are other foldings that also lead to an alternating sequence.


Figure 1: Repeated folding of an directional sheet results in a periodic sequence.

Is there a way to fold a directional sheet to form a long TM sub-sequence? Yes.

For example the folds in Figure 2 form the first eight elements of a TM sequence starting with U. The shape has a mirror symmetry about the center, but the direction is anti-symmetric about the center.


Figure 2: Folding that results in the first eight elements of the Thue-Morse sequence.

This solution can't be extended to include more elements to the right, as the ends are enclosed. But it is not unique: Figure 3a shows another solution. This solution can be extended to the right, doubling the length of the folded sub-sequence (Figure 3b). This extension is essentially a translation of the right side folding pattern (and an inversion of the sheet direction).


Figure 3: Another folding that results in the first eight elements (a), and can be extended to the second eight elements (b), of the Thue-Morse sequence.

And a third similar solution with only the inner folds modified (Figure 4), can also be extended by translation.


Figure 4: A third solution for the first eight elements, that can also be extended by translation and inversion of sheet direction.

Are there more unique foldings for these first eight elements? I don't think so.

Although these foldings can't be extended to more than sixteen elements, the folding procedure can be used to include any length TM substring. Figure 5 shows a folding for the first sixteen elements, that can be extended to include the next sixteen by reflection. Is it unique?


Figure 5: A solution for the first sixteen elements, that can be extended to 32 elements by reflection.

What is the procedure, and can it be proved that it works for any TM sub-string? Will it work for any string that has a balanced number of symbols? An infinite string, such as a Bernoulli sequence?

It seems unlikely that this kind of nested folding could occur naturally. The procedure is not local; up to half the sub-string must be traversed to make a folding decision.

 

Also see:

Paper folding and the Dragon curve (fractal)

(concerning folding and the Rudin-Shapiro sequence):

Patrick Morton
Connections between binary patterns and paperfolding
Journal de théorie des nombres de Bordeaux, 2 no. 1 (1990), p. 1-12 (PDF )

"... there is the beautiful fact discovered by Mendès France that [the 1-D Rudin-Shapiro sequence is exactly the direction sequence of the paperfolding sequence obtained by folding a rectangular piece of paper alternately under and over the left edge (which is held fixed). (See [2] or [4].)

[2] M. Dekking, M. Mendès France and A. van der Poorten, Folds I,II III, Math. Intelligencer 4 (1982), 130-138, 173-181, 190-195. MR 684028 | Zbl 0493.10001

[4] P. Morton and W.J. Mourant, Paper folding, digit patterns and groups of arithmetic fractals, Proc. London Math. Soc. 59, 2, (1989), 253-293. MR 1004431 | Zbl 0694.10009

J. Dekking et al., Folds!, Math. Intelligencer 4 (1982). A series of 4 articles on paperfolding related to sequences.

Finite automata and arithmetic,  J.-P. Allouche

Comments

TM proteins

Nice. I want to make sure I understand why you did this exercise.  Was it to establish that TMs can (or cannot) be manifested in nature as the consequence of only folding processes? It seems almost more like a search for any kind of sequence that if folded properly results in an aperiodic pattern -was that the motivation?  WHY WHY WHY. And I'm not buying  'just because I felt like it'.

Youre diagrams are reminiscent of protein b-sheets. Maybe there is somekind of volumetric solution that can be analogized to the structure of a globular protein structure.  It might be interestting to genetically engineer a protein with cystiene residue intervals (for disulfide bonds ) endcoded in some manner by Thue morse to see if they form some kind of self organizing higher order structures (virion capsules...)

BTW this post would be a good example if you want to incorporate it into one of your thue morse books by using the 'outline' tab at the top of the page. 

 

Why, why, why

1) Because it's fun. It's similar to doodling, in that I can do it in intersticial periods and it is engaging. There's a lot of longer term, more immediately useful projects that I haven't been able to engage with fully -- attention span problems. It IS doodling.

2) Because I can see the value, both practically and theoretically. What if you saw a structure in nature that followed a TM sequence, would you recognize that fact? Probably not. But if one knew how they might form (folding?), one could know where to look. Duplicate, translate, append and complement -- hey, that process sounds like genetic transcription. But no, the append step doesn't happen. [2008-10-15 But yes, append does happen in some genetic systems -- concatemers! Concatemers of complementary sequences have not been observed, but who's to say they can't occur.] But maybe sometimes append does happen as a chromosomal duplication "error". Shouldn't be too tough to look for TM processes across species. Theoretically it is also has value -- Gray codes are directly related. The folding patterns and 2D tilings have an erie resemblance to Horseshoe maps. Why study Horseshoe maps? Because they encode chaotic dynamics well.

No it doesn't seem likely that naturally occuring algorithms would result in the folding I describe. But it got me thinking about the topology of directional strings. Nothing new, but here's something new to me: If you fold a directional loop (a rubber band with arrows) any way, and slice it (in any non-intersecting curved path) the sequence is strictly alternating. Try it! Like Mobius strips and knots, the relationship between topology and number theory is just plain cool.

Like your example, this might also be useful for engineering purposes. It's easier to self assemble periodic structures and random structures than intermedate complexity structures. If you could design a moleculular string, you might be able to force a TM cross-section. (But the molecular string wouldn't be TM. I'll leave it as an excerise as to how the folding sequence relates to the cross-section sequence.)

3) There is a beauty to it. Like beautiful images, TM is balanced between periodicity and randomness.

4) Primarily, just because I felt like it. 

Hey, thanks for the "outline tab" hint! I hadn't gotten around to replicating this blog entry as a project page, and this was exactly the right trick.

Three more reasons Why

Quotes from George Pólya

  • If you can't solve a problem, then there is an easier problem you can solve: find it.
  • A great discovery solves a great problem, but there is a grain of discovery in the solution of any problem. Your problem may be modest, but if it challenges your curiosity and brings into play your inventive faculties, and if you solve it by your own means, you may experience the tension and enjoy the triumph of discovery.
  • Wishful thinking is imagining good things you don't have...[It] may be bad as too much salt is bad in the soup and even a little garlic is bad in the chocolate pudding. I mean, wishful thinking may be bad if there is too much of it or in the wrong place, but it is good in itself and may be a great help in life and in problem solving.

 

DNA Origami

This kind of folding can be used to engineer nanostructures, although I don't see that 90 degree bends have been accomodated with the following techniques, from

Assembling Materials with DNA as the Guide

Faisal A. Aldaye, Alison L. Palmer,  Hanadi F. Sleiman

Science 26 September 2008:
Vol. 321. no. 5897, pp. 1795 - 1799

In "DNA origami," a single continuous strand of DNA is systematically folded using a large number of smaller DNA strands. This approach was first reported by the group of Joyce, who synthesized a single continuous DNA strand that is 1.6 kb long and, in a single step, annealed it in the presence of five smaller strands to generate a DNA octahedron (19). Ingeniously, Rothemund generalized this approach to fold naturally occurring genomic DNA into any 2D shape (20). In his DNA origami approach, the long strand is folded into the desired shape by a number of smaller strands ("stapling strands") (Fig. 1H, left). The sequences of these strands are computationally designed. Rothemund was able to assemble the same initial long strand of DNA into squares, rectangles, stars, smiley faces (Fig. 1H, middle), and many other 2D shapes. The power of this approach lies in its addressability: Because each stapling strand is a unique sequence, each strand is also as patially addressable bit. Hairpins were incorporated into stapling strands to write words, such as "DNA," and to draw complex objects, such as the outline of the Western Hemisphere (Fig. 1H, right). DNA origami will be useful for accessing larger DNA shapes with highly addressable surfaces. In a recent application, Yan constructed origami-based DNA nanoarrays for label-free RNA detection of the three genes Rag-1, c-myc, and β-actin (21). Shih, Chou, and Douglas synthesized DNA nanotubes by "rolling" a DNA origami array and used the alignment made by these liquid crystalline materials to measure nuclear magnetic resonance parameters in transmembrane proteins (22), a powerful way to use DNA organization in protein structure determination.

Post new comment

Security question, designed to stop automated spam bots