Implementation of DEvelopmentAl Learning (IDEAL) Course

Learning regularities of interaction

Figure 32 presents the principles of a rudimentary system that learns and exploits two-step regularities of interaction.

Figure 32: Rudimentary learning of regularities of interaction.

On time step t, the agent enacts the interaction i_t = ⟨e_t,r_t⟩. Enacting i_t means performing the experiment e_t and receiving the result r_t (Page 21). The agent records the two-step sequence ⟨i_t-1,i_t⟩ made of the previously enacted interaction i_t-1 followed by i_t. The sequence of interactions ⟨i_t-1,i_t⟩ is called a composite interaction. i_t-1 is called ⟨i_t-1,i_t⟩'s pre-interaction, and i_t is called ⟨i_t-1,i_t⟩'s post-interaction. From now on, low-level interactions i = ⟨e,r⟩ will be called primitive interactions to differentiate them from composite interactions.
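
The recording step described above can be illustrated with a minimal Python sketch. The names `Interaction`, `Composite`, `Memory`, and `record`, as well as the labels "e1", "r1", and so on, are hypothetical and chosen only for this illustration; they are not taken from the course's own code.

```python
from typing import NamedTuple, Optional, Set


class Interaction(NamedTuple):
    """A primitive interaction i = <e, r> (hypothetical representation)."""
    experiment: str
    result: str


class Composite(NamedTuple):
    """A composite interaction <pre, post> made of two primitive interactions."""
    pre: Interaction
    post: Interaction


class Memory:
    """Stores the composite interactions learned so far (illustrative only)."""

    def __init__(self) -> None:
        self.composites: Set[Composite] = set()
        self.previous: Optional[Interaction] = None  # i_t-1, unknown at first

    def record(self, enacted: Interaction) -> None:
        """Record <i_t-1, i_t> once a previous interaction exists."""
        if self.previous is not None:
            self.composites.add(Composite(self.previous, enacted))
        self.previous = enacted


memory = Memory()
for e, r in [("e1", "r1"), ("e2", "r2"), ("e1", "r1")]:
    memory.record(Interaction(e, r))
```

After these three time steps, `memory.composites` holds two composite interactions: ⟨e1r1, e2r2⟩ and ⟨e2r2, e1r1⟩. A set is used here as a simplifying assumption, so a sequence enacted several times is stored only once.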

The enacted primitive interaction i_t activates previously learned composite interactions when it matches their pre-interaction. For example, if i_t = a and if the composite interaction ⟨a,b⟩ has been learned before time t, then the composite interaction ⟨a,b⟩ is activated, meaning it is recalled from memory. Activated composite interactions propose their post-interaction's experiment, in this case: b's experiment. If the sequence ⟨a,b⟩ corresponds to a regularity of interaction, then it is probable that the sequence ⟨a,b⟩ can be enacted again. Therefore, the agent can anticipate that performing b's experiment will likely produce b's result. The agent can thus base its choice of the next experiment on this anticipation.
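
As a rough sketch of the activation and proposal steps, assuming primitive interactions are represented as (experiment, result) pairs and composite interactions as (pre, post) pairs, one might write the following. The function names `activate` and `propose` and the labels used in the example are hypothetical.

```python
from typing import List, Set, Tuple

Interaction = Tuple[str, str]                # (experiment, result)
Composite = Tuple[Interaction, Interaction]  # (pre-interaction, post-interaction)


def activate(enacted: Interaction, learned: Set[Composite]) -> List[Composite]:
    """Return the learned composites whose pre-interaction matches i_t."""
    return [c for c in learned if c[0] == enacted]


def propose(activated: List[Composite]) -> List[Tuple[str, str]]:
    """For each activated composite, propose its post-interaction's
    experiment together with the result the agent can anticipate."""
    return [(post[0], post[1]) for _pre, post in activated]


a: Interaction = ("e1", "r1")
b: Interaction = ("e2", "r2")
learned: Set[Composite] = {(a, b), (b, a)}

activated = activate(a, learned)   # only <a, b> has a as pre-interaction
proposals = propose(activated)     # b's experiment with b's anticipated result
```

Here, enacting a activates ⟨a,b⟩, which proposes b's experiment "e2" with the anticipated result "r2".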

Note that the enacted primitive interaction i_t may activate more than one composite interaction, and these may propose different experiments. We create an interactionally motivated agent by implementing a decision mechanism that uses the agent's capacity for anticipation to choose experiments that will likely result in interactions that have a positive valence, and to avoid experiments that will likely result in interactions that have a negative valence.
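
One possible decision mechanism is sketched below in Python. It is an illustration under stated assumptions, not the course's implementation: each experiment is scored by summing the valences of the post-interactions anticipated by the activated composite interactions, experiments with no proposal score 0, and ties go to the first experiment listed. The function name `choose_experiment` and the valence values are hypothetical.

```python
from typing import Dict, List, Tuple

Interaction = Tuple[str, str]                # (experiment, result)
Composite = Tuple[Interaction, Interaction]  # (pre-interaction, post-interaction)


def choose_experiment(
    enacted: Interaction,
    learned: List[Composite],
    valence: Dict[Interaction, int],
    experiments: List[str],
) -> str:
    """Pick the experiment with the highest anticipated valence.

    Assumption: the score of an experiment is the sum of the valences of the
    post-interactions proposed by the activated composites.
    """
    scores = {e: 0 for e in experiments}
    for pre, post in learned:
        if pre == enacted:                       # composite is activated
            scores[post[0]] += valence.get(post, 0)
    # max() returns the first experiment among those with the best score.
    return max(experiments, key=lambda e: scores[e])


experiments = ["e1", "e2"]
valence = {("e1", "r1"): -1, ("e2", "r2"): 1}   # assumed valences
learned = [
    (("e1", "r1"), ("e1", "r1")),  # repeating e1 after e1r1 gave r1 again
    (("e1", "r1"), ("e2", "r2")),  # trying e2 after e1r1 gave r2
]

choice = choose_experiment(("e1", "r1"), learned, valence, experiments)
```

In this example, after enacting ⟨e1,r1⟩ the agent anticipates a negative valence for e1 and a positive valence for e2, so `choice` is "e2". When nothing is activated, all scores are 0 and the first experiment is returned; a different fallback could be substituted.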
