Skill Learning ·

Problem & approach

Problem. In natural behavior—and in most reinforcement learning settings—there are no explicit labels defining what a “skill” is. An agent must infer the existence, boundaries, and usefulness of skills directly from experience.

Approach. I frame skill discovery as a **probabilistic inference problem**. Rather than treating skills as predefined sub-policies, I view them as latent variables that explain patterns in the agent’s interactions with its environment. This connects reinforcement learning, neuroscience, and statistical inference: learning skills becomes the process of inferring latent structure that makes control and learning efficient.

Highlights

Inferring latent skill structure from unlabelled experience.
Investigating how curricula and inductive biases shape skill libraries.
Studying how agents arbitrate between adapting existing skills and creating new ones.

Questions I’m exploring

1. Inferring a good basis of skills

What defines a useful skill space for an agent acting in a complex world?
How can an agent infer skill representations that capture the right temporal or causal regularities?
Which features of the environment—or of the agent’s own dynamics—determine what makes a good “basis”?
How do the inductive biases of architectures or objectives (e.g., sparsity, modularity) influence what kinds of skills emerge?
Are there neural analogues of these inductive biases in biological motor systems?

2. Sequencing and exploration

Once a library of skills exists, how can an agent efficiently explore combinations and sequences of them?
How does hierarchical structure improve credit assignment and long-horizon exploration?
Can exploration itself be viewed as an inference problem—inferring which sequences are most likely to yield new information or reward?
How might compositional skill reuse relate to the way brains flexibly combine submovements or cognitive routines?

3. Curricula and structure learning

How do different task distributions or curricula influence which skills are discovered?
Can we formalize curriculum design as shaping a prior or posterior over latent skill structure?
Do staged or curriculum-based training regimes lead to qualitatively different skill representations than flat, single-task learning?
How might developmental learning trajectories in animals reflect an analogous form of curriculum shaping?

4. Adaptation vs. de novo learning

When entering a new context, how does an agent arbitrate between adapting an existing skill and creating a new one?
How can this decision be expressed as Bayesian reuse—updating beliefs about the usefulness or scope of existing skills?
What computational principles govern when to refine a skill versus when to expand the library?
Are there parallels to neural mechanisms of plasticity and consolidation during motor learning?

Results / status

I’m developing theoretical and empirical frameworks for:

Inferring skill boundaries using probabilistic latent-variable models.
Designing curricula that encourage compositional skill reuse.
Comparing algorithmic and neural signatures of inferred skill representations.

If you're interested in this line of research, here are some more people & labs to follow

Related publications

See all publications for this project →