Meta-Learning: Learning to Learn
How models can quickly adapt to new tasks with minimal data.
The core idea
Traditional ML learns one task at a time. Meta-learning optimizes across a distribution of tasks so the model acquires transferable inductive biases that support rapid adaptation.
Problem setup
During meta-training, the learner sees many tasks sampled from a task family. At meta-test time, it must adapt to a new task from the same family using few examples (few-shot regime).
Major approaches
- Optimization-based: e.g., MAML learns initial parameters for fast gradient updates.
- Metric-based: nearest-neighbor decisions in learned embedding spaces.
- Model-based: recurrent or memory-augmented architectures for fast in-context adaptation.
Where it helps
- Few-shot vision and language tasks.
- Personalization systems with sparse user data.
- Robotics and control where data collection is expensive.
- Hyperparameter and architecture optimization.
Takeaway: Meta-learning shifts the objective from solving one task well to acquiring reusable learning dynamics.