Google's 'Watch & Learn' framework cracks the data bottleneck for training computer-use agents
A new framework developed by researchers at Google Cloud and DeepMind aims to address one of the key challenges of developing computer use agents (CUAs): Gathering high-quality training examples at scale.The framework, dubbed Watch & Learn (W&L), addresses the problem of training data generation in a way that doesn’t require human annotation and can automatically extract demonstrations from raw videos.Their experiments show that data generated W&L can be used to train or fine-t
Read more »