Built for frontier AI teams

Built for frontier AI teams

Video datasets
for frontier models.

High fidelity data

Built for training

Easy integration

FROM THE LAB

FROM THE LAB

FROM THE LAB

We collect, clean, and evaluate video so your team can focus on models.

Curated video data

Curated video data

Curated video data

Benchmarks and metrics

Benchmarks and metrics

Benchmarks and metrics

Production ready formats

Production ready formats

Production ready formats

READY TO LICENSE

READY TO LICENSE

READY TO LICENSE

Datasets, built for world models, robotics, and generative video systems

WorldScenes 100K

Open world short clips across cities, homes, transit, nature, and retail, three to ten seconds, 24 to 30 fps, multilingual captions, entities, activities, depth and optical flow on subsets, panoptic segmentation on stratified splits.

PersonaFrames 30K
PhysicsBench 10K
AdCreative 50K
EgoHands 12K
WorldScenes 100K

Open world short clips across cities, homes, transit, nature, and retail, three to ten seconds, 24 to 30 fps, multilingual captions, entities, activities, depth and optical flow on subsets, panoptic segmentation on stratified splits.

PersonaFrames 30K
PhysicsBench 10K
AdCreative 50K
EgoHands 12K
WorldScenes 100K

Open world short clips across cities, homes, transit, nature, and retail, three to ten seconds, 24 to 30 fps, multilingual captions, entities, activities, depth and optical flow on subsets, panoptic segmentation on stratified splits.

PersonaFrames 30K
PhysicsBench 10K
AdCreative 50K
EgoHands 12K

Train models that understand how the world really moves

BUILT FOR FRONTIER TEAMS

BUILT FOR FRONTIER TEAMS

BUILT FOR FRONTIER TEAMS

Let us show your models the world.

Curated video data

High quality clips reviewed, filtered, and organized by our team, not scraped at random.

Curated video data

High quality clips reviewed, filtered, and organized by our team, not scraped at random.

Curated video data

High quality clips reviewed, filtered, and organized by our team, not scraped at random.

Built from real failures

We start from where your models break and design datasets to stress those cases.

Built from real failures

We start from where your models break and design datasets to stress those cases.

Built from real failures

We start from where your models break and design datasets to stress those cases.

Clear rights and safety

Rights, consent, and safety checks handled so legal and security can move quickly.

Clear rights and safety

Rights, consent, and safety checks handled so legal and security can move quickly.

Clear rights and safety

Rights, consent, and safety checks handled so legal and security can move quickly.

Training ready formats

Seamlessly customize layouts and features to match any style.

Training ready formats

Seamlessly customize layouts and features to match any style.

Training ready formats

Seamlessly customize layouts and features to match any style.

Continuously refreshed

New domains, edge cases, and updated evaluators as your products, models, and data needs change.

Continuously refreshed

New domains, edge cases, and updated evaluators as your products, models, and data needs change.

Continuously refreshed

New domains, edge cases, and updated evaluators as your products, models, and data needs change.

Create a free website with Framer, the website builder loved by startups, designers and agencies.