Choreograph
Choreograph is WPP's global data products and technology company, working across audience insights, planning, AI-based media optimization, predictive analytics, data enrichment and modeling for advertising. The projects I worked on were joint WPP-Google initiatives at the intersection of WPP's media/data business and Google's marketing and cloud technology. In practice this meant early-stage adtech product work around multimodal AI, synthetic data and unified customer understanding.
The AI/ML science work has focused on multimodal models that help form a unified customer view from heterogeneous evidence: surveys, video, text, imagery, behavioral signals, commercial records and large-scale internet data. The modeling work connects information extraction, representation learning, audience prediction, lookalike and propensity modeling, synthetic customer or product data and evaluation of whether the learned representation captures meaningful customer structure.
The scientific challenge is scale and heterogeneity. Customer signals are scattered across formats, platforms and levels of reliability, so the models have to extract useful structure from noisy multimodal data and turn it into representations that remain stable enough for downstream use. That makes architecture choices, sampling strategy, synthetic-data design, evaluation and error analysis central to the work.
Selected Contributions
- Designed multimodal AI models across surveys, video, text, imagery, behavioral and commercial retail/media data.
- Built extraction, representation-learning, audience-prediction, propensity-modeling and synthetic-data methods from problem framing through training and evaluation.
- Developed modeling components and evaluation routines for customer-understanding systems at large scale.
- Translated retailer and advertiser questions into model objectives, datasets, evaluation protocols and deployable ML capabilities.
- Connected synthetic data and unified customer-view representations to planning or activation use cases where stability and interpretability mattered.
Methods and Tools
- Multimodal representation learning
- Synthetic data
- Audience modeling
- Classification and ranking
- Experimentation
- Python
- Model evaluation