Datacurve Raises $15M to Revolutionize AI with High-Quality Coding Datasets and Gamified Platform
October 10, 2025
Datacurve's platform, Shipd, recruits over 14,000 skilled programmers to perform algorithmic challenges, debugging, and UI/UX tasks, paying for outputs to incentivize quality and speed.
The company's approach addresses the growing complexity and demand for high-quality data in AI, especially for sophisticated reinforcement learning environments.
Datacurve aims to provide the right data for model improvement, including datasets for supervised fine-tuning, RLHF pipelines, and telemetry traces of developer behavior to enhance AI performance.
The funding round was led by Chemistry, a new early-stage firm co-founded by ex-Index partner Mark Goldberg, with participation from AI lab angels, aligning with Chemistry's focus on developer tools and infrastructure.
A company is focused on creating high-quality coding datasets, private benchmarks, and developer traces through a gamified contributor platform, aiming to improve AI model training and evaluation.
Y Combinator's Winter 2024 startup Datacurve has raised $15 million in Series A funding, led by Chemistry, bringing its total funding to $17.7 million, as it aims to compete in the AI data collection industry.
Datacurve, which targets high-quality training data needs, is leveraging its funding to expand its infrastructure and offerings in the AI data ecosystem.
The company employs a 'bounty hunter' system that incentivizes skilled engineers to source difficult datasets, having paid over $1 million in bounties so far to ensure data quality.
Increasing scrutiny over data provenance, evaluation rigor, and intellectual property hygiene is making high-quality, secure, and well-governed data more valuable in AI training.
The demand for high-quality code data is rising, with competitors like Mercor and Surge scaling rapidly, and major tech firms like Meta investing heavily in AI data providers, highlighting the importance of domain-specific, high-signal data.
Datacurve emphasizes a positive user experience by treating data collection as a consumer product, which helps attract and retain top talent in data sourcing.
While initially focused on software engineering, Datacurve's infrastructure has potential applications across fields such as finance, marketing, and medicine, as it develops tools for post-training data collection.
Summary based on 2 sources
Get a daily email with more Startups stories
Sources

TechCrunch • Oct 9, 2025
Datacurve raises $15 million to take on ScaleAI
Maginative • Oct 9, 2025
Datacurve Raises $15M Series A led by Chemistry