Kled the leading human data marketplace has officially partnered with The DATA Foundation to onboard over 1.5 billion user records on chain.
We've spent the last 3 months working directly with some of the leading foundation models and labs. The single biggest point of discussion was trust. Labs have 0 room for error and consumer data is the single largest but also most sensitive data type on the planet.
Labs need two things before confidently purchasing a dataset.
1. A clean audit record: end-to-end receipts, consent forms, and proof of payment. They need to fully trust that data uploaded on the marketplace can confidently be licensed without consumers introducing risk into training pipelines.
2. Full confidence that the data on our marketplace is original. Not pirated, not AI generated. They need to fully trust that data uploaded is real human unaltered content that can significantly enhance a training pipeline.
We've been hugely against 99% of all blockchain offerings that have come our way, but things aligned way too well here and we found a unique opportunity to build something great.
Kled will be moving its full audit rails onto DATA Network, backed by a16z crypto, Polychain, and other top VCs.
The instant a user uploads data to our platform we create an anonymized receipt that is automatically sent to the TRACE (demo video, live links, and more in the post below). The content hash ID, signed consent forms, full payment record, and end-to-end timestamps.
Any AI lab can now verify the legitimacy of any dataset in seconds, making us the first data marketplace in history to publicize its data audit records. Consumer identities are fully anonymized and hidden, no users will be exposed in the process.
To add to this Kled will also be supporting USDC payouts on DATA Network. This will be alongside other stablecoin options rolling out with our existing fiat payouts. All of this will be fully auditable on DATA Network as well.
Lastly Kled and The DATA Foundation will be pooling its efforts to create the world’s best fraud detection protocol. We will be allocating the majority of our time/resources towards creating this, labs need to trust our data and the creation of AGI will be a function of this trust.
I'm also joining The DATA Foundation with a Part Time Advisor Role as the Chief Data Officer, where I’ll be advising the foundation team to make sure the Trace audit product reflects what AI labs actually need to license data with confidence.
The DATA Foundation's audit / licensing rails and Kled's data marketplace are complimentary in nature, labs need both. Kled is the largest contributor to that audit layer so we have the most skin in the game to get this product right.
Every effort described here will create a safer future for consumers and labs. We are here to set the gold standard for trust and nothing will stray us away from this goal. Onward.
Introducing Trace.
Trace is our flagship, public audit and search platform where every asset permanently registered on DATA Network can be accessed.
AI Labs can search every human contributed photo or voice sample and drill into any individual record to see the full audit trail.
Trace is where labs and regulators can filter by dataset, app, data type, modality, and time to verify data provenance so AI models can be trained with confidence.