Expert and scientific data for life-sciences AI.
A guild of practicing pharma scientists writing — by hand — the data the next generation of life-sciences AI is built on.
Curated datasets
Study-ready data across biomarkers, RNA-seq, and multi-omics. Scoped, written, and peer-reviewed in-house.
Model evaluation
We grade what frontier models say about biology, then rewrite — in a scientist's voice — how the answer should read.
Safety review
Risk assessment for dual-use research and pathogen-adjacent generations. Senior contributors only.
Decode Origin saved us months of work. We had spent half a year struggling to identify datasets for our biomarker validation and ML models — they delivered, and we're extremely satisfied.

Ning Wang
Bioinformatics PhD, UCLA. Five years of clinical biomarker work across major therapeutic programs. Named inventor on a Nature Medicine patent.

Bin Zhou
Ex-Head of Engineering at an open-weight LLM startup (400B-parameter MoE). Ex-Apple, ex-JPMorgan.