Basecamp Research

A biodiversity-scale protein database for training biological AI models.

Category: Science
Pricing: PAID
Hosting: Cloud
Platforms: Web
Verified: Jun 16, 2026

A techbio company building BaseData, a proprietary database of protein and genome sequences collected from biodiversity worldwide under formal benefit-sharing agreements. The dataset is designed to train generative AI foundation models for protein and enzyme design, powering drug discovery, industrial biology and planetary-health applications that public databases can't reach.

Pros & cons

  Pros
  • Largest proprietary protein/genome dataset
  • Data sourced under formal benefit-sharing
  • Built for training generative protein models
  • Backed by $85M+ across Series A and B

  Cons
  • Enterprise/partnership only, no self-serve
  • Pricing not publicly disclosed
  • Aimed at biotech teams, not general users

Further reading

  • Profluent (Profluent Bio)
    Science, PAID

    Generative protein language models that design novel proteins.

    Profluent uses generative AI, specifically its ProGen-lineage protein language models, to author novel proteins from scratch or from natural scaffolds, for medicine and agriculture. It released OpenCRISPR-1, described as the first AI-designed gene editor (open-licensed and published in Nature), and is expanding into antibodies and enzymes. It operates as an AI protein-design company that works through partnerships and open sequence releases, not as a self-serve consumer platform.

    Worth knowing

    Spun out of a Salesforce-funded ProGen project; open-released OpenCRISPR-1, billed as the first AI-designed gene editor.

    • protein-design
    • gene-editing
    • drug-discovery
    • biotech
  • Chai Discovery
    Science, FREEMIUM, Open core

    Foundation models for molecular structure and antibody design.

    Chai Discovery builds foundation models for drug discovery — Chai-1 for biomolecular structure prediction across proteins, ligands, DNA/RNA, and glycans, and Chai-2 for de novo antibody design. Chai-1 weights and inference code are released for non-commercial use, and a free web server runs the model in-browser. It is both a model provider and a usable platform.

    Worth knowing

    OpenAI-backed; raised a $130M Series B at a ~$1.3B valuation in Dec 2025, co-led by Oak HC/FT and General Catalyst.

    • protein-structure
    • antibody-design
    • drug-discovery
    • open-weights
  • Periodic Labs
    Science, PAID

    AI scientists paired with autonomous labs to discover new materials.

    Periodic Labs is building AI scientists and the autonomous laboratories for them to operate, automating hypothesis generation, experiments, and analysis across the physical sciences. Its initial targets include discovering higher-temperature superconductors and new materials for chipmakers. It is a research-stage company deploying with industry partners rather than a self-serve product.

    Worth knowing

    Emerged from stealth in 2025 with a $300M seed — one of the largest ever — backed by a16z, Nvidia, Jeff Bezos, and Eric Schmidt.

    • materials-science
    • superconductors
    • autonomous-lab
    • ai-scientist