The world's most comprehensive Knowledge Graph of nature

We label our graph with hundreds of geological, environmental, and chemical tags per sequence. We can see full environmental context, giving you access to functional prediction like never before.
Go beyond public databases

At least 90% of the proteins in our graph are entirely new

Current AI-based protein design is fundamentally limited by the datasets available. At Basecamp Research, we solve this problem by creating a categorically different, contextualised data source.

We travel to all corners of the globe to map nature's genetic biodiversity. We have already increased the number of proteins known to science by >50%. This map unlocks the next holy grail of computational biology: in silico prediction of complex function.
Evolutionary context

Our database is already 3X larger than UniProt, and 4X less redundant

Continuous growth

We've increased the number of proteins known to science by >50%

Why a knowledge graph? 

In silico predictions beyond traditional annotations

Public databases are a catalogue - basically a table.

Our database is a knowledge graph - a network that shows us relationships between proteins that have never been seen before. We leverage our unparalleled collection of environmental and genomic contextual data, as well as the network architecture of the graph to predict protein function. We can identify any complex function, including ones that aren't currently annotated, without relying on sequence similarity or prior mechanistic insight.

We’ve identified sequences with an improved performance that are as low as 20% in sequence identity to literature references, but with identical function and structure. We help you optimise performance, generate new IP, and cut down on lab testing. 
Get in touch
Explore the world with us

We target our expeditions based on your requirements

We know the exact geological and chemical properties of the locations where various types of protein classes can be found. When you partner with us, we proactively find samples in places that are predicted to have a high hit rate for you.
A map with several connected locations that Basecamp Research has sampled
Close icon