The world's most Comprehensive Knowledge Graph of nature

We label our graph with hundreds of geological, environmental, and chemical tags per sequence. We can see full environmental context, giving you access to functional prediction like never before.
Go beyond public databases

90% of the proteins in our graph are entirely new

Current AI-based protein design is fundamentally limited by the datasets available. At Basecamp Research, we solve this problem.

We travel to all corners of the globe to map nature's genetic biodiversity. We have already increased the number of proteins known to science by 50%. This map unlocks the next holy grail of computational biology; in silico prediction of complex function.
Evolutionary context

Our database is already larger than UniProt, but 4x less redundant

Continuous growth

We've increased the number of proteins known to science by 50%

Why a Knowledge Graph? 

In silico predictions beyond traditional annotations

Public databases are a catalog - a list. Our database is a knowledge graph, a network that shows us relationships between proteins that have never been seen before. We leverage the environmental context and the network architecture of the graph to predict function. We can identify any complex function, including ones that aren't currently annotated, without relying on sequence similarity.

We’ve identified sequences with an improved performance that are as low as 20% in sequence identity to protein engineering starting points, with identical function and structure. We help you optimise performance, circumvent patents, and cut down on lab testing. 
Get in touch
Expeditions Map

We target our expeditions based on your requirements.

We know the exact geological and geochemical properties of the locations where various types of protein classes can be found. When you partner with us, we proactively find samples in places that we know will have a high hit rate for you.
A map with several connected locations that Basecamp Research has sampled
Close icon