The world's most comprehensive Knowledge Graph of nature
We label our graph with hundreds of geological, environmental, and chemical tags per sequence. We can see full environmental context, giving you access to functional prediction like never before.
Go beyond public databases
At least 90% of the proteins in our graph are entirely new
Current AI-based protein design is fundamentally limited by the datasets available. At Basecamp Research, we solve this problem by creating a categorically different, contextualised data source.
We travel to all corners of the globe to map nature's genetic biodiversity. We have already increased the number of proteins known to science by >50%. This map unlocks the next holy grail of computational biology: in silico prediction of complex function.
We travel to all corners of the globe to map nature's genetic biodiversity. We have already increased the number of proteins known to science by >50%. This map unlocks the next holy grail of computational biology: in silico prediction of complex function.
Evolutionary context
Our database is already 3X larger than UniProt, and 4X less redundant
Continuous growth
We've increased the number of proteins known to science by >50%
Why a knowledge graph?
In silico predictions beyond traditional annotations
Public databases are a catalogue - basically a table.
Our database is a knowledge graph - a network that shows us relationships between proteins that have never been seen before. We leverage our unparalleled collection of environmental and genomic contextual data, as well as the network architecture of the graph to predict protein function. We can identify any complex function, including ones that aren't currently annotated, without relying on sequence similarity or prior mechanistic insight.
We’ve identified sequences with an improved performance that are as low as 20% in sequence identity to literature references, but with identical function and structure. We help you optimise performance, generate new IP, and cut down on lab testing.
Get in touchOur database is a knowledge graph - a network that shows us relationships between proteins that have never been seen before. We leverage our unparalleled collection of environmental and genomic contextual data, as well as the network architecture of the graph to predict protein function. We can identify any complex function, including ones that aren't currently annotated, without relying on sequence similarity or prior mechanistic insight.
We’ve identified sequences with an improved performance that are as low as 20% in sequence identity to literature references, but with identical function and structure. We help you optimise performance, generate new IP, and cut down on lab testing.
Explore the world with us
We target our expeditions based on your requirements
We know the exact geological and chemical properties of the locations where various types of protein classes can be found. When you partner with us, we proactively find samples in places that are predicted to have a high hit rate for you.
