Blog | Ellarion Cybernetics

Anthropic on issues with research data in biology

2026/06/14 data drug discovery ai shorts

Anthropic's recent research post brings additional points to Arachne.ai's use-case-specific materialisation approach for biomedical data in agentic context.

Public life science data infrastructure is too brittle for agents

2026/06/01 data infrastructure life sciences drug discovery shorts

After a quick refresher of dealing with Elixir-class data providers, I reminded myself how the public life science tools and databases are.

Medallion Architecture for Early Stage Startups

2026/05/26 data architecture data startups shorts

Medallion architecture is one of those late-stage / enterprise patterns that I've found surprisingly useful quite early. Research projects, app projects, early-stage startups - you name it.

Measuring a loop of AI spiral development

2026/05/22 artificial-intelligence agentic-coding

I finally got around to make any sort of measurement of how AI - based development (no code touching) works for me.

Structured LLM extraction from text - my go to setup

2026/05/03 artificial-intelligence llm data-extraction data-processing

One of the core tasks that has been completely overtaken by LLMs is reading blocks of text and pulling out the most relevant information. A quick description of a pattern I tend to overuse in this space.

TabPFN - a deep learning architecture for tabular data that actually works

2025/01/19 artificial-intelligence deep-learning shorts

Tabular data remains the last bastion unconquered by deep learning. This might change soon.

Making your biomedical dataset more appealing

2024/08/30 data shorts

If you have a dataset that could be valuable for biomedical research, here are a few key points to consider from our perspective as data integrators and users in research-centric biotech and pharma projects.

Deploying biomedical LLMs

2024/06/24 llm arachne.ai data

We have recently deployed a biomedical LLM system that now helps with finding drugging opportunities for a novel modality. In this post, we share the technical stack we used.

Two main families of data models and how they affect your engineering - AI/ML communication

2021/11/26 technical data arachne.ai

This post tries to explain, using an enormous simplification, the difference between two leading families of data models that are in use in modern data workflows, especially in the context of feeding ML/AI - relational and document/object representations.

Arachne.ai - a biomedical data backbone

2021/08/10 arachne.ai data

For the past several months, we have been finalising the first usable release of a system that we think would remedy some high-impact pain points we have seen in working with data in high-paced bio- and med-tech companies.

Introducing HDRUK’s Innovation Gateway - the next generation health data resource

2021/04/22 biokeanos data

Health and clinical data is a central part of understanding the reality of dealing with diseases. The need for it only increases while the access-related challenges make it complicated to acquire. Thankfully due to a recent effort by HDRUK, more and more datasets can be shared.

Chartering the vastness of biomedical data

2021/03/31 biokeanos data

There is an unprecedented demand for biomedical data both for research and application, driven by the availability of large-scale processing technology and the resurgence of predictive modelling / AI. Given the vast amount of possible sources, how does one know which one they should be using?

How we host our sites

2021/03/24 technical cloud

There are so many available solutions to website or blog hosting that it almost seems like a non-problem. One does not even need any technical skills to set some of them up. The question remains, how do you decide what to use?

Read more about How we host our sites →

Ellarion blog