OmicsClaw: An AI Research Partner That Remembers Your Multi-Omics Data

Mar 19, 2026

What is OmicsClaw?

OmicsClaw is an AI research assistant with persistent memory for multi-omics analysis. Unlike stateless tools where every session starts from zero, OmicsClaw remembers your datasets, analysis history, and preferences across sessions.

It covers 63+ skills across 6 omics domains: spatial transcriptomics, single-cell, genomics, proteomics, metabolomics, and bulk RNA-seq. Chat with it via Telegram or Feishu, or use the CLI.

Why Memory Matters

Traditional ToolsOmicsClaw
Re-upload data every sessionRemembers file paths and metadata
Forget analysis historyTracks full lineage (preprocess → cluster → DE)
Repeat parameters manuallyLearns and applies your preferences
CLI-only, steep learning curveChat interface + CLI
Stateless executionPersistent research partner

Example conversation with memory:

Session 1:
You: [Upload visium_brain.h5ad]
You: "Preprocess this"
Bot: Done. [Saves dataset + analysis to memory]

Session 2 (next day):
You: "Find spatial domains"
Bot: "Using your Visium brain data (5000 spots, normalized yesterday)"
     Done. [Links to parent analysis]

Quick Start

git clone https://github.com/TianGzlab/OmicsClaw.git
cd OmicsClaw
pip install -e .
pip install -r bot/requirements.txt

# Configure
cp .env.example .env
# Edit .env: set LLM_API_KEY and bot tokens

# Start Telegram bot
python bot/telegram_bot.py

Command Line

pip install -e .

# Try a demo (no data needed)
python omicsclaw.py run spatial-preprocessing --demo

# Run with your data
python omicsclaw.py run spatial-preprocessing --input data.h5ad --output results/

Installation Tiers

  • pip install -e . — Core system
  • pip install -e ".[spatial]" — Spatial transcriptomics
  • pip install -e ".[singlecell]" — Single-cell omics
  • pip install -e ".[genomics]" — Genomics
  • pip install -e ".[full]" — All 63+ skills

The 6 Omics Domains

DomainSkillsKey Capabilities
Spatial Transcriptomics15QC, clustering, cell typing, deconvolution, spatial statistics, cell communication, velocity
Single-Cell Omics9Preprocessing, doublet detection, annotation, trajectory, batch integration, GRN
Genomics10Variant calling, alignment, annotation, structural variants, assembly, CNV
Proteomics8MS QC, peptide ID, quantification, differential abundance, PTM analysis
Metabolomics8Peak detection, XCMS preprocessing, annotation, normalization, pathway enrichment
Bulk RNA-seq13FASTQ QC, alignment, DE, splicing, enrichment, deconvolution, co-expression, survival

Example: Spatial Transcriptomics Pipeline

# 1. Preprocess: QC, normalize, cluster
python omicsclaw.py run spatial-preprocessing --input data.h5ad --output output/preprocess

# 2. Identify tissue domains
python omicsclaw.py run spatial-domain-identification --input output/preprocess/processed.h5ad --output output/domains

# 3. Find spatially variable genes
python omicsclaw.py run spatial-svg-detection --input output/domains/processed.h5ad --output output/svg

# 4. Cell-cell communication
python omicsclaw.py run spatial-cell-communication --input output/preprocess/processed.h5ad --output output/communication

Smart Orchestration

The orchestrator routes natural language to the right analysis:

# Keyword mode (fast)
python omicsclaw.py run orchestrator --query "find spatially variable genes" --input data.h5ad --output output

# LLM mode (semantic understanding)
python omicsclaw.py run orchestrator --query "I want to understand which genes show spatial patterns" --routing-mode llm --output output

# Named pipelines
python omicsclaw.py run orchestrator --pipeline standard --input data.h5ad --output output

Supported Platforms

Data formats: .h5ad, .vcf, .mzML, .csv/.tsv

Spatial platforms: Visium, Xenium, MERFISH, Slide-seq

Sequencing: 10x scRNA-seq, Illumina, PacBio, LC-MS/MS