The Ultimate Guide to Literature Search Skills for AI Agents

Mar 21, 2026

Why Literature Search Skills Matter

If you have ever asked an AI agent to "find recent papers on CRISPR delivery mechanisms," you have probably received a confident-sounding list of papers that do not exist. Hallucinated citations are one of the most persistent failure modes of language models.

Literature search skills solve this by giving your agent direct access to real databases — PubMed, arXiv, OpenAlex, bioRxiv, and more. Instead of fabricating references, the agent queries actual APIs and returns verifiable results with DOIs, PMIDs, and links.

We surveyed 7 major skill registries (K-Dense, FreedomIntelligence, ClawBio, ARIS, Imbad0202, Orchestra, VoltAgent) and found 30+ skills related to literature search and academic research. This guide breaks them down by use case so you can pick exactly what you need.


Category 1: Single-Database Search

These skills connect your agent to one specific database. They are lightweight, often require zero configuration, and are the best starting point for most researchers.

| Skill | Registry | Database | API Key? | Best For |
|---|---|---|---|---|
| pubmed-database | K-Dense (15.6K stars) | PubMed (NCBI E-utilities) | No | Boolean/MeSH queries, BioPython integration |
| pubmed-search | FreedomIntelligence (1.5K stars) | PubMed (BioPython Entrez) | No | Quick keyword search with abstracts |
| pubmed-summariser | ClawBio (476 stars) | PubMed (NCBI Entrez) | No | Gene/disease research briefs |
| arxiv-database | K-Dense | arXiv (Atom API) | No | Search by keyword, author, ID, date, category |
| arxiv-search | FreedomIntelligence | arXiv (Valyu API) | Valyu Key | Semantic search across all arXiv |
| /arxiv | ARIS (2.9K stars) | arXiv API | No | Download PDFs to local library |
| biorxiv-database | K-Dense | bioRxiv | No | Life science preprints |
| medrxiv-search | FreedomIntelligence | medRxiv (Valyu) | Valyu Key | Medical preprint semantic search |
| openalex-database | K-Dense | OpenAlex (240M papers) | No (free) | Largest open scholarly database |

Which single-database skill should you pick?

  • Biomedical researchers: Start with pubmed-database from K-Dense. Of the PubMed skills listed, it has the most complete feature set, including Boolean queries and MeSH term support.
  • CS/ML researchers: Use arxiv-database from K-Dense. No API key needed, covers all arXiv categories.
  • Any discipline: openalex-database from K-Dense covers 240 million papers across all fields, completely free.
  • Quick gene/disease briefs: pubmed-summariser from ClawBio is purpose-built for questions like "what is the latest research on gene X?"
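
To make the PubMed option more concrete, here is a minimal sketch of the kind of request a PubMed skill might send to the NCBI E-utilities ESearch endpoint. The function names (build_esearch_url, parse_pmids, search_pubmed) are illustrative assumptions, not the actual code of pubmed-database or any other skill listed here.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Public NCBI E-utilities search endpoint.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_esearch_url(term: str, max_results: int = 5) -> str:
    """Build an ESearch URL that asks PubMed for matching PMIDs as JSON."""
    params = {
        "db": "pubmed",
        "term": term,            # Boolean operators and [MeSH Terms] tags go here
        "retmode": "json",
        "retmax": max_results,
    }
    return f"{ESEARCH_URL}?{urlencode(params)}"


def parse_pmids(payload: str) -> list[str]:
    """Extract the list of PMIDs from an ESearch JSON response body."""
    return json.loads(payload)["esearchresult"]["idlist"]


def search_pubmed(term: str, max_results: int = 5) -> list[str]:
    """Hypothetical helper: run the search over the network and return PMIDs."""
    with urlopen(build_esearch_url(term, max_results), timeout=30) as resp:
        return parse_pmids(resp.read().decode("utf-8"))
```

Calling search_pubmed('"CRISPR-Cas Systems"[MeSH Terms] AND delivery[Title/Abstract]') would return PMIDs that can each be checked at https://pubmed.ncbi.nlm.nih.gov/<PMID>/, which is what makes the results verifiable rather than invented.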

Installation example:

clawhub install pubmed-database
clawhub install arxiv-database
clawhub install openalex-database
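
Once installed, a skill such as arxiv-database talks to the arXiv Atom API named in the table above. The following is a rough sketch of that exchange using only the Python standard library. The helper names are assumptions made for illustration, and the real skill may structure its queries differently.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode
from urllib.request import urlopen

ARXIV_API = "http://export.arxiv.org/api/query"
ATOM = {"atom": "http://www.w3.org/2005/Atom"}  # namespace used by the Atom feed


def build_arxiv_url(query: str, max_results: int = 5) -> str:
    """Build a query URL, e.g. query='cat:cs.LG AND all:transformer'."""
    params = {"search_query": query, "start": 0, "max_results": max_results}
    return f"{ARXIV_API}?{urlencode(params)}"


def parse_arxiv_feed(xml_data) -> list[dict]:
    """Turn an Atom feed (str or bytes) into a list of paper records."""
    root = ET.fromstring(xml_data)
    papers = []
    for entry in root.findall("atom:entry", ATOM):
        title = entry.findtext("atom:title", default="", namespaces=ATOM)
        papers.append({
            "id": entry.findtext("atom:id", default="", namespaces=ATOM).strip(),
            "title": " ".join(title.split()),  # collapse line breaks in titles
            "published": entry.findtext("atom:published", default="", namespaces=ATOM).strip(),
        })
    return papers


def search_arxiv(query: str, max_results: int = 5) -> list[dict]:
    """Hypothetical helper: fetch the feed over the network and parse it."""
    with urlopen(build_arxiv_url(query, max_results), timeout=30) as resp:
        return parse_arxiv_feed(resp.read())
```

Each record's id field is the paper's arXiv abstract URL, so every result the agent reports can be opened and checked.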

Category 2: Multi-Source Search

When a single database is not enough, these skills search multiple sources simultaneously and merge the results.

| Skill | Registry | Sources | API Key? | Highlights |
|---|---|---|---|---|
| literature-search | FreedomIntelligence | PubMed + arXiv + bioRxiv + medRxiv | Valyu Key | Four-source semantic search with aggregation |
| biomedical-search | FreedomIntelligence | Above + ClinicalTrials.gov + FDA | Valyu Key | Adds clinical and regulatory data |
| medical-research-toolkit | FreedomIntelligence | 14+ databases (ChEMBL, OpenTargets, OMIM, UniProt, KEGG...) | MCP config | Most comprehensive biomedical search |
| lit-synthesizer | ClawBio | PubMed (MeSH) + bioRxiv + medRxiv | No | Citation graphs + research gap analysis |

Recommendations

  • Most comprehensive: medical-research-toolkit searches 14+ databases but requires MCP configuration.
  • Best value: lit-synthesizer from ClawBio needs no extra API keys and adds citation graph construction and gap analysis on top of search.
  • Quick multi-source: literature-search from FreedomIntelligence searches four sources simultaneously but requires a Valyu key.
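
Merging results from several sources means the same paper can appear more than once, for example as a bioRxiv preprint and again as a PubMed-indexed article. How each skill handles this is not documented here, so the following is only a sketch of one plausible approach: deduplicate on a normalized DOI and fall back to a normalized title. The record shape (dicts with "title" and "doi" keys) is an assumption.

```python
import re


def normalize_doi(doi):
    """Lowercase a DOI and strip resolver prefixes so equal DOIs compare equal."""
    if not doi:
        return None
    doi = doi.strip().lower()
    return re.sub(r"^(https?://(dx\.)?doi\.org/|doi:)", "", doi) or None


def title_key(title):
    """Reduce a title to lowercase alphanumerics separated by single spaces."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def merge_results(*sources):
    """Merge record lists, keeping the first record seen for each paper."""
    merged, seen = [], set()
    for records in sources:
        for rec in records:
            keys = set()
            doi = normalize_doi(rec.get("doi"))
            if doi:
                keys.add(("doi", doi))
            tkey = title_key(rec.get("title") or "")
            if tkey:
                keys.add(("title", tkey))
            is_duplicate = bool(keys & seen)
            seen |= keys  # remember every identifier, even for duplicates
            if not is_duplicate:
                merged.append(rec)
    return merged
```

Sources passed first win ties, so putting the most trusted database first (say, PubMed before preprint servers) keeps its metadata in the merged list.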

Category 3: Deep Full-Text Analysis

These go beyond titles and abstracts. They parse full papers, extract structured data, and can produce systematic reviews.

| Skill | Registry | Highlights | API Key? |
|---|---|---|---|
| bgpt-paper-search | K-Dense / FreedomIntelligence | Full-text indexing, 25+ structured fields (methods, results, sample size, quality score) | BGPT MCP |
| literature-review | K-Dense / FreedomIntelligence | Systematic review with PRISMA-style flow, multi-source cross-validation, APA/Nature output | No (chains other skills) |
| /deep-research | Imbad0202 (862 stars) | 13-agent collaboration, 7 modes (full research, quick brief, paper review, systematic review, fact-check, Socratic dialogue, PRISMA + meta-analysis) | No |
| tooluniverse-literature-deep-research | FreedomIntelligence | Target disambiguation, evidence grading, completeness checklist, testable hypothesis generation | MCP config |

Recommendations

  • Deepest analysis: /deep-research from Imbad0202 runs a 13-agent PRISMA pipeline with meta-analysis support. Be warned: a single run can consume 200K+ input tokens.
  • Most structured output: bgpt-paper-search extracts 25+ fields from full text — ideal for evidence synthesis.
  • Best balance: literature-review offers solid depth at reasonable cost with properly formatted output.

Category 4: Citation Management

Because nothing undermines a paper faster than a fabricated reference.

| Skill | Registry | Sources | Highlights |
|---|---|---|---|
| citation-management | K-Dense / FreedomIntelligence | Google Scholar + PubMed + CrossRef + arXiv | DOI/PMID/arXiv ID to BibTeX, prevents hallucinated citations |
| abstract-searcher | VoltAgent Registry | arXiv + Semantic Scholar + CrossRef | Auto-fills abstracts in .bib files |
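
One way a DOI can be turned into BibTeX is DOI content negotiation: asking doi.org for the application/x-bibtex representation of a record. The sketch below illustrates that idea only. It is not the implementation of citation-management, the helper names are hypothetical, and the DOI pattern is a deliberately loose sanity check.

```python
import re
from urllib.parse import quote
from urllib.request import Request, urlopen

# Loose shape check: "10.", a 4-9 digit registrant code, "/", then a suffix.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")


def bibtex_request(doi: str) -> Request:
    """Build a content-negotiation request asking doi.org for BibTeX."""
    if not DOI_PATTERN.match(doi):
        raise ValueError(f"not a DOI: {doi!r}")
    return Request(
        f"https://doi.org/{quote(doi, safe='/')}",
        headers={"Accept": "application/x-bibtex"},
    )


def fetch_bibtex(doi: str) -> str:
    """Hypothetical helper: resolve the DOI over the network.

    urlopen raises urllib.error.HTTPError if the DOI does not resolve,
    which is the signal that a reference may be fabricated.
    """
    with urlopen(bibtex_request(doi), timeout=30) as resp:
        return resp.read().decode("utf-8")
```

The useful property is the failure mode: a DOI the model made up does not resolve, so the citation is rejected instead of being written into the .bib file.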

Category 5: Novelty Checking

| Skill | Registry | Highlights |
|---|---|---|
| /novelty-check | ARIS | Extracts 3-5 core claims, searches each independently, adversarial cross-model review |

This is particularly valuable before submitting a paper — let your agent verify that your claims are actually novel.


Category 6: End-to-End Research Pipelines

These skills treat literature search as one step in a larger autonomous research workflow.

| Skill | Registry | Pipeline |
|---|---|---|
| autoresearch | Orchestra (5.3K stars) | Literature survey -> topic selection -> experiments -> paper writing, supports overnight runs |
| /idea-discovery | ARIS | Literature -> ideas -> novelty check -> peer review -> iterative refinement |
| /deep-research + /academic-pipeline | Imbad0202 | 10-stage academic pipeline: research -> write -> review -> revise -> finalize |

Beyond the major registries, these community projects offer specialized skills worth knowing about.

AIPOCH Medical Research Skills (37 stars)

A collection of original medical research skills covering the full evidence-based medicine workflow: search strategy design, database selection, evidence level prioritization, critical appraisal, and gap identification.

bash <(curl -s https://raw.githubusercontent.com/aipoch/medical-research-skills/main/scientific-skills/scripts/openclaw-install.sh)

OpenClaw Master Skills — LeoYeAI (1.96K stars)

A curated collection updated weekly with 339+ skills. Notable entries:

  • academic-deep-research — transparent research process, not a black-box API wrapper
  • deep-research-pro — multi-source deep research agent with cited reports

clawhub install academic-deep-research

AMiner Open Skill — CanXiangCC

Access AMiner's scholarly platform with 28 APIs covering scholars, papers, institutions, journals, and patents.

  • aminer-data-search — full-featured (requires AMINER_API_KEY)
  • aminer-free-search — free tier for discovery and filtering

General-Purpose Registries

| Skill | Registry | Source | API Key? | Notes |
|---|---|---|---|---|
| deep-research (Gemini) | Antigravity (25K stars) | Google Gemini Deep Research API | GEMINI_API_KEY | $2-5 per run, 2-10 minutes, async support |
| exa-search | Antigravity | Exa API (semantic search) | Exa Key | Has a dedicated "research papers" category |
| latex-paper-conversion | Antigravity | Local pdflatex/bibtex | No | Convert between Springer, MDPI, IEEE, Nature formats |

Best Starting Point (Zero Config)

openalex-database from K-Dense

240 million papers, completely free, no API key required, covers all disciplines. If you install one skill today, make it this one.

clawhub install openalex-database
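
For a sense of what sits behind a skill like this, here is a minimal sketch of a query against the public OpenAlex works endpoint. It assumes the endpoint accepts unauthenticated requests, as the "no API key" claim above implies. The helper names are illustrative and not taken from openalex-database itself.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

OPENALEX_WORKS = "https://api.openalex.org/works"


def build_openalex_url(query: str, per_page: int = 5) -> str:
    """Build a full-text search URL against the OpenAlex works endpoint."""
    return f"{OPENALEX_WORKS}?{urlencode({'search': query, 'per-page': per_page})}"


def summarize_works(payload: str) -> list[dict]:
    """Keep only the fields needed to cite and verify each result."""
    works = []
    for work in json.loads(payload).get("results", []):
        works.append({
            "title": work.get("display_name"),
            "year": work.get("publication_year"),
            "doi": work.get("doi"),          # full https://doi.org/... link, or None
            "openalex_id": work.get("id"),
        })
    return works


def search_openalex(query: str, per_page: int = 5) -> list[dict]:
    """Hypothetical helper: run the search over the network."""
    with urlopen(build_openalex_url(query, per_page), timeout=30) as resp:
        return summarize_works(resp.read().decode("utf-8"))
```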

Best for Biomedical Research

lit-synthesizer from ClawBio

Searches PubMed with MeSH terms plus bioRxiv and medRxiv preprints. No extra API keys. The bonus citation graph and research gap analysis features make it stand out from pure search tools.

Best for Deep Research

/deep-research from Imbad0202

A 13-agent PRISMA pipeline supporting 7 different modes, including meta-analysis. No other skill in this survey matches its depth. Budget 200K+ tokens per run.

Best End-to-End Pipeline

/idea-discovery from ARIS

Goes from literature search to idea generation to novelty checking to peer review — all autonomously. Supports overnight runs for hands-off research exploration.


Quick Comparison Table

| Use Case | Recommended Skill | API Key? | Token Cost | Depth |
|---|---|---|---|---|
| Quick PubMed search | pubmed-database | No | Low | Basic |
| CS/ML paper search | arxiv-database | No | Low | Basic |
| Any discipline | openalex-database | No | Low | Basic |
| Multi-source biomedical | lit-synthesizer | No | Medium | Medium |
| 14-database biomedical | medical-research-toolkit | MCP | Medium | High |
| Systematic review | /deep-research | No | Very High (200K+) | Very High |
| Full research pipeline | /idea-discovery | No | High | End-to-End |
| Citation management | citation-management | No | Low | Utility |
| Novelty checking | /novelty-check | No | Medium | Utility |

Things to Watch Out For

  1. API key requirements: Some skills depend on third-party services (Valyu, BGPT MCP, OpenRouter) and need their keys or MCP configuration. Check before you commit to a workflow.
  2. Shared upstream sources: K-Dense and FreedomIntelligence share many skills from the same upstream — do not install both copies.
  3. Token costs: /deep-research from Imbad0202 can consume 200K+ tokens in a single run. Great for thorough reviews, expensive for quick lookups.
  4. Quality variance: VoltAgent Registry skills vary in quality. Prefer curated versions from K-Dense, FreedomIntelligence, or ClawBio when available.
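
To put the token warning in concrete terms, the arithmetic can be sketched as below. The per-million-token price is a placeholder you supply yourself, not a quoted rate for any model, and the estimate covers input tokens only, so the real cost of a run will be higher once output tokens are counted.

```python
def estimate_run_cost(input_tokens: int, usd_per_million_tokens: float) -> float:
    """Estimate the input-token cost of one run in US dollars."""
    return input_tokens * usd_per_million_tokens / 1_000_000


def runs_within_budget(budget_usd: float, input_tokens: int,
                       usd_per_million_tokens: float) -> int:
    """Count how many whole runs fit in a budget at the given price."""
    cost = estimate_run_cost(input_tokens, usd_per_million_tokens)
    return int(budget_usd // cost) if cost > 0 else 0
```

Plugging in 200_000 tokens and your provider's actual input price gives a floor for one /deep-research run, which is a quick way to decide whether a lookup justifies the deep pipeline or a single-database skill is enough.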

Getting Started

The fastest path from zero to working literature search:

# 1. Install the universal search skill
clawhub install openalex-database

# 2. Add PubMed if you do biomedical research
clawhub install pubmed-database

# 3. Add arXiv if you do CS/ML research
clawhub install arxiv-database

# 4. Add citation management to prevent hallucinated references
clawhub install citation-management

Once installed, just ask your agent naturally: "Find recent papers on transformer architectures for protein folding" — and it will query real databases instead of making things up.