Skip to content

CLI Reference

Command-line interface for AutoRAG-Research.

Environment Variables

Variable Description
POSTGRES_HOST PostgreSQL host
POSTGRES_PORT PostgreSQL port
POSTGRES_USER PostgreSQL user
POSTGRES_PASSWORD PostgreSQL password

Commands

autorag-research

AutoRAG-Research CLI - RAG research on steroids.

Usage

autorag-research [OPTIONS] COMMAND [ARGS]...

Arguments

No arguments available

Options

Name Description Required Default
-cp, --config-path PATH Path to configuration directory [env var: AUTORAG_RESEARCH_CONFIG_PATH] No -
-V, --version Show version and exit No -
--help Show this message and exit. No -

Commands

Name Description
health-check Health check a specific model config.
show Show available resources.
init Download default configuration files to...
run Run experiment pipelines with metrics...
data Manage PostgreSQL dump files via...
drop Drop PostgreSQL databases.
ingest Ingest datasets into PostgreSQL.
plugin Manage AutoRAG-Research plugins.

Subcommands

autorag-research health-check

Health check a specific model config.

Loads the model config via Hydra and runs a health check to verify the model is functional.

Examples: autorag-research health-check embedding mock autorag-research health-check llm openai-gpt4 autorag-research health-check reranker cohere

Usage

autorag-research health-check [OPTIONS] MODEL_TYPE:{embedding|llm|reranker} NAME

Arguments

Name Description Required
MODEL_TYPE:{embedding|llm|reranker} Model type to check: embedding, llm, or reranker Yes
NAME Config name (YAML filename without extension) Yes

Options

Name Description Required Default
--help Show this message and exit. No -

autorag-research show

Show available resources.

RESOURCE types: datasets - Available dump files (optionally filter by ingestor name) ingestors - Available data ingestors with their parameters pipelines - Available pipeline configurations metrics - Available evaluation metrics databases - Available PostgreSQL databases

Examples: autorag-research show datasets autorag-research show datasets beir autorag-research show ingestors autorag-research show pipelines autorag-research show metrics autorag-research show databases

Usage

autorag-research show [OPTIONS] RESOURCE:{datasets|ingestors|pipelines|metrics|databases} [NAME]

Arguments

Name Description Required
RESOURCE:{datasets|ingestors|pipelines|metrics|databases} Resource type: datasets, ingestors, pipelines, metrics, or databases Yes
[NAME] Resource name (e.g., ingestor name for 'datasets') No

Options

Name Description Required Default
--help Show this message and exit. No -

autorag-research init

Download default configuration files to the configured directory.

Downloads configuration files from the AutoRAG-Research GitHub repository to your local configs directory. Existing files are not overwritten.

Examples: autorag-research init autorag-research --config-path=/my/configs init

Usage

autorag-research init [OPTIONS]

Arguments

No arguments available

Options

Name Description Required Default
--help Show this message and exit. No -

autorag-research run

Run experiment pipelines with metrics evaluation.

Configuration is loaded from configs/experiment.yaml (or specified --config-name).

Examples: autorag-research run --db-name=beir_scifact_test autorag-research run --db-name=beir_scifact_test --verbose autorag-research run --db-name=beir_scifact_test --config-name=my_experiment

Usage

autorag-research run [OPTIONS]

Arguments

No arguments available

Options

Name Description Required Default
-d, --db-name TEXT Database schema name (required) No -
-cn, --config-name TEXT Config file name without .yaml extension [default: experiment] No -
-v, --verbose Enable verbose logging No -
--help Show this message and exit. No -

autorag-research data

Manage PostgreSQL dump files via HuggingFace Hub.

Usage

autorag-research data [OPTIONS] COMMAND [ARGS]...

Arguments

No arguments available

Options

Name Description Required Default
--help Show this message and exit. No -

Subcommands

autorag-research data restore

Download and restore a dump to PostgreSQL.

Downloads the dump file from HuggingFace Hub (if not cached) and restores it to a PostgreSQL database. The database will be created if it doesn't exist.

Examples: autorag-research data restore beir scifact_openai-small autorag-research data restore beir scifact_openai-small --db-name=my_custom_db autorag-research data restore mteb nfcorpus_bge-small --clean autorag-research data restore beir scifact_openai-small --clean --yes

Usage

autorag-research data restore [OPTIONS] INGESTOR FILENAME

Arguments
Name Description Required
INGESTOR Ingestor name (e.g., beir, mteb) Yes
FILENAME Dump filename without .dump extension Yes
Options
Name Description Required Default
--db-name TEXT Target database name (defaults to filename) No -
--clean Drop database objects before recreating No -
--no-owner / --with-owner Skip restoration of object ownership [default: no-owner] No -
-y, --yes Skip confirmation prompts No -
--help Show this message and exit. No -
autorag-research data dump

Export a database to a dump file.

Creates a PostgreSQL dump file using pg_dump in custom format, which can be restored with 'autorag-research data restore' or pg_restore.

Examples: autorag-research data dump --db-name=beir_scifact_test autorag-research data dump --db-name=beir_scifact_test --output=./backup.dump

Usage

autorag-research data dump [OPTIONS]

Arguments

No arguments available

Options
Name Description Required Default
--db-name TEXT Database name to dump Yes -
-o, --output PATH Output file path (defaults to .dump) No -
--no-owner / --with-owner Skip output of ownership commands [default: no-owner] No -
--help Show this message and exit. No -
autorag-research data upload

Upload a dump file to HuggingFace Hub.

Requires authentication via HF_TOKEN environment variable or 'huggingface-cli login'.

Examples: autorag-research data upload ./scifact.dump beir scifact_openai-small autorag-research data upload ./scifact.dump beir scifact_openai-small -m "Add new dump" autorag-research data upload ./scifact.dump beir scifact_openai-small --repo myorg/custom-repo

Usage

autorag-research data upload [OPTIONS] FILE INGESTOR FILENAME

Arguments
Name Description Required
FILE Path to the dump file to upload Yes
INGESTOR Ingestor name (e.g., beir, mteb) Yes
FILENAME Target filename without .dump extension Yes
Options
Name Description Required Default
-r, --repo TEXT Override HuggingFace repo ID (e.g., myorg/my-repo) No -
-m, --message TEXT Commit message for the upload No -
--help Show this message and exit. No -

autorag-research drop

Drop PostgreSQL databases.

Usage

autorag-research drop [OPTIONS] COMMAND [ARGS]...

Arguments

No arguments available

Options

Name Description Required Default
--help Show this message and exit. No -

Subcommands

autorag-research drop database

Drop a PostgreSQL database.

Examples: autorag-research drop database --db-name=beir_scifact_test autorag-research drop database --db-name=beir_scifact_test --yes

Usage

autorag-research drop database [OPTIONS]

Arguments

No arguments available

Options
Name Description Required Default
--db-name TEXT Database name to drop Yes -
-y, --yes Skip confirmation prompts No -
--help Show this message and exit. No -

autorag-research ingest

Ingest datasets into PostgreSQL.

Usage

autorag-research ingest [OPTIONS] COMMAND [ARGS]...

Arguments

No arguments available

Options

Name Description Required Default
-n, --name TEXT Ingestor name (beir, mteb, ragbench, etc.). Use 'autorag-research show ingestors' to see all. No -
-e, --extra TEXT Ingestor-specific params as key=value (e.g., --extra dataset-name=scifact) No -
--subset [train|dev|test] Dataset split: train, dev, or test [default: test] No -
--query-limit INTEGER Maximum number of queries to ingest No -
--min-corpus-cnt INTEGER Minimum number of corpus documents to ingest No -
--db-name TEXT Custom database name (auto-generated if not specified) No -
--embedding-model TEXT Embedding model config name from configs/embedding/ [default: openai-small] No -
--embed-batch-size INTEGER Batch size for embedding [default: 128] No -
--embed-concurrency INTEGER Max concurrent embedding calls [default: 16] No -
--skip-embedding Skip embedding step (ingest data only) No -
--help Show this message and exit. No -

autorag-research plugin

Manage AutoRAG-Research plugins.

Usage

autorag-research plugin [OPTIONS] COMMAND [ARGS]...

Arguments

No arguments available

Options

Name Description Required Default
--help Show this message and exit. No -

Subcommands

autorag-research plugin sync

Discover installed plugins and copy their YAML configs into configs/.

Scans all installed packages that register autorag_research.pipelines or autorag_research.metrics entry points. YAML files are copied into the local configs/ directory. Existing files are never overwritten.

Example::

pip install autorag-research-elasticsearch
autorag-research plugin sync
Usage

autorag-research plugin sync [OPTIONS]

Arguments

No arguments available

Options
Name Description Required Default
--help Show this message and exit. No -
autorag-research plugin create

Scaffold a new plugin project in the current directory.

Creates a plugin directory structure with: - pyproject.toml with entry_points configured - Pipeline/metric skeleton code - YAML config file - Basic test file

Example::

autorag-research plugin create my_search --type=retrieval
cd my_search_plugin
pip install -e .
autorag-research plugin sync
Usage

autorag-research plugin create [OPTIONS] NAME

Arguments
Name Description Required
NAME Plugin name (e.g., 'my_custom_retrieval') Yes
Options
Name Description Required Default
-t, --type TEXT Plugin type: retrieval, generation, metric_retrieval, metric_generation, ingestor Yes -
--help Show this message and exit. No -

Quick Cleanup Example

autorag-research drop database --db-name=beir_scifact_test_openai_small --yes