MMORE Documentation

MMORE is an open-source multimodal ingestion and retrieval framework designed for heterogeneous document collections.

It provides tools to process documents, build indexes, run retrieval pipelines, and support multimodal workflows, with distributed processing for larger collections and production-oriented settings.

What is MMORE?

MMORE helps you build retrieval systems over complex document collections by combining:

  • document ingestion and processing

  • indexing pipelines

  • retrieval and RAG workflows

  • multimodal retrieval support

  • distributed processing support for large-scale document ingestion

  • evaluation and profiling tools

Where to start

Where you start depends on what you want to do; the page guide below lists the main entry points.

Documentation map

Page guide

Here is a quick overview of the main pages:

  • Installation: set up MMORE and prepare your environment

  • Quickstart: run a first minimal workflow end to end

  • Architecture: understand the main system components and how they interact

  • Processing pipeline: understand how documents are ingested and transformed

  • Indexing: build and manage indexes

  • RAG: structure retrieval-augmented generation workflows

  • ColPali: multimodal retrieval with ColPali

  • Websearch: web search integration and related workflows

  • Evaluation: assess system performance

  • Distributed processing: scale processing across larger workloads

  • Profiler: profile and analyze performance

  • uv: manage the environment and dependencies with uv

  • Cluster and production: deployment and production-oriented guidance

  • For developers: contributor and internal development documentation

  • Index API: API reference for indexing functionality