🚀 Quickstart

Overview

This page helps you get MMORE running quickly with a minimal workflow.

The goal is not to cover every configuration option, but to give you a first successful setup and a clear mental model of the main steps.

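In a typical MMORE workflow, you will process raw documents, index the processed outputs, run retrieval over the index, and optionally pass the retrieved results into a RAG pipeline.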

Make sure you have already read Installation.

You should also confirm that:

The exact commands depend on your repository entry points, but the overall workflow is as follows.

Start with a small and simple document set before moving to large-scale or distributed workloads.

For example, create a folder containing a few representative documents:

sample_data/
├── doc1.pdf
├── doc2.pdf
├── doc3.html
└── doc4.md
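
Before processing, it can help to confirm what the folder actually contains. The snippet below is a minimal sketch using only the Python standard library. It is not part of MMORE, and the `summarize_documents` helper name is hypothetical. It counts the files under a folder by extension so you can check that the sample set matches what you expect.

```python
from collections import Counter
from pathlib import Path


def summarize_documents(folder: str) -> Counter:
    """Count files under `folder` by lowercase extension.

    Hypothetical helper for a quick sanity check; not an MMORE API.
    Returns an empty Counter if `folder` is not a directory.
    """
    root = Path(folder)
    if not root.is_dir():
        return Counter()
    return Counter(
        path.suffix.lower() or "<none>"  # files without an extension
        for path in root.rglob("*")
        if path.is_file()
    )


if __name__ == "__main__":
    # Assumes the sample_data/ folder from the example above.
    for extension, count in sorted(summarize_documents("sample_data").items()):
        print(f"{extension}: {count}")
```

If an extension you did not expect shows up here, it is usually simpler to remove those files from the sample set before the first run.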

Processing transforms raw documents into a form that MMORE can index and retrieve from.

Depending on your setup, this step may include:

See Processing pipeline for the detailed logic.

Once documents are processed, create an index so they can be searched efficiently.

This step usually includes:

See Indexing for the full indexing workflow.

After indexing, you can test retrieval on a few example queries.

At this stage, you want to verify simple things:

If your workflow includes generation, retrieval results can then be passed into a RAG pipeline.

See RAG for how retrieval and generation are combined.

Conceptually, a first MMORE run looks like this:

Raw documents
    ↓
Processing
    ↓
Structured outputs / chunks / metadata
    ↓
Indexing
    ↓
Retrieval
    ↓
Optional RAG generation
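
To make the data flow concrete, here is a toy sketch of the same stages in plain Python. It does not use or reproduce MMORE's API: every name (`Chunk`, `process`, `build_index`, `retrieve`, `build_prompt`) is hypothetical, chunking is a fixed-size word split, and retrieval is simple term counting rather than whatever index your MMORE setup uses. It only illustrates what each stage consumes and produces.

```python
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Chunk:
    doc_id: str
    text: str


def tokenize(text: str) -> list[str]:
    """Lowercase and split into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def process(raw_docs: dict[str, str], chunk_size: int = 40) -> list[Chunk]:
    """Processing (toy): split each raw text into chunks of `chunk_size` words."""
    chunks = []
    for doc_id, text in raw_docs.items():
        words = text.split()
        for start in range(0, len(words), chunk_size):
            chunks.append(Chunk(doc_id, " ".join(words[start:start + chunk_size])))
    return chunks


def build_index(chunks: list[Chunk]) -> list[tuple[Chunk, Counter]]:
    """Indexing (toy): keep a bag-of-words term count next to each chunk."""
    return [(chunk, Counter(tokenize(chunk.text))) for chunk in chunks]


def retrieve(index: list[tuple[Chunk, Counter]], query: str, top_k: int = 3) -> list[Chunk]:
    """Retrieval (toy): rank chunks by occurrences of the query terms."""
    terms = set(tokenize(query))
    scored = [(sum(counts[t] for t in terms), chunk) for chunk, counts in index]
    scored = [pair for pair in scored if pair[0] > 0]  # drop non-matching chunks
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [chunk for _, chunk in scored[:top_k]]


def build_prompt(query: str, hits: list[Chunk]) -> str:
    """Optional RAG step (toy): assemble retrieved text into a generator prompt."""
    context = "\n".join(f"[{c.doc_id}] {c.text}" for c in hits)
    return f"Context:\n{context}\n\nQuestion: {query}"


if __name__ == "__main__":
    raw = {
        "doc1": "Indexing builds a searchable structure over processed chunks.",
        "doc2": "Retrieval returns the chunks most relevant to a query.",
    }
    index = build_index(process(raw))
    question = "how does indexing work"
    print(build_prompt(question, retrieve(index, question)))
```

Each stage takes only the previous stage's output, which is why a problem seen at retrieval time can usually be traced back by inspecting the chunks and the index in turn.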

After your first run, verify the following:

Warning

Do not start with a large or noisy collection.

When debugging a document-processing pipeline, a very small dataset is much easier to inspect and validate.
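
If you only have a large collection at hand, one way to get a small test set is to copy a few files into a separate folder. The sketch below uses only the Python standard library. The `copy_sample` helper and the folder names in the usage note are hypothetical and unrelated to MMORE itself.

```python
import random
import shutil
from pathlib import Path


def copy_sample(source: str, target: str, n: int = 5, seed: int = 0) -> list[Path]:
    """Copy up to `n` randomly chosen files from `source` into `target`.

    Hypothetical helper, not an MMORE API. Files are copied flat into
    `target`, so two files with the same name in different subfolders
    would overwrite each other.
    """
    # Materialize the file list first so copies are never re-scanned.
    files = sorted(p for p in Path(source).rglob("*") if p.is_file())
    rng = random.Random(seed)  # fixed seed keeps the sample reproducible
    chosen = rng.sample(files, min(n, len(files)))

    out_dir = Path(target)
    out_dir.mkdir(parents=True, exist_ok=True)
    return [Path(shutil.copy2(path, out_dir / path.name)) for path in chosen]
```

For example, `copy_sample("all_documents", "sample_data", n=4)` would copy four files into `sample_data/`. Random sampling may miss a file type you care about, so it is worth checking the result by hand.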

Typical first-run problems include:

After this page, the best next steps are: