# What is data orchestration?

Data orchestration is the process of managing and conducting data-related tasks. In kiara, you can organize, transform, and track your research data throughout the analytical process.

### From Sources to Data

Historical sources don't arrive as neatly structured data. Letters, census records, newspapers, and other primary sources must be transformed into machine-readable formats before computational analysis becomes possible. This transformation process, in which you *create* your dataset, involves critical decisions that shape your research outcomes – yet many digital tools obscure that process. When you simply click through software interfaces, you may lose sight of how your sources become data, how that data is manipulated, and ultimately how your interpretations relate to the original materials.

### Data Orchestration, documented

Kiara therefore allows you to both perform and track each of those steps. Rather than processing your data through monolithic "black box" tools, kiara facilitates the creation of data [pipelines](/before-you-begin/key-concepts.md#pipeline) that consist of distinct, interconnected [modules](/before-you-begin/key-concepts.md#module). Each module has clearly defined inputs, outputs, and processes, allowing you to understand exactly what is happening to your data at each step. At the same time, kiara automatically tracks the ancestry of your data through every transformation. At any point, you can see the [lineage](/before-you-begin/key-concepts.md#lineage) of how a particular dataset was created, including all the inputs and processes involved, making your [workflow](/before-you-begin/key-concepts.md#workflow) and findings both explainable and reproducible.

<figure><img src="/files/9JTF1Z85wI2qMaBgmIX9" alt=""><figcaption></figcaption></figure>


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.dharpa.org/before-you-begin/what-is-data-orchestration.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
