What is Quantum Reservoir Computing

A gentle path from static quantum feature maps to temporal quantum reservoirs.

Introduction

The easiest way to understand quantum reservoir computing is to begin with a simpler idea: use a quantum system as a feature maker.

Step by stepThe whole page in four moves

Start with ordinary features

A feature is a useful description of data, such as a frequency, an average, or a measured coordinate.

Make those features quantum

A fixed quantum system transforms the input and measurements become the new feature vector.

Train only the readout

The classical readout learns how to combine the measured features for the task.

Add time only at the end

QRC is the same idea, but the quantum state is reused so features can contain recent history.

Think of the quantum system like a strange optical prism. A prism does not decide what the object is. It changes how light is spread out, so details that were hidden can become easier to inspect. In the same spirit, a quantum feature map does not need to be the final decision maker. It can reshape the input into measurement numbers that a simple classical model can use.

In this view, the quantum part is not the whole model. It is a fixed transformation. Data goes in, the quantum system produces measurement features, and a small classical readout learns how to use those features.

Read this asSeparate the model into two jobs

The quantum system creates coordinates. The readout learns what those coordinates mean for the task. This separation is the main reason the idea is approachable: we do not have to train every internal quantum interaction to understand the basic workflow.

This page builds the idea slowly. We first look at Quantum Extreme Learning Machines, where each input is transformed once. Then we add time and memory, which turns the same readout principle into quantum reservoir computing.

The path is deliberately staged. If the data points are independent, the static QELM picture is the cleanest starting point. If the data arrives as a sequence, the QRC picture adds one missing ingredient: the quantum state is reused, so the current feature vector can carry a trace of what came before.

New to quantum mechanics? The Quantum Mechanics Basics primer covers states, gates, measurement, and open-system dynamics. You do not need all details for this page, but the primer explains the symbols.

Learning path

Feature maps->QELM->Linear readout->Time and memory->QRC

Keep one sentence in mind: QELM is a static quantum feature transformation, while QRC is a quantum feature transformation that also carries temporal state.

Pause and checkIf this is your first contact with the topic

Do not try to understand QRC first. First understand the static pipeline: input, fixed transformation, measured features, trained readout. QRC only adds state reuse over time.

What we now know

We have separated the big idea into two parts: a fixed feature maker and a trained readout. Next we slow down and define what a feature map is before making it quantum.

Why Feature Maps?

Many learning problems become easier after we change how the data is represented. The original input may be hard to separate with a line, but a transformed version can expose useful structure.

This is not a quantum-specific trick. It is a general learning idea. A raw image is often transformed into edges and textures. A sentence is transformed into embeddings. A sound wave may be transformed into frequency features. The point is always the same: choose coordinates where the useful pattern is easier to express.

Before quantumA feature is just a useful description of the data

rawRaw object

A sound clip is a long list of pressure values over time. That raw list is hard to classify directly.

featureUseful descriptions

We compute features such as average loudness, dominant frequency, and how quickly the sound changes.

readoutSimple decision

A simple model can now combine those few descriptions to decide whether the sound is calm or sharp.

Quantum features follow the same logic. The difference is only how the new descriptions are produced.

A feature map is just a transformation:

\mathbf{x} \longmapsto \mathbf{z}(\mathbf{x})

In plain words

Take the original description $\mathbf{x}$ and rewrite it as a new description $\mathbf{z}(\mathbf{x})$ . Nothing has been classified yet. We have only changed how the data is represented.

The vector $\mathbf{x}$ is the original input. The vector $\mathbf{z}(\mathbf{x})$ is the new representation. After that transformation, we often use a very simple readout:

y = \mathbf{w}^{\top}\mathbf{z}(\mathbf{x}) + b

In plain words

Multiply each feature by a learned weight, add the results, then add a bias. The readout is simple; the feature map is what made the coordinates useful.

The readout is linear, but the features can be rich. That is the trick. Instead of training a large complicated model everywhere, we keep the feature maker fixed and train only the final layer.

Read this asThe readout can stay simple because the coordinates changed

A linear readout can only add weighted coordinates. It cannot invent complex geometry by itself. The feature map does the geometric work first. After that, the readout only has to learn which measured coordinates should count positively, negatively, or barely at all.

Toy feature mapOne tiny input becomes coordinates for a simple readout

1Start with two values

$\mathbf{x}=[0.2,0.8]$ could mean two simple descriptors of a signal.

2Transform once

A fixed map turns that input into measured features, for example $\mathbf{z}(\mathbf{x})=[0.31,-0.44,0.72]$ .

3Train only the readout

The readout computes $\hat{y}=\mathbf{w}^{\top}\mathbf{z}+b$ and learns the weights $\mathbf{w}$ from examples.

The numbers are illustrative. The structure is the important part: the feature map creates coordinates, then the trained readout combines them.

Worked readoutA linear readout is just a weighted sum of the measured features

\mathbf{z}=[0.31,-0.44,0.72],\quad \mathbf{w}=[0.8,-0.2,0.5],\quad b=-0.1

First feature contribution

0.8\cdot 0.31=0.248

Second feature contribution

-0.2\cdot(-0.44)=0.088

Third feature contribution

0.5\cdot 0.72=0.36

Bias and final score

\hat{y}=0.248+0.088+0.36-0.1\approx 0.60

In real training, the data chooses the weights. Here the numbers only make the mechanics visible.

What to look at

The sketch below shows the same idea geometrically. The labels do not change; only the coordinate system changes, making a simple linear boundary more useful.

Why Transform Features?The readout is simple. The transformation makes the data easier for that simple readout.

Pause and checkCan you say what is trained?

At this stage, only the readout weights are trained. The feature map can be fixed. This simple separation will carry directly into QELM and QRC.

What we now know

A feature map changes the coordinate system. A linear readout then combines those coordinates. Quantum feature maps use a quantum system to create the coordinates, but the learning logic is the same.

Quantum Extreme Learning Machines

A Quantum Extreme Learning Machine, or QELM, applies this feature-map idea with a quantum system. The word “extreme” comes from classical extreme learning machines: the hidden layer is fixed, and only the output layer is trained.

In plain words

QELM means: use a quantum system as the fixed hidden layer, measure what comes out, and train a classical output layer on top.

That last sentence is the whole mental model. A QELM does not try to tune the quantum system for every task. Instead, it treats the quantum system as a fixed, expressive hidden layer. The training problem is pushed to the classical readout, where ordinary regression or classification tools are stable and cheap.

In a QELM, the hidden layer is quantum. You encode an input into a quantum circuit or quantum system, let it transform the input, measure observables, and train a classical readout on those measured values.

QELM in four slow stepsThe quantum part is a fixed feature factory

1Encode

Put the input values into controllable parameters, such as rotation angles or drive amplitudes.

2Transform

Let the fixed quantum circuit or fixed physical dynamics mix the encoded information.

3Measure

Read out selected observables and turn those measurement averages into a feature vector.

4Fit

Train only a classical linear readout on top of those features.

The important point is that this is static. One input goes in. One feature vector comes out. The quantum system is not remembering a previous input.

Read this asStatic means independent examples

If the first example is a red point and the second example is a green point, the second feature vector should not depend on the first example. Each data point receives the same fixed transformation recipe. That is exactly what you want for many ordinary classification or regression datasets.

One-shot QELM exampleThe quantum system does not remember the previous data point

AInput

A small pattern is encoded as $\mathbf{x}$ .

BFixed quantum pass

The same circuit or quantum dynamics are used for every training example.

CClass score

Measurements give $\mathbf{z}(\mathbf{x})$ ; a linear readout turns them into a score such as $\hat{y}=0.56$ .

After this input is processed, the next input starts from the same fixed recipe. That is why QELM is static.

Single data point storyFollow one example all the way through QELM

1A pattern arrives

The raw input is $\mathbf{x}=[0.2,0.8]$ .

2The quantum map acts

The fixed quantum recipe transforms the input into a state $\rho(\mathbf{x})$ .

3Measurements become features

We estimate $\mathbf{z}=[0.31,-0.44,0.72]$ .

4The readout scores it

The trained readout returns a score, for example $\hat{y}\approx0.60$ .

Nothing in this story depends on the previous data point. That is the static QELM assumption.

What to look at

Read the diagram left to right. The quantum block is fixed; the trainable part is only the small readout block at the end.

Static Quantum Feature MapQELM uses a fixed quantum system once per input. Only the final linear readout is trained.

Pause and checkStatic should now mean one-shot

If you can say “one input creates one feature vector, independently of the previous input,” you have the core QELM picture.

What we now know

QELM is the first quantum version of the feature-map idea. It is powerful but still static: the feature vector describes the current input, not a history.

Static Quantum Features

Let the input be $\mathbf{x}$ . A fixed quantum map prepares a state that depends on that input. We do not need to specify the full circuit here. Conceptually, it creates a state:

\rho(\mathbf{x})

In plain words

After the input has gone through the fixed quantum recipe, the quantum system is in a state that depends on that input.

Read $\rho(\mathbf{x})$ as “the quantum state after the input has been encoded and transformed.” The page does not need the full machinery of density matrices yet. For now, the important part is simply that the state depends on the input.

Then we measure a small list of observables. These measurements become the feature vector:

\mathbf{z}(\mathbf{x}) = [\langle O_1\rangle_{\mathbf{x}},\langle O_2\rangle_{\mathbf{x}},\ldots,\langle O_m\rangle_{\mathbf{x}}]

In plain words

Measure several chosen quantities. Each measurement average becomes one coordinate of the feature vector $\mathbf{z}(\mathbf{x})$ .

Each entry in this vector is one measured summary of the quantum state. One observable might capture one kind of correlation, another observable might capture a different kind of population imbalance, and another might respond strongly only when two parts of the system interact. Together they form a richer coordinate system than the raw input.

Measurement averageA feature can be an average over repeated measurements

O1Measure an observable

Suppose one observable returns $+1$ in 68 shots and $-1$ in 32 shots.

avgConvert shots into a number

The estimated feature is $(68-32)/100=0.36$ .

zRepeat for several observables

Doing this for $O_1,O_2,O_3$ gives a vector such as $\mathbf{z}=[0.36,-0.12,0.61]$ .

This is why the page writes features as expectation values. They summarize repeated measurement statistics, not a single magical readout.

The trained part is still only the readout:

\hat{y} = \mathbf{w}^{\top}\mathbf{z}(\mathbf{x}) + b

This makes QELM easy to reason about. The quantum system is a fixed feature generator. The classical readout learns how to combine those features for classification or regression.

Read this asThe quantum system creates many possible views of the same input

The readout does not need to know how the quantum dynamics produced every feature. It only needs a table of feature vectors and target labels. This is why QELM is often a good baseline: it tells us how much can already be achieved by static quantum features before adding temporal memory.

Pause and checkExpectation value does not need to feel abstract here

For this page, read an expectation value as a measured average. It is one number extracted from many repeated measurements of the transformed quantum state.

What we now know

Static quantum features are measurement summaries of a fixed quantum transformation. They become the inputs to the classical readout.

Linear Separation

Suppose two classes are mixed together in the original input space. A linear model may fail because no straight boundary separates them well.

For example, imagine two kinds of sensor signal. In the raw coordinates, one axis might be average amplitude and the other axis might be average frequency. Those two numbers may not be enough: the two classes overlap because the important distinction is hidden in a more subtle pattern.

A good feature map bends the representation without making the readout complicated. After the quantum transformation, the same classes may be separated by a simple linear boundary in feature space.

The phrase “linear separation” can sound abstract, but the picture is simple. We are not changing the labels. Red examples stay red and green examples stay green. We are only changing the coordinate system in which the readout sees them.

This is the first mental model for QELM: the quantum system does not directly make the final decision. It creates coordinates where the final decision is easier.

Geometry exampleThe feature map changes the coordinate system, not the labels

RawBefore the map

Imagine two signal types curled around each other. A straight line cuts through both groups, so a linear model is confused.

MappedAfter the map

The quantum features spread the same examples into new coordinates. Now one simple line can separate most red points from most green points.

Common beginner trapWrong mental model versus right mental model

wrongThe quantum system is the whole learner

This makes the topic feel mysterious and too large. It suggests the quantum device must directly know the task.

rightThe quantum system makes features

The quantum part creates useful measured coordinates. The classical readout learns the task-specific combination.

Read this asDo not think of the quantum map as a classifier yet

The fixed quantum map is more like a complicated lens. It can reveal useful structure, but the task-specific decision still belongs to the readout. This keeps the training story clean and gives us a fair way to compare static feature maps against temporal reservoirs later.

Key idea

The quantum system does the representation work. The linear readout does the task-specific learning.

Pause and checkAsk this before claiming QRC is needed

Is the task really temporal, or can independent static quantum features already solve much of it? This question is why QELM is an important baseline.

What we now know

Linear separation explains why a fixed feature map can be useful. It does not yet explain memory. Memory enters only when we stop resetting the system between inputs.

◈Review: test your understanding

0/7 cards

Question

What is the simplest way to describe a feature map?

From Static to Time

QELM is useful, but it does not automatically model time. If you give it one input, it returns one transformed feature vector. If you give it the next input, it starts again.

Step by stepThe transition from QELM to QRC

Keep the readout idea

We still measure features and train a simple classical readout.

Stop treating inputs as isolated

The next input should be processed in the context left by previous inputs.

Reuse an internal state

The reservoir state becomes the carrier of short-term memory.

Starting again is not a bug. For independent data points it is exactly what we want. But it becomes a limitation when the meaning of the current input depends on what happened just before it.

Many real tasks are not like that. A signal, a sensor stream, a financial series, or a chaotic system depends on what happened before. The current value often makes sense only together with recent history.

Consider a sequence $0.2,0.5,0.4$ . The value $0.4$ alone tells us something, but the short history tells us more: the signal rose and then slightly fell. A static feature map that sees only $0.4$ cannot directly know that local trend.

To handle that, we need one extra ingredient: a state that is reused over time. That is where reservoir computing enters.

Reset versus reuseThe whole difference starts with what happens between inputs

QELMReset or start fresh

Process $\mathbf{x}_1$ , measure $\mathbf{z}(\mathbf{x}_1)$ , then process $\mathbf{x}_2$ independently. The second feature vector does not know what the first input was.

QRCReuse the state

Process $u_1$ , keep the resulting state, then inject $u_2$ into that state. The second feature vector can now contain traces of both inputs.

In plain words

QELM asks, “What features does this input produce?” QRC asks, “What features does the current reservoir state produce after seeing this input and recent previous inputs?”

What to look at

Compare the two rows. In QELM, each input is processed independently. In QRC, the state loops forward, so the next feature vector can carry context.

The Missing Ingredient: TimeQELM is one-shot. QRC reuses a state, so the current feature vector can carry recent history.

Pause and checkThe one new ingredient is state reuse

If you remember only one sentence here, use this: QRC is QELM-like feature generation plus a state that is carried forward in time.

What we now know

We have crossed from independent examples to sequences. The next section gives that sequence idea a compact state-update formula.

Quantum Reservoir Computing

Quantum reservoir computing keeps the readout idea from QELM, but changes the feature maker from static to temporal.

Step by stepOne QRC update step

Receive the current input

A new value arrives, such as the next point in a signal.

Mix it with the old state

The reservoir does not start from zero; it starts from what the previous inputs left behind.

Measure the new state

The measured features now describe a history-aware state.

The quantum system is now used more like a dynamical medium than a one-shot feature map. Each new input nudges the current state. The state responds according to its own fixed quantum dynamics, and then we measure features from the resulting state.

Instead of preparing a completely fresh feature map for each input, the quantum system has a state at time $t$ . The new input updates that state:

\rho_t = F(\rho_{t-1}, u_t)

In plain words

The new reservoir state equals the result of applying the fixed update rule to the old state and the new input.

Read this asUnpack the state update term by term

$\rho_{t-1}$ means what the reservoir currently remembers. $u_t$ is the new incoming value. $F$ is the fixed update rule: encode the new input, let the quantum system evolve, and produce the next state $\rho_t$ .

Here $u_t$ is the input at the current time step. The previous state $\rho_{t-1}$ carries recent history. The fixed update rule $F$ is the quantum evolution plus input injection.

The word “fixed” is important. The internal quantum dynamics are not usually trained by backpropagation through every time step. Instead, the reservoir provides a rich stream of temporal features, and the classical readout learns how to combine them.

After the update, we measure features:

\mathbf{z}_t = [\langle O_1\rangle_t,\ldots,\langle O_m\rangle_t]

In plain words

The feature vector at time $t$ is made from measurements of the current reservoir state. Because the state was reused, those features can contain recent history.

And again, the trained part is only a simple readout. The difference is that $\mathbf{z}_t$ can now contain information about the recent past, not only the current input.

This memory should be useful but not permanent. Very recent inputs should often matter more than very old inputs. This is the idea of fading memory: the reservoir carries context forward, but old context gradually loses influence so the system remains stable.

Pause and checkDo not confuse memory with perfect storage

QRC does not need to remember every old input exactly. It needs a useful trace of recent history, shaped by the reservoir dynamics and readable through measurements.

What we now know

QRC turns a fixed quantum feature maker into a temporal feature maker. The readout is still simple, but the measured features now come from a state that has memory.

A Simple Prediction Task

Imagine a time series:

u_1,u_2,u_3,\ldots

A next-step prediction task asks: after seeing values up to time $t$ , what should come next? The target is not the current value. The target is the future value $u_{t+1}$ .

Three time stepsHow the previous state becomes useful memory

t-2Inject 0.2

The reservoir state starts forming a trace of the signal.

t-1Inject 0.5

The state changes again. It now reflects that the signal rose.

tInject 0.4

The state reflects the current value plus the recent rise-and-drop context.

t+1Predict next

The readout uses measured features from that state to estimate what comes next.

This is the key beginner intuition: QRC does not only see the latest value; it sees the latest value through a state shaped by recent values.

At every time step, the quantum reservoir receives the next value. Its state changes. We measure a feature vector $\mathbf{z}_t$ and ask a linear readout to predict the next value:

\hat{u}_{t+1} = \mathbf{w}^{\top}\mathbf{z}_t + b

In plain words

Use the measurements available now to predict the next value. The readout learns which measured temporal features are useful for that prediction.

Read this formula from right to left. The reservoir measurements $\mathbf{z}_t$ are the evidence available now. The weights $\mathbf{w}$ say which pieces of evidence tend to matter. The bias $b$ shifts the final prediction up or down.

The reservoir is useful because $\mathbf{z}_t$ is not just a transformed version of $u_t$ . It is a transformed version of the current input together with a fading trace of earlier inputs.

In the toy sequence $0.2,0.5,0.4$ , a static map would only transform the last value if we gave it only $0.4$ . A reservoir can transform the last value while still being influenced by the rise from $0.2$ to $0.5$ and the drop from $0.5$ to $0.4$ .

That is the core reason QRC is a temporal model. The state is reused, so the system can remember enough context for prediction without training the quantum dynamics themselves.

Toy time seriesA reused state turns isolated values into a short memory

u(t-2)=0.2u(t-1)=0.5u(t)=0.4predict u(t+1)

1Update

The new value $u_t=0.4$ updates the old reservoir state $\rho_{t-1}$ .

2Measure

The measured vector $\mathbf{z}_t$ now reflects the current input and traces of earlier inputs.

3Predict

The readout uses $\hat{u}_{t+1}=\mathbf{w}^{\top}\mathbf{z}_t+b$ to estimate the next value.

The reservoir does not need to store the whole past exactly. It only needs a useful fading trace for the prediction task.

What to look at

The important loop is the green feedback arrow. It means the reservoir state after this step becomes the previous state for the next step.

A Simple Prediction LoopFor time-series prediction, the reservoir state is updated at every step and the readout predicts the next value.

Pause and checkCan you identify the training target?

The readout is trained so that features at time $t$ predict the next value $u_{t+1}$ . The reservoir creates the features; the readout learns the prediction.

What we now know

Prediction is where the temporal nature becomes practical. The model uses a history-aware feature vector now to estimate a future value.

◈Review: test your understanding

0/7 cards

Question

What extra ingredient turns the static QELM idea toward reservoir computing?

QELM vs QRC

QELM and QRC share the same learning philosophy: keep the quantum part fixed, measure features, and train a classical readout.

The difference is memory. QELM is a static feature map. QRC is a temporal feature map with state reuse.

This comparison matters scientifically. If a QRC model performs well, we should ask whether the gain came from quantum feature expansion alone or from the temporal reservoir state. A static QELM baseline helps separate those two explanations.

In plain words

QELM tests the value of quantum features. QRC tests the value of quantum features plus temporal memory. A careful comparison needs both.

QuestionQELMQRC

InputOne data pointA sequence over time

Between inputsStart fresh or treat examples independentlyReuse the previous quantum state

Quantum roleStatic feature mapStateful dynamical system

Feature meaningFeatures describe the current inputFeatures describe the current state after recent history

MemoryNo built-in temporal memoryPrevious state affects current features

ReadoutLinear model on measured featuresLinear model on measured temporal features

Natural taskStatic classification or regressionForecasting, streaming signals, temporal pattern recognition

Baseline roleShows what static quantum features already achieveShows what state reuse and fading memory add

Best mental modelQuantum kernel-like feature expansionQuantum feature expansion with fading memory

Pause and checkThe comparison is not just vocabulary

If a task has no meaningful temporal structure, QRC may not be the right explanation for a gain. If the task depends on history, QRC can add something that static QELM cannot.

What we now know

QELM and QRC share a readout philosophy. The difference is whether measured features describe one input or a state shaped by an input history.

What to Remember

Start with QELM: input goes into a fixed quantum feature map, measurements become features, and a linear readout learns the output.

This already captures a useful pattern: the quantum system can be valuable even when it is not trained internally. It can act as a high-dimensional, nonlinear feature generator. For independent examples, that may be the right model to use.

Then add time: the quantum state is not reset after every input. It evolves from one step to the next, so measurements can reflect recent history. That is quantum reservoir computing.

This is the extra step that makes QRC different. The feature vector is no longer only about the current input. It is about the current input as seen through a state that has been shaped by previous inputs. That is why QRC belongs naturally to forecasting and sequence-processing tasks.

The simplest summary is:

QELM transforms inputs into quantum features. QRC transforms input histories into quantum features.

Step by stepFinal script to keep in your head

Feature map

Change the representation so a simple readout can work.

QELM

Use a fixed quantum system as a static feature map for independent examples.

QRC

Reuse the quantum state so measured features can carry recent history.

Read this asUse the simplest model that matches the data

If examples are independent, start with the QELM picture. If order matters, if trends matter, or if the current value only makes sense in context, move to the QRC picture. The readout idea stays the same; the feature maker gains memory.

Pause and checkOne last self-test

If someone asks for the difference in one sentence, answer: QELM maps a single input to quantum features; QRC maps a recent input history to quantum features through a reused quantum state.

Once this distinction is clear, the more technical pages on measurement, classical reservoirs, and physical implementations become much easier to read.

◈Review: test your understanding

0/6 cards

Question

What is the shortest distinction between QELM and QRC?