under the hood

How Etherpad sings.

There are no recordings inside Etherpad. Every note is synthesized the instant you touch the surface, by Csound — a sound-programming language trusted by composers and researchers since 1986. This is the journey from fingertip to ether.

First, what is sound?

Four ideas explain nearly all of synthesis. Master these and the rest of this page — and most synthesizers ever built — will make sense.

Sound is moving air

Anything that vibrates pushes air in waves. How fast it vibrates is the pitch — faster means higher. How far it swings is the loudness. Everything you hear is just these two numbers, dancing.

An oscillator is a shape on repeat

Draw one tiny wave shape and replay it hundreds of times per second — like a music box cylinder spinning. The shape sets the character: sine is pure, triangle is soft, sawtooth is bright, square is hollow.

FM — waves bending waves

Frequency modulation: one wave wobbles another wave's pitch thousands of times a second. Out come shimmering metallic overtones no single wave could make — the secret behind Etherpad's bells and its signature pad.

The envelope gives it life

Real sounds are born and they die: a bell strikes then rings away, a pad blooms slowly. The envelope shapes a note's loudness over time — attack, sustain, release. It is why lifting your finger fades the note like a breath instead of cutting it dead.

From touch to tone

Your finger becomes data

The app writes each fingertip's position to named channels — touch.0.x, touch.0.y — like order tickets passed through a kitchen window. The engine reads them live, and every touch gets its own independent voice.

Movement is smoothed

Raw touch positions jump in steps that would click and zipper. A portamento opcode glides every value toward its target in ~10 milliseconds — the reason Etherpad feels liquid instead of steppy.

Position snaps to music

Your X position is mapped onto a scale table — a list of semitone steps like 0, 2, 4, 7, 9 — then snapped to the nearest step and converted to a frequency. That's why you can't play a wrong note: the grid is the key.

A voice is synthesized

The chosen sound builds the waveform live at that frequency: FM synthesis for bells and the Ether Pad, band-limited analog-style oscillators for saws and triangles, vowel filters for the choir. Y controls loudness and timbre together — higher is brighter, louder, more alive.

The signal path

Voices never reach the speakers directly. Like a recording studio, everything flows through shared mixing buses — and the effects are what make it ethereal.

Your touch — every finger a player

Voices — one per touch position → pitch → waveform → envelope

Reverba long 0.985 tail — the cathedral

Delay0.8s echoes at 70% — the canyon

Mixer soft-limit, out, clear — 1,378× a second

Music synthesis

Why it sounds ethereal

Three ingredients. A reverb with an unusually long tail keeps every note ringing after you lift your finger, like a clap in a cathedral. A feedback delay repeats each phrase softly, like a voice across a canyon — old notes linger underneath new ones. And portamento everywhere means nothing ever jumps — pitch, volume, and tone all glide. Music made of curves, not corners.

Learn it, build it

The whole engine is open source, and it descends from the Csound MultiTouchXY example by Steven Yi and Victor Lazzarini, built into an instrument by Paul Batchelor. Go deeper: