Installation

Requirements

Clone the repository and install with pip's editable mode:

git clone https://github.com/ianlintner/audio_engineer.git
cd audio_engineer
pip install -e "."

The package ships several optional dependency groups:

| Extra | What it adds | Install command |
| --- | --- | --- |
| (none) | Core MIDI generation | `pip install -e "."` |
| `dev` | pytest and ruff (required for development) | `pip install -e ".[dev]"` |
| `api` | FastAPI + Uvicorn REST server | `pip install -e ".[api]"` |
| `llm` | LangChain with OpenAI and Anthropic providers | `pip install -e ".[llm]"` |
| `gemini` | Google GenAI SDK: Lyria 3 music generation, audio analysis, TTS | `pip install -e ".[gemini]"` |
| `audio` | pydub for WAV/MP3 processing | `pip install -e ".[audio]"` |
| `docs` | MkDocs + Material for building this documentation site | `pip install -e ".[docs]"` |
| `all` | Every extra at once | `pip install -e ".[all]"` |
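To see which optional groups are usable in the current environment, a small check along the following lines may help. This is only a sketch: the mapping from extras to importable module names (`EXTRA_MODULES`) is an assumption based on the table above, not something read from the project's packaging metadata, and the helper names are hypothetical.

```python
import importlib.util

# Assumed mapping from each extra to module names it is expected to provide.
# Adjust it to match the project's actual dependency lists.
EXTRA_MODULES = {
    "dev": ["pytest"],
    "api": ["fastapi", "uvicorn"],
    "llm": ["langchain"],
    "gemini": ["google.genai"],
    "audio": ["pydub"],
    "docs": ["mkdocs"],
}


def is_importable(module_name):
    """Return True if the module can be located without importing it."""
    try:
        return importlib.util.find_spec(module_name) is not None
    except (ImportError, ValueError):
        # find_spec raises if a parent package (e.g. "google") is missing
        # or if a module has a broken spec.
        return False


def installed_extras(mapping=None):
    """Map each extra to True when all of its assumed modules are importable."""
    mapping = EXTRA_MODULES if mapping is None else mapping
    return {
        extra: all(is_importable(name) for name in modules)
        for extra, modules in mapping.items()
    }


if __name__ == "__main__":
    for extra, available in installed_extras().items():
        print(f"{extra:8s} {'installed' if available else 'missing'}")
```

An extra reported as missing can be added later by rerunning the matching install command from the table.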

WAV rendering requires an external audio backend.

FluidSynth

# macOS
brew install fluidsynth

# Ubuntu / Debian
sudo apt install fluidsynth

# Windows (Chocolatey)
choco install fluidsynth

Download a SoundFont (e.g. GeneralUser GS) and set SOUNDFONT_PATH:

export SOUNDFONT_PATH=/path/to/soundfont.sf2

TiMidity

# Ubuntu / Debian
sudo apt install timidity

# macOS
brew install timidity
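Before attempting a WAV render, it can be useful to confirm that a backend executable is on `PATH` and that `SOUNDFONT_PATH` points to a real file. The sketch below does that with the standard library only. The function names are hypothetical, and how audio_engineer itself locates the backend is not documented here, so treat this as an independent sanity check.

```python
import os
import shutil
from pathlib import Path


def find_backend(candidates=("fluidsynth", "timidity")):
    """Return (name, path) for the first backend executable on PATH, or None."""
    for name in candidates:
        path = shutil.which(name)
        if path:
            return name, path
    return None


def soundfont_status(env=None):
    """Describe whether SOUNDFONT_PATH is set and refers to an existing file."""
    env = os.environ if env is None else env
    value = env.get("SOUNDFONT_PATH")
    if not value:
        return "SOUNDFONT_PATH is not set"
    if not Path(value).expanduser().is_file():
        return f"SOUNDFONT_PATH points to a missing file: {value}"
    return f"SoundFont found: {value}"


if __name__ == "__main__":
    backend = find_backend()
    print("Backend:", f"{backend[0]} at {backend[1]}" if backend else "none found")
    print(soundfont_status())
```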

To verify the installation, print the package version and generate a demo session:

python -c "import audio_engineer; print(audio_engineer.__version__)"
python scripts/generate_demo.py --genre blues --key A --mode minor -v

You should see a session ID printed and MIDI files created under ./output/.
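To confirm that the demo actually wrote files, the output directory can be inspected with a few lines of Python. This is a minimal sketch: the `.mid` / `.midi` extensions and the possibility of per-session subdirectories are assumptions, and `list_midi_files` is a hypothetical helper rather than part of the package.

```python
from pathlib import Path


def list_midi_files(output_dir="output"):
    """Return a sorted list of MIDI files found anywhere under output_dir."""
    root = Path(output_dir)
    if not root.is_dir():
        return []
    return sorted(
        path
        for path in root.rglob("*")
        if path.is_file() and path.suffix.lower() in {".mid", ".midi"}
    )


if __name__ == "__main__":
    files = list_midi_files()
    if not files:
        print("No MIDI files found under ./output/")
    for path in files:
        print(f"{path}  ({path.stat().st_size} bytes)")
```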