Framework

The Foundations Suite

The baseline layer included with every HFA dataset—rights-cleared recordings and consistent technical structure designed for modern model training and evaluation.

Always included. Foundations is present across every dataset and every license tier.
Designed for utility. Structured for controllable modeling—not “clip library” browsing.
Compatible by default. Clean filenames, consistent exports, and predictable organization.

Technical baseline

Foundations emphasizes consistency and repeatability. The exact contents vary by dataset, but the technical baseline stays stable across the catalog.

Capture discipline

Controlled recording environments, consistent gain staging, and clean source handling.

Standardized exports

Predictable file formats, naming patterns, and structure for programmatic ingestion.

Core metadata

Dataset-level documentation plus baseline descriptive fields for filtering and organization.

Repeatable structure

Consistent foldering and clip layout so teams can scale usage across multiple datasets.

Start with Foundations—then expand as needed

Most teams begin with Foundations + Proteus, then add Orpheus layers only if their workflows require advanced annotation.