Celtic Constellation

Highland Bagpipes

High-fidelity isolated bagpipe notes and articulations for modeling phrasing, ornamentation, and chanter behavior.

Structured at the articulation level using documented production workflows and secured under the Proteus Standard™.

PREVIEW

Celtic Constellation

Winds and Airflow

Air

Contact for licensing View Preview on Hugging Face

This dataset is currently in production. Preview audio and full specifications will be added as they become available.

This is a preview release provided via Hugging Face for evaluation purposes. Initial recording passes are available to assess capture quality, labeling structure, and dataset relevance. The full dataset will be delivered privately with complete Proteus provenance, integrity, and documentation upon licensing.

This dataset is complete and available for licensing. A public preview subset will be released via Hugging Face on a rolling basis.

The Great Highland Bagpipes are a mouthblown bagpipe featuring an open-ended chanter and a continuous airflow-driven sound production system, supplied by a combination of breath input and bag pressure. In this instrument, airflow is maintained through active breath control and arm pressure on the bag, producing a powerful, steady tone with limited dynamic range. Expressive control is achieved primarily through fingering technique, articulation timing, and precise management of airflow and pressure rather than changes in amplitude.

‍

Acoustically, the Great Highland Bagpipes occupy a strong and distinctive timbral space characterized by sustained tone, pronounced harmonic content, and continuous sound output. Expressivity emerges through articulation patterns, gesture timing, and ornament-driven movement rather than abrupt shifts in loudness or tone color. Traditionally used in solo, ceremonial, and ensemble contexts, the instrument’s physical design and continuous airflow behavior make it well suited for articulation-level analysis and modeling, where clarity of note transitions, pressure consistency, and repeatable gesture behavior are critical.

Dataset Overview

Key technical details for this dataset — including file counts, duration, delivery format, and session context.

Planned technical specifications and recording standards for this dataset.

Total Files:

Total Files (Preview):

Total Duration (Hours):

0.05

Sample Rate (Hz):

96000

Bit Depth (Delivery):

Dataset Version:

v0.9

Recording Environment:

Treated Studio

Microphone Configuration:

Oktava MK-012 positioned 8 feet away, directly in front of chanter

Performer:

Blake Pullen

Recording Dates:

Sept. 14th, 2025

Recording Location:

Las Vegas, NV

Produced using standardized capture, editing, and QC protocols with versioned metadata and Proteus-backed provenance.

Content and Recording Details

An overview of what’s included in this dataset — from articulations and performance styles to session context and recording notes.

What's in this dataset

This preview dataset contains a curated subset of articulation-focused recordings from the Great Highland Bagpipes.
The material is intended to illustrate the dataset’s structural approach, capture quality, and articulation taxonomy, rather than represent the full scope of the final release.

Included recordings emphasize stable tone production, controlled note transitions, and representative melodic gestures characteristic of Highland bagpipe performance, captured in isolation to support expressive audio modeling, evaluation, and analysis workflows.

The full dataset will expand substantially on this foundation, with broader pitch coverage, a larger articulation set, and an extended corpus of recorded material reflecting the full expressive range of the instrument.

‍

Recording & Session Notes

All audio was recorded in a controlled studio environment using standardized capture, editing, and QC protocols consistent across the Harmonic Frontier Audio catalog.

Source material was captured at 32-bit float to preserve full dynamic headroom and minimize quantization artifacts during editing and processing.
Final preview files are delivered as 24-bit PCM for consistency and downstream compatibility.

A single instrument was used consistently across all sessions to maintain timbral continuity and articulation stability.

Instrument details:
Great Highland Bagpipes - 1995 Gibsons

Additional processing was limited to trimming, fade handling, and integrity checks. No creative processing, normalization, or dynamic shaping was applied beyond what was necessary for clean delivery.

‍

Techniques, Articulations & Gesture Types

A structured breakdown of the expressive building blocks in this dataset — including articulations, dynamics, transitions, and any extended techniques captured during recording.

Unlike clip- or phrase-based datasets, this dataset is structured at the articulation and gesture level. This enables interpretable control, expressive variability, and human-aligned modeling, but significantly increases production complexity and significantly limits who can produce such datasets correctly at scale.

Articulations Included

This preview includes representative examples of core Highland bagpipe articulations, captured in isolation to support articulation-aware modeling and analysis.

Articulations include:

Sustained tones across selected pitches
Controlled note onsets and releases
Melodic transitions between adjacent notes, with and without gracenotes
Continuous airflow-driven legato behavior characteristic of mouthblown pipes

Articulations are recorded without accompaniment or rhythmic framing to preserve clarity, separability, and modeling utility.

‍

Extended Techniques & Gesture Types

The preview dataset includes limited examples of gesture-level behavior intended to demonstrate the structure of the full dataset rather than exhaustively cover all techniques.

Gesture types include:

Micro-variation in breath pressure and airflow stability
Natural pitch transition behavior during finger movement
Subtle articulation differences arising from chanter technique and finger timing

More advanced ornamentation patterns, extended melodic figures, and performance-driven gestures will be included in the full dataset release.

‍

Proteus Standard Compliance

This dataset was recorded, documented, and released under The Proteus Standard™, Harmonic Frontier Audio’s framework for rights-cleared, provenance-audited audio data.

The Proteus Standard ensures:
‍
•Performer-owned, contract-clean source material
•Transparent recording methodology and metadata
•Consistent capture, QC, and documentation practices across the catalog

Learn more about The Proteus Standard

Layer I — Source Provenance

Layer II — Cryptographic Integrity

Layer III — Acoustic Fingerprinting

Performers

Captured with expert musicians and vocalists across global traditions — ensuring each dataset carries authentic nuance, human expression, and rights-managed provenance.

Blake Pullen

Blake Pullen is a multi-disciplinary musician, vocalist, and recording engineer with a background spanning traditional Celtic music, contemporary performance, and audio production.

With formal training in vocal performance and extensive experience recording acoustic instruments, Blake approaches dataset creation from both a musical and systems-oriented perspective. His work emphasizes articulation-level clarity, consistency across sessions, and recording practices designed to support long-term machine learning use rather than short-term musical presentation.

As the founder of Harmonic Frontier Audio, he performs and records the initial datasets to establish a consistent technical and musical foundation for the catalog, ensuring that capture methodology, articulation taxonomy, and provenance standards are applied rigorously from the outset.

‍

Audio Demonstrations

A three-part listening benchmark: a mixed musical demo built from this dataset, the raw source clip, and an AI model’s attempt to reproduce the same prompt.

PRODUCED REFERENCE

A musical demonstration created by replacing a state-of-the-art AI-generated lead instrument with original source recordings from this dataset, then arranged and mastered to preserve musical context. This approach allows direct comparison between current-generation model output and real, rights-cleared acoustic source material.

RAW DATASET CLIP

Directly from the dataset: an isolated, unprocessed example of the source recording.

AI MODEL BENCHMARK (Suno v5 Pro Beta)

An unmodified output from a current-gen AI model given the same musical prompt. Included to illustrate where today’s systems still differ from real, recorded sources.

AI model approximations generated using publicly available state-of-the-art music generation systems.

Interested in licensing this dataset?

Harmonic Frontier Audio datasets are licensed directly to research teams, startups, and enterprise partners. Access models and terms vary based on use case, scale, and integration needs.

Request licensing details View the full HFA catalog

All datasets are delivered with versioned metadata, documented workflows, and Proteus-backed integrity manifests.

We typically respond to inquiries within 1–2 business days.

This dataset is currently in production

This dataset is actively being recorded and prepared. You can request early access, previews, or discuss licensing timelines.

Get notified or request early access View the HFA catalog

We typically respond within 1–2 business days.