# Audio Labeling Editor

Ango Hub provides a labeling editor with which audio files can be annotated.

{% hint style="info" %}
This article will exclusively go over Ango Hub’s audio labeling interface. Features common to all labeling editors are instead [explained here](/labeling/labeling-editor-interface.md).
{% endhint %}

<figure><img src="/files/8imgZQkI9pppmBFj5Zua" alt=""><figcaption></figcaption></figure>

## Overview <a href="#audio-interface-elements" id="audio-interface-elements"></a>

### Supported File Types

The audio labeling editor supports audio assets with the following file extensions:

* .mp3
* .wav
* .ogg

### Supported Labeling Tools

The audio labeling editor supports following labeling tools:

**Tools**

* Entity

**Classifications**

* Radio
* Checkbox
* Single-Select Dropdown
* Multi-Select Dropdown
* Single-Select Tree
* Multi-Select Tree
* Text

**Relations**

* Single Relation
* Group Relation

### Audio Interface Elements <a href="#audio-interface-elements" id="audio-interface-elements"></a>

#### Playback bar <a href="#bottom-bar" id="bottom-bar"></a>

<figure><img src="/files/2vy2TjChrqmo7wFbwjSm" alt=""><figcaption></figcaption></figure>

* <img src="/files/dFwYwtuLuoXFkuaJQEEx" alt="" data-size="line"> allow you to move the playhead to the beginning of the audio file, five seconds back, play/pause, five second forward, and to the end of the file.
* The <img src="/files/BkzdwbnQuFgZ7lGOkEcU" alt="" data-size="line">button becomes active when a segment is selected, and allows you to start playback starting from the beginning of the selected segment. (Pressing R on your keyboard will also enable this.)
* <img src="/files/2lCqiGEKAN5JJcntLDTh" alt="" data-size="line">indicates the position of the playhead / the total duration of the audio file.
* When a segment is selected, <img src="/files/JYcfWsKhv6gBF3ol4Vft" alt="" data-size="line">allow you to move to the next/previous segment.
* If *Auto Scroll* is enabled, the list of segments below the playback bar will move with the playhead, always displaying as selected the currently playing segmentation.
* If *Loop* is enabled, playback will repeat. If you are listening to a single segment (using R), the segment will repeat. Otherwise, the entire audio will repeat. (Pressing L on your keyboard will also enable this.)
* The <img src="/files/ZT9vaUe8W2dLAbhe21VL" alt="" data-size="line"> icon enables you to pick a playback speed.
* The zoom slider enables you to pick a zoom level. You may also zoom in/out with the scroll wheel. Both types of zoom always zoom towards the current location of the playhead.
* The volume slider allows you to change the playback volume.
* For audio assets with more than one channel, the playback bar will also show a *Show Separate Channels* button. Clicking it displays each audio channel as a separate waveform, making it easier to inspect left and right channels independently.
* For stereo audio assets, the playback bar also includes a *Mixer* button. From the Mixer, you may move the stereo balance between *Left*, *Center*, and *Right*, or set an intermediate value with the slider. This allows you to lower one side of the audio and focus on the other channel while labeling.

<figure><img src="/files/48XAmR4wOk3cVoq3v5oT" alt=""><figcaption></figcaption></figure>

#### Top Bar <a href="#how-to-annotate-text" id="how-to-annotate-text"></a>

<figure><img src="/files/q2dJhkZ73aJP2l6PXDTT" alt="" width="563"><figcaption></figcaption></figure>

Click on the "Model Plugins" button to open a list of [model-type plugins](/plugins/plugin-developer-documentation.md) available in your organization.

<figure><img src="/files/Ak48lqHBop0Cx8ejaxfR" alt="" width="563"><figcaption></figcaption></figure>

If a [default preset](/plugins/introduction-to-plugins/plugin-configuration-and-preset-management.md) has been set for the plugin, the button will be clickable, and clicking on it once will run the plugin on the current asset with the default settings.

Clicking on the three dots next to the model name will open the Model Run Dialog, allowing you to customize the plugin's run settings and to run the plugin on the current asset with settings of your choice.

#### Segment List <a href="#how-to-annotate-text" id="how-to-annotate-text"></a>

<figure><img src="/files/7KOxrAWZusEXdA2rWhAP" alt=""><figcaption></figcaption></figure>

The *Segment List* allows you to see, at a glance, each segment currently present in the audio file. If segments have [nested classifications](/labeling/labeling-tools/tools/nested-classifications.md), those classifications will be displayed in the segment list. This can, for example, be a text classification, a dropdown, radio, or any other available [classification type](/data/importing-and-exporting-annotations/exporting-annotations/ango-export-format/asset/task/classifications.md).

Clicking on the three dots next to each class's name will open a menu where you can open [object-specific issues](/core-concepts/issues.md#object-level-issues), or copy the unique Object ID of that object.

The `id` column displays a unique ID for the segment. These are the first three characters of the Object ID.

The *Classifications* column displays nested classifications of the class belonging to the segment, if any.

The *Start* and *End* columns display the timestamp when the segment starts and ends in the audio file. The *Duration* displays the duration of the segment.

You may additionally delete the segment by clicking on the trash can.

## How to Annotate Audio <a href="#how-to-annotate-audio" id="how-to-annotate-audio"></a>

From the playback buttons on the bottom bar, start and stop playback of the audio file as necessary.

{% hint style="info" %}
If you get no sound, make sure that:

* Your system volume is on
* The volume slider on the bottom bar is not all the way to the left
* You have selected the right output device on your computer
  {% endhint %}

From the *Tools* section on the left sidebar, select an *Entity* labeling tool, marked with an underlined *A* icon.

![](/files/-MjioWZvCdnVUd5huu4o)

Click on the waveform where you’d like the annotation to start. Keep the left mouse button pressed and drag until where you’d like the annotation to end. Release the left mouse button.

You can change the start and end points of the annotation by selecting it with left-click, then dragging on one of the ends. You can drag the entire annotation by, after selecting it, clicking and dragging from the middle of the label.

If the labeling tool has nested questions, right-click on the label and click on the menu that appears to see and answer the nested questions.

If classification questions are present, you may answer them from the *Questions* panel on the left sidebar.

![](/files/-Mjio_LSm4Iuu0eZ3cIA)

## Opening Spot Issues <a href="#keyboard-shortcuts" id="keyboard-shortcuts"></a>

Besides issues about the asset as a whole and about individual objects, as outlined in the docs page on [issues](/core-concepts/issues.md), in audio assets you may also open 'spot' issues about specific timestamps or ranges of timestamps.

To do so, click on the *issue bubble* icon at the top-right of the screen. Then, either click once on the audio where you would like to open the issue, or click and drag on the audio waveform over the section of audio related to the issue:

<figure><img src="/files/r2U245thLxp7aJar7iK5" alt=""><figcaption></figcaption></figure>

## Merging two entities <a href="#keyboard-shortcuts" id="keyboard-shortcuts"></a>

Select multiple entities by holding Shift + clicking. Then, click on the "Merge" button that appears:

<figure><img src="/files/Le8uWeVJ5wixMo5keyuT" alt="" width="563"><figcaption></figcaption></figure>

If the entities had attributes (nested questions):

* For all attribute types other than Text, only the attributes of the first entity are retained. (The first entity is the entity that starts first in the audio)
* For the Text attribute type, the text of the second entity is appended to the text of the first entity. (First and second refer to the timestamp at which the entities start in the audio.

## Keyboard Shortcuts <a href="#keyboard-shortcuts" id="keyboard-shortcuts"></a>

A list of keyboard shortcuts is available by clicking on the *Keyboard* button on the right side of the [top bar](/labeling/labeling-editor-interface.md#top-bar):

![](/files/-Mk6H9EHe0BFD31bVpqb)

### Further reading

{% content-ref url="/pages/-Mjifco2debe91gXbTS1" %}
[Labeling Editor Interface](/labeling/labeling-editor-interface.md)
{% endcontent-ref %}


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.imerit.net/labeling/labeling-editor-interface/audio-labeling-editor.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
