DeepTone™'s Audio Bytes Processing functionality allows you to extract insights directly from audio bytes provided as NumPy arrays. It is configured in much the same way as File Processing and produces output with the same structure (see an example below).
You can use the `process_audio_bytes` method to process audio bytes directly. In the example below, we read the bytes from an audio file, but you can provide them from any other source.
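As a rough illustration of providing bytes from another source, the sketch below decodes in-memory WAV bytes into a NumPy array using only the standard library and NumPy. The helper name `wav_bytes_to_float_array`, the 16-bit PCM restriction, the float32 scaling to the range [-1.0, 1.0), and the 16 kHz sample rate are all assumptions made for this example, not requirements stated by the SDK; check the SDK reference for the format and arguments that `process_audio_bytes` actually expects.

```python
import io
import wave
from typing import Tuple

import numpy as np


def wav_bytes_to_float_array(wav_bytes: bytes) -> Tuple[np.ndarray, int]:
    """Decode 16-bit PCM WAV bytes into a mono float32 array (hypothetical helper)."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        sample_rate = wav.getframerate()
        channels = wav.getnchannels()
        frames = wav.readframes(wav.getnframes())

    # Little-endian signed 16-bit samples, scaled to [-1.0, 1.0) (assumed format).
    samples = np.frombuffer(frames, dtype="<i2").astype(np.float32) / 32768.0
    if channels > 1:
        # Channels are interleaved: average them down to mono.
        samples = samples.reshape(-1, channels).mean(axis=1)
    return samples, sample_rate


# Build one second of silence in memory so the example runs without a file.
# The 16 kHz rate is an arbitrary choice for this sketch.
buffer = io.BytesIO()
with wave.open(buffer, "wb") as wav:
    wav.setnchannels(1)
    wav.setsampwidth(2)
    wav.setframerate(16000)
    wav.writeframes(b"\x00\x00" * 16000)

audio, sample_rate = wav_bytes_to_float_array(buffer.getvalue())

# `audio` is now a NumPy array that could be handed to process_audio_bytes;
# the exact arguments of that call are not shown here.
```

The same helper would work for bytes received over a network socket or read from a database, as long as they form a complete 16-bit PCM WAV payload.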
The returned object contains the time series with an analysis of the file broken down by the provided output period:
The output of the script would be something like:
For more examples of using `transitions`, head to the Speech detection recipes and the Arousal detection recipes sections. For an example of using the `raw` output to implement custom speech thresholds, head to Example 3 in Speech model recipes.