DeepTone™'s Audio Bytes Processing functionality allows you to extract insights directly from audio bytes provided as numpy arrays. It is configured in much the same way as File Processing and produces output with the same structure (see the example below).
You can use the `process_audio_bytes` method to process audio bytes directly. In the example below, we read the bytes from an audio file, but you can provide them from any other source. Make sure that the provided audio data is in one of the Supported Audio Formats, and remember to set the correct sampling rate of the provided audio data with the corresponding parameter.
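As a minimal sketch of the preparation step described above, the helper below reads a WAV file into a numpy array and returns the sampling rate alongside it. The helper name `read_wav_as_array` is hypothetical and not part of the SDK, and the sketch assumes 16-bit PCM input, which you should check against the Supported Audio Formats. The array and the sampling rate would then be passed to `process_audio_bytes`; the exact parameter names are not shown in this section, so consult the SDK reference for them.

```python
import wave

import numpy as np


def read_wav_as_array(path):
    """Return (sample_rate, samples) for a 16-bit PCM WAV file.

    Hypothetical helper for illustration only; it is not part of the SDK.
    """
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM audio")
        sample_rate = wav.getframerate()
        n_channels = wav.getnchannels()
        frames = wav.readframes(wav.getnframes())

    # WAV stores 16-bit samples as little-endian signed integers.
    samples = np.frombuffer(frames, dtype="<i2")
    if n_channels > 1:
        # Interleaved channels become one column per channel.
        samples = samples.reshape(-1, n_channels)
    return sample_rate, samples
```

Keeping the sampling rate together with the samples makes it harder to pass a mismatched rate when the audio comes from a source other than a file.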
The returned object contains the time series with an analysis of the file broken down by the provided output period:
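To illustrate how such a time series might be consumed, here is a rough sketch that flattens it into rows. The entry layout used here (a `timestamp` plus per-model `results` holding a `result` and a `confidence`) is an assumption made for illustration, and the sample entries are hand-written placeholders, not SDK output. Adapt the keys to the structure the SDK actually returns.

```python
def flatten_time_series(time_series, model_name):
    """Turn time series entries into (timestamp, result, confidence) rows.

    The key names below are assumed for illustration and may differ
    from the structure returned by the SDK.
    """
    rows = []
    for entry in time_series:
        model_result = entry["results"][model_name]
        rows.append(
            (entry["timestamp"], model_result["result"], model_result["confidence"])
        )
    return rows


# Hand-written placeholder entries, one per output period.
sample_time_series = [
    {"timestamp": 0, "results": {"speech": {"result": "no_speech", "confidence": 0.92}}},
    {"timestamp": 1024, "results": {"speech": {"result": "speech", "confidence": 0.87}}},
]

for timestamp, result, confidence in flatten_time_series(sample_time_series, "speech"):
    print(f"t={timestamp}: {result} (confidence {confidence:.2f})")
```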
The output of the script would be something like:
For more examples of using the transitions, head to the Speech detection recipes and the Arousal detection recipes sections. For an example of using the `raw` output to implement custom speech thresholds, head to Example 3 in Speech model recipes.