Recipe Index

The ever-growing cookbook of examples shows common use-cases of the DeepTone SDK, pre- and post-processing strategies, visualisation ideas, and more. The recipes here are a great way to get familiar with the capabilities of DeepTone™.

Available SDK Recipes

💡 Speech Detection
1. basic analysis of audio files with built-in summarization options
2. custom summarization options for audio file analysis
3. custom speech threshold using the raw output of the model
💡 Gender Model
1. streaming from a microphone and reporting gender in real-time
2. streaming from a microphone and reporting long monologues in real-time
💡 Arousal Model
1. basic analysis of audio files with built-in summarization options
2. custom summarization options for audio file analysis
💡 Emotions Model
1. detect when speaker is tired
💡 Language Model
1. basic analysis of audio file with built-in summary
💡 Speaker Detection
1. quick visualisation of speaker transitions
2. per speaker analysis, utilising additional models
3. classic speaker separation - split a mono file into a stream for each speaker
4. handle imperfect results
💡 Underage Speaker Detection
1. detect when speaker is a child
💡 Audio Event Detection
1. detect when a speaker is laughing
💡 LUFS Model
1. detect if an audio contains segments that are either too loud or too quiet

Available SDK Recipes​

Available SDK Recipes