The definitive Wolfram Language and notebook experience
The original technical computing environment
All-in-one AI assistance for your Wolfram experience
We deliver solutions for the AI eraâcombining symbolic computation, data-driven insights and deep technology expertise.
Courses in computing, science, life and more
Learn, solve problems and share ideas.
News, views and insights from Wolfram
Resources for
Software DevelopersWe deliver solutions for the AI eraâcombining symbolic computation, data-driven insights and deep technology expertise.
Wolfram SolutionsCourses in computing, science, life and more
Learn, solve problems and share ideas.
News, views and insights from Wolfram
Resources for
Software DevelopersSpeech computation consists of processing speech signals and analyzing them to infer information. Operations include changing the speaker pitch, detecting voiced intervals and recognizing the speaker or the speech. The Wolfram Language provides built-in and fully integrated audio processing, statistical analysis, visualization and machine learning, which enables easy-to-prototype and highly efficient speech computations.
Generating & Importing Speech »SpeechSynthesize — synthesize a speech signal from text
AudioCapture — capture a speech signal from an input device
Audio ▪ Import ▪ WebAudioSearch ▪ ExampleData ▪ ResourceData ▪ ...
VisualizationSpectrogram — plot the spectrogram of a speech signal
Cepstrogram ▪ Periodogram ▪ AudioPlot
Understanding SpeechSpeechRecognize — speech-to-text to convert a spoken audio signal to text
LanguageIdentify ▪ SpeechCases ▪ SpeechInterpreter ▪ PitchRecognize ▪ SpeakerMatchQ
Speech AnalysisAudioIntervals — find voiced or unvoiced intervals
AudioLoudness ▪ AudioLocalMeasurements ▪ ShortTimeFourier
Speech ManipulationAudioPitchShift — apply pitch shifting to a speech signal
AudioTimeStretch ▪ AudioFrequencyShift
Speech SynthesisSpeechSynthesize — produce spoken signal from text
Machine Learning »Classify — perform classification on a collection of speech signals
FeatureSpacePlot ▪ FeatureSpacePlot3D ▪ FeatureExtractor ▪ Nearest ▪ ...
Neural Networks »NetModel — use pre-trained nets for speech analysis
NetEncoder ▪ "Audio" ▪ "AudioMFCC" ▪ "AudioMelSpectrogram" ▪ ...
NetTrain ▪ GatedRecurrentLayer ▪ LongShortTermMemoryLayer ▪ CTCLossLayer ▪ ...
Labeling & AnnotationsAudioAnnotate — annotate an audio object with result of analysis
AnnotationKeys ▪ AnnotationValue ▪ AnnotationDelete
Audio Manipulation »AudioTrim — extract an interesting part of a speech signal
AudioJoin ▪ AudioReplace ▪ LowpassFilter ▪ WienerFilter ▪ ...
Related Tech Notes Related Guides Related Links TopRetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4